CS611 ML Engineering: Airbus Ship Detection with U-Net and Vertex AI Pipeline

This is the git repo for CS611 ML Engineering project for airbus ship detection.

Group 14 Members:

Wong Songhan
Koh Enyong
Arnold Ng
Gabriel Quek

Problem Statement

In this project, we tackle the problem of identifying ships in satellite images. We recognize 3 main applications for this problem:

Maritime Traffic Management – Improves general situational awareness, especially for small vessels not covered by AIS
Maritime Surveillance & Policing – For detection and tracking of vessels with AIS turned off, which may be engaged in illegal activity
Naval Warfare – An additional source of intelligence for detecting enemy locations

Dataset

The dataset was retrieved from Kaggle based on the Attributes of the dataset:

192,556 images from Airbus Ship Detection Challenge
Each image may have multiple ships
Labels are run-length encoded (RLE), for data compression, need to be converted to single channel image

Visit this Kaggle page for more info

https://www.kaggle.com/c/airbus-ship-detection

Pipeline

Below are the components of our entire pipeline:

1. EDA / Experimentation

We interactively approach the model building and exploration based on the input dataset. Understanding the dataset and problem well before training and building of our model and their respective components.

2. Data Ingest

Due to the complexity of the input dataset and problem itself, preprocessing of the input data is essential to provide good input data for our pipeline.

3. Data Statistics Generation

In this section, we create a component that computes the data statistics.

4. Model Training

Building of model training component that is used by the overall pipeline to be deployed and part of the CI/CD process that retrains the model based on certain triggers.

5. Model Evaluation

Component building of evaluation. Evaluation of the output trained model is conducted. Metrics will be output.

6. Model Deployment

Model is deployed to Vertex AI that is used to serve endpoint.

7. Pipeline Deployment

Stringing together of the pipeline, alongside test components that ensures every component in the pipeline is in order before pushing it to the Vertex AI platform.

8. Model Monitoring

Using the data statistics generated from Step (3), this notebook is used to aassess new data for train-serve drift.

9. Model Serving

This notebook provides a demo of calling RESTful api from Endpoint which returns a model prediction result given an input image.

Overall Pipeline (deployed on Vertex AI)

Project Organization

├── LICENSE
├── README.md          <- The top-level README
├── build 
├── config             <- config file for GCP resource
├── provision          <- terraform config for GCP resource startup  
├── Dockerfile         <- Docker file for custom model trainer
├── saved_models       <- Trained and serialized model data (for exploratory)
│
├── references         <- Data dictionaries, manuals, and all other explanatory materials.
│
├── reports            <- Generated analysis as HTML, PDF, LaTeX, etc.
│   └── figures        <- Generated graphics and figures to be used in reporting
│
├── src                <- Source code for use in this project.
│   ├── __init__.py    <- Makes src a Python module
│   │
│   ├── evaluation     <- Scripts to generate model evaluation component
│   │   └── eval_component.py
│   │
│   │
│   ├── model_training  <- Scripts for custom model training
│   │   
│   │── models  <- Preprocessing scripts  
│   │   
│   └── utils  <- Common util scripts for data ingest and pre-processing
│       └── common.py
│       └── dataset.py
│
└── tox.ini            <- tox file with settings for running tox; see tox.readthedocs.io

Project based on the cookiecutter data science project template. #cookiecutterdatascience

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

CS611 ML Engineering: Airbus Ship Detection with U-Net and Vertex AI Pipeline

Group 14 Members:

Problem Statement

Dataset

Pipeline

1. EDA / Experimentation

2. Data Ingest

3. Data Statistics Generation

4. Model Training

5. Model Evaluation

6. Model Deployment

7. Pipeline Deployment

8. Model Monitoring

9. Model Serving

Overall Pipeline (deployed on Vertex AI)

Project Organization

Files

README.md

Latest commit

History

README.md

File metadata and controls

CS611 ML Engineering: Airbus Ship Detection with U-Net and Vertex AI Pipeline

Group 14 Members:

Problem Statement

Dataset

Pipeline

1. EDA / Experimentation

2. Data Ingest

3. Data Statistics Generation

4. Model Training

5. Model Evaluation

6. Model Deployment

7. Pipeline Deployment

8. Model Monitoring

9. Model Serving

Overall Pipeline (deployed on Vertex AI)

Project Organization