Overview

Deploying a Machine Learning Model on Heroku with FastAPI

Training a Random Forest classification model to predict the income category of a person based on other personal informations.

Created unit tests to monitor the model performance on various slices of the data.
Then, deployed the model using the FastAPI package and create API tests.
Both the slice-validation and the API tests were incorporated into a CI/CD framework using GitHub Actions.
UCI census datasets was used to experience updating the dataset and model in git and DVC.

Repositories

Link to the project repo

Data

The training data is the census data available at the UCI library. It is the adult.income data from the data folder.
Link: UCI Census Data

This data versioning is tracked through DVC using AWS S3 bucket as remote storage.

Model

A basic Random Forest classifier imported from scikit-learn library and fit onto the census data Model parameters are(other than default): { "random_state": 8, "max_depth": 16, "n_estimators":128 }

Refer to the - model card

The model versioning was tracked using dvc.
Also, the performance of the model was evaluated on a slice of data (code). The results are stored in the slice_output.txt for slices done on education and race.

To run the model trainer, evaluation code (link):

python main.py

Github action

Continuous integration was incorporated in the project using the Github actions. The action was completed only if the pytest and flake8 linter tests passed on the project without any error.

Unit tests

Unit tests were written for the model training and the inference API features. The tests are done using the pytest library via command:

pytest test/ -vv

API Creation

A FastAPI framework was developed for the inference API using the input type-hints example from pydantic library The API main file is the inference_api.py

API Deployment

The API was deployed on Heroku using the main branch of the current GitHub repository with Continuous Delivery enabled.

Name		Name	Last commit message	Last commit date
Latest commit History 77 Commits
.dvc		.dvc
.github/workflows		.github/workflows
data		data
model		model
screenshots		screenshots
starter		starter
test		test
.dvcignore		.dvcignore
.gitignore		.gitignore
.slugignore		.slugignore
Aptfile		Aptfile
LICENSE.txt		LICENSE.txt
Procfile		Procfile
README.md		README.md
dvc_on_heroku_instructions.md		dvc_on_heroku_instructions.md
eda.ipynb		eda.ipynb
inference_api.py		inference_api.py
main.py		main.py
model_card_template.md		model_card_template.md
request_app.py		request_app.py
requirements.txt		requirements.txt
runtime.txt		runtime.txt
setup.py		setup.py
slice_output.txt		slice_output.txt
test_slice.py		test_slice.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Repositories

Data

Model

Github action

Unit tests

API Creation

API Deployment

About

Releases

Packages

Languages

License

VineetKT/ML_fastapi_on_Heroku_CI-CD

Folders and files

Latest commit

History

Repository files navigation

Overview

Repositories

Data

Model

Github action

Unit tests

API Creation

API Deployment

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages