⛩️ Detecting Fraudulent Financial Transactions with ZenML

This repository contains Two's Solution to the ZenML Month of MLOps Competition.

The aim of this project is to develop a production-ready ML application for fraud detection using the ZenML MLOps framework. To train our fraud detection model, we make use of the "Synthetic data from a financial payment system" Dataset available on Kaggle.

📝 Solution Overview

This repository contains an end-to-end ML solution using ZenML, which covers the following responsiblities:

Importing the Dataset
Cleaning the data & engineering informative features
Detecting data drift of new data
Training a model to detect fraud on a transactional level
Evaluating the performance of the model
Deploying the model to a REST API endpoint
Providing an interface for users to interact with the model

To address these requirements, we built a Training Pipeline, which we used for experimentation, and a Continuous Deployment Pipeline, which extended the capabilities of the Training Pipeline to identify data drift in new data, train a model on all available data, and evaluate the performance of this model prior to deploying this to an API endpoint.

To enable the aforementioned pipelines, we made use of the following ZenML Stack:

Artifact Storage: Google Cloud Storage

Container Registry: Google Cloud Container Registry

Data Validator: EvidentlyAI

Experiment Tracker: MLFlow

Orchestrator: Google Kuberenetes Engine

Model Deployer: Seldon

🔧 Usage

There are a number of ways of interacting with the code in this repository:

Executing the Training & Continuous Deployment Pipelines
Running the Streamlit App
Running the Tests

Executing the Training & Continuous Deployment Pipelines

Ensure you have Python 3.9 installed on your machine
Install the development requirements:

~ $ pip install -r test-requirements.txt

Deploy and register the ZenML stack described in the Solution Overview
Create an .env file from the .env.example template
To execute the train pipeline:

~ $ python src/run_train_pipeline.py

To execute the deployment pipeline:

~ $ python src/run_deployment_pipeline.py

Running the Streamlit App

The Streamlit application entrypoint is the app.py file at the root of the repository. We have deployed this app to Streamlit Cloud.

To recreate the app on your local machine, you must:

Ensure you have Python 3.9 installed on your machine
Install the Streamlit requirements:

~ $ pip install -r requirements.txt

Create an .env file according to the .env.example template
Deploy the Streamlit application

~ $ streamlit run app.py

Running the Tests

Ensure you have Python 3.9 installed on your machine
Install the test requirements:

~ $ pip install -r test-requirements.txt

Execute tests using pytest

~ $ pytest

📁 Repository Structure

├── .github				<- CI Pipeline Definition
├── src
│   ├── pipelines			<- Pipeline Definition
│   │   ├── ...
│   ├── steps		  		<- Step Definitons
│   │   ├── ...
│   ├── util		 		<- Utility Definitions
│   │   ├── ...
│   ├── data_exploration.ipynb		<- Data Exploration Notebook
│   ├── feature_engineering.ipynb	<- Feature Engineering Experimentation Notebook
│   ├── run_deployment_pipeline.py	<- Deployment Pipeline Execution script
│   ├── run_train_pipeline.py		<- Training Pipeline Execution Script
├── tests
│   ├── util				<- Utility Function Tests
│   │   ├── ...
├── app.py 	   			<- Streamlit App
├── docker-requirements.txt 		<- Step Container Dependencies
├── notebook-requirements.txt 		<- Notebook Dependencies
├── requirements.txt   			<- Streamlit App Dependencies
├── test-requirements.txt 		<- Development Dependencies

Name		Name	Last commit message	Last commit date
Latest commit History 167 Commits
.github/workflows		.github/workflows
src		src
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
app.py		app.py
docker-requirements.txt		docker-requirements.txt
notebook-requirements.txt		notebook-requirements.txt
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
test-requirements.txt		test-requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

⛩️ Detecting Fraudulent Financial Transactions with ZenML

📝 Solution Overview

🔧 Usage

Executing the Training & Continuous Deployment Pipelines

Running the Streamlit App

Running the Tests

📁 Repository Structure

🧑‍💻 Competition Participants

About

Releases

Packages

Contributors 4

Languages

two-inc/zenml-competition

Folders and files

Latest commit

History

Repository files navigation

⛩️ Detecting Fraudulent Financial Transactions with ZenML

📝 Solution Overview

🔧 Usage

Executing the Training & Continuous Deployment Pipelines

Running the Streamlit App

Running the Tests

📁 Repository Structure

🧑‍💻 Competition Participants

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages