Data Scientist Infrastructures

This project sets up a data scientist workbench with :

MLFlow for experiment tracking and model asset management;
PostgreSQL for a SQL engine and to serve as a backend for MLFlow;
MinIO to mimic S3 and act as an artifact and data store;
Jupyterlab as an EDA environment.

Prerequisites

Install and Configure WSL 2:
- Ensure that WSL 2 is installed and properly configured on your system.
- You can follow the Microsoft instructions to install WSL 2 here.
Install Docker Desktop for Windows:
- Download and install Docker Desktop from the official Docker website.
Configure Docker to Use WSL 2:
- Open Docker Desktop and go to the settings.
- Under the "General" tab, ensure that "Use the WSL 2 based engine" is checked.
- Under the "Resources" > "WSL Integration" tab, ensure that your WSL 2 distribution is checked.
- Under the "General" tab, check "Expose daemon on tcp://localhost:2375 without TLS".

Setup

Clone the Repository

git clone https://github.com/sawadogosalif/DS-backbone.git
cd DS-backbone

Configure Environment Variables Update a default.env file in the root directory with the following variables as you want.

Build and Run Services

docker-compose --env-file default.env up -d

Usage if use reverse proxy NGINX

JupyterLab: http://sawalle.ds.notebooks
MLflow: http://sawalle.ds.mlflow
MinIO: http://sawalle.ds.s3

Usage if use localhost

JupyterLab: http:localhost:8888
MLflow: http:localhost:5555
MinIO: http:localhost:9000 Additionally, in the file notebooks/tracking_example.py, we demonstrate how to use MLflow efficiently.

Services

PostgreSQL

Image: postgres:11
Port: 5432

MinIO (S3)

Image: minio/minio:RELEASE.2020-12-18T03-27-42Z
Port: 9000

MLflow

Port: 5000

JupyterLab

Port: 8888
Image: jupyter/datascience-notebook:latest

Nginx

Port: 80
Image: nginx:1.25.5

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Docker		Docker
app		app
docs		docs
jupyterlab		jupyterlab
mlflow		mlflow
nginx		nginx
notebooks		notebooks
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
credentials.env		credentials.env
default.env		default.env
docker-compose.yml		docker-compose.yml
how-to-github.md		how-to-github.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Scientist Infrastructures

Prerequisites

Setup

Usage if use reverse proxy NGINX

Usage if use localhost

Services

PostgreSQL

MinIO (S3)

MLflow

JupyterLab

Nginx

About

Releases

Packages

Languages

License

sawadogosalif/DS-backbone

Folders and files

Latest commit

History

Repository files navigation

Data Scientist Infrastructures

Prerequisites

Setup

Usage if use reverse proxy NGINX

Usage if use localhost

Services

PostgreSQL

MinIO (S3)

MLflow

JupyterLab

Nginx

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages