WhizML-CVD_Analysis

This repository illustrates an application of the WhizML codebase for an analysis of cardiovascular disease risk.

About the Dataset

The dataset used is the Cardiovascular Diseases Risk Prediction dataset obtained from Kaggle.

Auto-EDA

Running the eda pipeline will launch the following Auto-EDA dashboard, allowing the users to observe the dataset.

Data Preprocessing

Users can implement custom functions to preprocess the data. In our case, the preprocessing codes can be found in Data_Preprocessing.ipynb, inside the notebooks directory.

ML Models Experimentation

The model_experimentation triggered the training of various Logistic Regression, Random Forest, and XGBoost models.

Model Explainability

Model explainability can be further explored using the model_explainability pipeline.

Bias Analysis

Bias analysis can be performed using the Aequitas web app, with the data provided by using the bias_analysis_data_prep pipeline.

Data Drift Analysis

As new data is obtained, drift detection can be performed using the data_drift_analysis pipeline.

Note: To create a hypothetical example, some rows were sampled from the original dataset and were assumed to be the new data.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
images		images
notebooks		notebooks
src		src
.DS_Store		.DS_Store
.Rhistory		.Rhistory
LICENSE		LICENSE
README.md		README.md
config.yml		config.yml
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WhizML-CVD_Analysis

About the Dataset

Auto-EDA

Data Preprocessing

ML Models Experimentation

Model Explainability

Bias Analysis

Data Drift Analysis

About

Releases

Packages

Languages

License

Gianatmaja/WhizML-CVD_Analysis

Folders and files

Latest commit

History

Repository files navigation

WhizML-CVD_Analysis

About the Dataset

Auto-EDA

Data Preprocessing

ML Models Experimentation

Model Explainability

Bias Analysis

Data Drift Analysis

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages