ML_Project

Deplying Machine Learning Algorithms to predict the occurance of stroke in a person.

Problem Statement

Stroke is the second largest cause of mortality worldwide and remains an enormous health burden for individuals. Hypertension, heart illness, diabetes and dysregulation of glucose metabolism, atrial fibrillation, and lifestyle variables are some of the controllable risk factors of stroke. The objective of our project is to successfully predict a person’s likelihood of suffering a stroke based on potentially modifiable risk variables by applying machine learning methods to big data sets. This may be done by analyzing medical records using machine learning models to find patterns that are related to the risk of stroke.

About the dataset

The dataset we use consists of 5111 rows and 12 columns.
● Id : Unique Identifier.
● Gender : "Male", "Female" or "Other".
● Age : Age of the patient.
● Hypertension : 1 if the patient has hypertension or 0 if not.
● Heart_disease : 1 if the patient has a heart disease or 0 if not.
● Ever_married : "No" or "Yes".
● Work_type : "Never_worked", "Children", "Govt_job", "Private", “Self_emp".
● Residence_type : "Rural" or "Urban".
● Avg_glucose_level : Average glucose level in blood.
● Bmi : body mass index.
● Smoking_status : "Formerly smoked", "Never smoked", "Smokes" or "Unknown".
● Stroke : 1 if the patient will have a stroke or 0 if not.

ML Algorithms Used

Decision Trees
Random Forest
Logistic Regression
Naive Bayes
SVM
KNN
Neural Networks MLP

Libraries and Frameworks Needed

scikit-learn
numpy
mathplotlib
seaborn

SETUP

1.Download the repository usiing the "git clone" command.
2.Upload the "stroke_prediction.ipynb" file to google colab.
3.Upload the "healthcare-dataset-stroke-data.csv" file to the runtime.
4.Copy and Paste the relative path of the dataset to """ stroke_data=pd.read_csv('/healthcare-dataset-stroke-data.csv') """
5.Run the colab file.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
LICENSE		LICENSE
ML Lab - Report Team7.pdf		ML Lab - Report Team7.pdf
README.md		README.md
healthcare-dataset-stroke-data.csv		healthcare-dataset-stroke-data.csv
stroke_prediction.ipynb		stroke_prediction.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ML_Project

Problem Statement

About the dataset

ML Algorithms Used

Libraries and Frameworks Needed

SETUP

About

Releases

Packages

Languages

License

NIRMAL1508/STROKE-DISEASE-PREDICTION

Folders and files

Latest commit

History

Repository files navigation

ML_Project

Problem Statement

About the dataset

ML Algorithms Used

Libraries and Frameworks Needed

SETUP

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages