Cardiovascular Disease Prediction

Problem Definition:

Heart disease has become very common, likely owing to factors such as lifestyle, work pressure, and lack of exercise.
We are given a set of variables that affect heart disease. They cover demographics, behaviour, and past and current medical history; each attribute is a potential risk factor.
The classification goal is to predict whether a patient has a 10-year risk of future coronary heart disease (CHD).

Data Dictionary:

The dataset is taken from an ongoing cardiovascular study on residents of the town of Framingham, Massachusetts.

Demographic:

• Sex: male or female ("M" or "F")

• Age: age of the patient (Continuous: the recorded ages are truncated to whole numbers, but the concept of age is continuous)

Education:

1 - Higher Secondary, 2 - Graduate, 3 - Post Graduate, 4 - Doctorate or PhD

Behavioural:

• is_smoking: whether or not the patient is a current smoker ("YES" or "NO") 

• Cigs Per Day: the number of cigarettes the person smoked on average per day (can be considered continuous, since a person can smoke any number of cigarettes, even half a cigarette)
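Because Sex and is_smoking are stored as strings ("M"/"F" and "YES"/"NO"), most models need them converted to numbers first. The following is a minimal sketch of one way to do that, not necessarily how the notebook does it. The key names `sex`, `is_smoking`, and `cigs_per_day` are assumptions for illustration, and the real CSV header may differ.

```python
# Hypothetical key names; the real column names in the CSV may differ.
SEX_CODES = {"M": 1, "F": 0}
SMOKING_CODES = {"YES": 1, "NO": 0}


def encode_row(row):
    """Return a copy of `row` with the two string-coded fields mapped to 0/1."""
    encoded = dict(row)
    # Normalise whitespace and case before the lookup so " yes" also works.
    encoded["sex"] = SEX_CODES[row["sex"].strip().upper()]
    encoded["is_smoking"] = SMOKING_CODES[row["is_smoking"].strip().upper()]
    return encoded


# Made-up record for illustration only; not taken from the dataset.
example = {"sex": "F", "is_smoking": "YES", "cigs_per_day": "10"}
print(encode_row(example))
```

An unknown code raises a `KeyError` here, which surfaces unexpected values early instead of silently mapping them to a default.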

Medical (history):

• BP Meds: whether or not the patient was on blood pressure medication (Nominal)

• Prevalent Stroke: whether or not the patient had previously had a stroke (Nominal)

• Prevalent Hyp: whether or not the patient was hypertensive (Nominal)

• Diabetes: whether or not the patient had diabetes (Nominal)

Medical (current):

• Tot Chol: total cholesterol level (Continuous)

• Sys BP: systolic blood pressure (Continuous)

• Dia BP: diastolic blood pressure (Continuous)

• BMI: Body Mass Index (Continuous)

• Heart Rate: heart rate (Continuous: in medical research, variables such as heart rate, though discrete, are treated as continuous because they take a large number of possible values)

• Glucose: glucose level (Continuous)

Predict variable (desired target):

• 10-year risk of coronary heart disease, CHD (binary: "1" means "Yes", "0" means "No"); this is the dependent variable (DV)
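With a binary target it is worth checking how many patients fall in each class before training, since a skewed split affects which metrics are meaningful. The sketch below computes the share of positive labels; the function name `chd_rate` and the sample labels are made up for illustration and are not taken from the dataset.

```python
from collections import Counter


def chd_rate(labels):
    """Return the fraction of patients labelled 1 (10-year CHD risk)."""
    counts = Counter(labels)
    # The target is documented as binary, so reject anything else.
    unexpected = set(counts) - {0, 1}
    if unexpected:
        raise ValueError(f"unexpected target values: {sorted(unexpected)}")
    total = counts[0] + counts[1]
    return counts[1] / total if total else 0.0


# Made-up labels for illustration only.
sample_labels = [0, 0, 1, 0, 1, 0, 0, 0]
print(chd_rate(sample_labels))  # 0.25
```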

Models Used:

Logistic Regression
Random Forest
Support Vector Machine
XGBoost
Voting and stacking classifier
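To illustrate the idea behind a voting classifier, the sketch below combines the outputs of several base models for a single patient in two ways: hard voting (majority of predicted labels) and soft voting (average of predicted probabilities). It is a plain-Python illustration, not the notebook's implementation; the function names and the example numbers are hypothetical. Stacking, which trains a further model on the base models' outputs, is not shown.

```python
from collections import Counter


def hard_vote(predictions):
    """Majority vote over per-model class labels (0 or 1) for one patient."""
    return Counter(predictions).most_common(1)[0][0]


def soft_vote(probabilities, threshold=0.5):
    """Average the per-model probabilities of class 1, then threshold."""
    mean_prob = sum(probabilities) / len(probabilities)
    return int(mean_prob >= threshold)


# Made-up outputs of three hypothetical base models for one patient.
labels = [1, 0, 0]
probs = [0.90, 0.45, 0.40]

print(hard_vote(labels))  # 0: two of the three models vote "No"
print(soft_vote(probs))   # 1: the mean probability is about 0.583
```

The two rules can disagree, as they do here, because soft voting lets one confident model outweigh two marginal ones. Using an odd number of base models avoids ties in the hard vote. In practice, scikit-learn's `VotingClassifier` and `StackingClassifier` provide these ensembles.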


PulkitGigoo99/Cardiovascular_Disease_Prediction

Folders and files

Latest commit

History

Repository files navigation

Cardiovascular Disease Prediction

Problem Definition:

Data Dictionary:

Demographic:

Education :

Behavioural:

Medical (history):

Medical(current):

Predict variable (desired target):

Models Used:

Final Model:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages