Predicting the Hyper-local Prevalence of Chronic Kidney Disease

Chronic kidney disease (CKD) has been on the rise in recent years and is a major cause of mortality and health expenditure in the United States. This project uses 235 features extracted from the U.S. Census Bureau to test whether hyper-local rates of CKD can be determined using readily available demographic data. These features include data on age, sex, marital status, disability, employment, profession, household type, housing costs, and type of insurance. Regression and ensemble methods were used to predict rates of chronic kidney disease. Ultimately, gradient boosted decision trees proved to be the best prediction model with a predictive accuracy of 83.94% (adjusted R²).

The purpose of this project was to assist federal, state, and local public health agencies and organization to improve targeting of public health campaigns related to chronic kidney disease prevention. The predictive model helps to accomplish this goal by allowing limited resources to be targeted to neighborhoods with the greatest need for intervention.

Data Sources

Final Report: A summary of the project process, results, and actionable insights.
Slide Deck: Used for presenting findings
Notebooks: These were used in the following order to create the code base for this project.
1. Data Wrangling: collecting, organizing, and cleaning datasets
2. Data Storytelling: using exploratory data analysis to tell a story about the data
3. Exploratory Data Analysis: exploring the data for initial insights, correlations, and possibly important features
4. Regression Analysis: using various regression and ensemble methods to predict CKD prevalence
Reports: These reports were written to track progress and explain the process throughout the project.
Images: All saved plot and map outputs

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
img		img
notebooks		notebooks
reports		reports
.gitignore		.gitignore
Capstone_1_Report.pdf		Capstone_1_Report.pdf
Predicting_CKD_XGB.ipynb		Predicting_CKD_XGB.ipynb
Predicting_CKD_slidedeck.pdf		Predicting_CKD_slidedeck.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predicting the Hyper-local Prevalence of Chronic Kidney Disease

Table of Contents

About

Releases

Packages

Languages

TheeChris/predicting_ckd

Folders and files

Latest commit

History

Repository files navigation

Predicting the Hyper-local Prevalence of Chronic Kidney Disease

Table of Contents

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages