Skip to content

It contains a predictive machine learning model for a standard housing data

Notifications You must be signed in to change notification settings

Yas2020/Regression

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 

Repository files navigation

This notebook contains EDA and modeling details for housing price prediction on Ames Housing dataset with 79 explanatory variables describing (almost) every aspect of residential homes in Ames, Iowa. This dataset is an incredible alternative for a modernized and expanded version of the often cited Boston Housing dataset. First data is downloaded from Kaggle, missing data is handled, categorical data is encoded and then, some features with very low variances are removed. Next, some basic new features are created or substituted old features. After that, more feature extraction is performed using supervised dimensionality reduction followed by a KNN to produce a new feature for the final XGBOOST model. To have a more stable final model, an forest XGBOOST (of 20 parallel trees at each boosting step) is trained and finally the predictions are submitted to Kaggle platform. At the time of submission, the ranking was in the top 4%.

About

It contains a predictive machine learning model for a standard housing data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published