Impute-missing-data-with-KNNImputer-and-IterativeImputer

When signaficant amount of data are missing, what can we do? Impute the missing data with mean or median? That will be a diaster. Actually, Scikit-Learn provides two powerful imputers, KNNImputer and IterativeImputer. The former imputes missing data using the mean value from n_neighbors nearest neighbors found in the training set, and the latter is inspired by R's MICE package and imputes missing values by modeling each feature with missing values as a function of other features in a round-robin fashion.

Before imputation, there are significant amount of "Cost", a few "Weight", and many "Ingredient Number" data missing in the dataset.

After imputation, all the columns are filled.

Let's have a look at the imputation effect. Amazing!

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Impute missing data with KNNImputer and IterativeImputer.ipynb		Impute missing data with KNNImputer and IterativeImputer.ipynb
README.md		README.md
data.csv		data.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Impute-missing-data-with-KNNImputer-and-IterativeImputer

About

Releases

Packages

Languages

hanfei1986/Impute-missing-data-with-KNNImputer-and-IterativeImputer

Folders and files

Latest commit

History

Repository files navigation

Impute-missing-data-with-KNNImputer-and-IterativeImputer

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages