Project-ML-Data-Preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

Fulfilled Key Components:

Data Exploration:

Explore the data, list down the unique values in each feature and find its length. Perform the statistical analysis and renaming of the columns.

Data Cleaning:

Find the missing and inappropriate values, treat them appropriately. Remove all duplicate rows. Find the outliers. Replace the value 0 in age as NaN Treat the null values in all columns using any measures(removing/ replace the values with mean/median/mode)

Data Analysis:

Filter the data with age >40 and salary<5000 Plot the chart with age and salary Count the number of people from each place and represent it visually

Data Encoding:

Convert categorical variables into numerical representations using techniques such as one-hot encoding, label encoding, making them suitable for analysis by machine learning algorithms.

Feature Scaling:

After the process of encoding, perform the scaling of the features using standardscaler and minmaxscaler.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Employee.csv		Employee.csv
LICENSE		LICENSE
Project_ML_Data_Preprocessing.ipynb		Project_ML_Data_Preprocessing.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project-ML-Data-Preprocessing

Fulfilled Key Components:

Data Exploration:

Data Cleaning:

Data Analysis:

Data Encoding:

Feature Scaling:

About

Releases

Packages

Languages

License

aneeshmurali-n/Project-ML-Data-Preprocessing

Folders and files

Latest commit

History

Repository files navigation

Project-ML-Data-Preprocessing

Fulfilled Key Components:

Data Exploration:

Data Cleaning:

Data Analysis:

Data Encoding:

Feature Scaling:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages