Item Outlet Sales Influenced by Unique Factors

An analysis of how Item Outlet Sales relate to Item Type, Item MRP, Item Visibility, and other features

Author: Jonny Tesfahun

Business goal:

The goal of this project is to predict item sales at outlet stores.

Business problem:

How many Items does an Outlet sell at different Item MRPs?
How many Outlets are there of each Size?
What are the average Item MRP and Item Weight?
What are Item Outlet Sales by Outlet Size?
What are the average Item Sales by Outlet Type?
What is the Item MRP for each Item Fat Content category?
How do Outlet Sales compare across Outlet Location Tiers?
What are the Item MRP and Outlet Sales by Outlet Type?
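
Questions like these are typically answered with grouped aggregations. The sketch below is a minimal illustration, not the notebook's code: the rows are made up, and the column names (`Outlet_Identifier`, `Outlet_Size`, `Outlet_Type`, `Item_MRP`, `Item_Outlet_Sales`) are assumptions about how the dataset is labeled.

```python
import pandas as pd

# Hypothetical rows; column names and values are assumptions for illustration only.
df = pd.DataFrame({
    "Outlet_Identifier": ["OUT1", "OUT1", "OUT2", "OUT2", "OUT3", "OUT3"],
    "Outlet_Size": ["Small", "Small", "Medium", "Medium", "Small", "Small"],
    "Outlet_Type": [
        "Grocery Store", "Grocery Store",
        "Supermarket Type1", "Supermarket Type1",
        "Supermarket Type3", "Supermarket Type3",
    ],
    "Item_MRP": [50.0, 150.0, 100.0, 200.0, 120.0, 180.0],
    "Item_Outlet_Sales": [100.0, 300.0, 1500.0, 2500.0, 3000.0, 5000.0],
})

# Number of distinct outlets for each outlet size.
outlets_per_size = df.groupby("Outlet_Size")["Outlet_Identifier"].nunique()

# Average item sales for each outlet type.
avg_sales_by_type = df.groupby("Outlet_Type")["Item_Outlet_Sales"].mean()

# Average Item MRP across all rows.
avg_mrp = df["Item_MRP"].mean()

print(outlets_per_size)
print(avg_sales_by_type)
print(avg_mrp)
```

The same `groupby` pattern extends to the other questions by swapping the grouping column (for example, a location tier or fat content column) and the aggregated value.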

Data:

The source notebook is linked below:
https://github.com/Jonny-T87/Dojo-Projects/blob/main/Predict_Sales_ML_Project_1__Final.ipynb
It is an in-depth analysis for predicting sales of food items sold at various outlet stores.

Methods

Cleaned the data to remove errors and inconsistencies
Used histograms, boxplots, heatmaps, and pairplots for visual exploratory data analysis
Added interactive plots for closer analysis of the data
Applied machine learning for predictive analytics
Fit LinearRegression and RandomForestRegressor models to the data
Evaluated model performance using R2 score, Mean Squared Error (MSE), and Root Mean Squared Error (RMSE)
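
The modeling and evaluation steps above can be sketched roughly as follows. This is a minimal example on synthetic data, not the notebook's pipeline: the single price-like feature, the noise level, the split, and the hyperparameters are all assumptions chosen only to make the example self-contained.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (an assumption): a sales-like target that depends
# roughly linearly on one price-like feature, plus noise.
rng = np.random.default_rng(42)
X = rng.uniform(30, 270, size=(500, 1))
y = 15 * X[:, 0] + rng.normal(0, 800, size=500)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

models = {
    "LinearRegression": LinearRegression(),
    "RandomForestRegressor": RandomForestRegressor(n_estimators=100, random_state=42),
}

results = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    preds = model.predict(X_test)
    mse = mean_squared_error(y_test, preds)
    # RMSE is the square root of MSE, so it is in the same units as the target.
    results[name] = {"r2": r2_score(y_test, preds), "mse": mse, "rmse": np.sqrt(mse)}

for name, m in results.items():
    print(f"{name}: R2={m['r2']:.3f}  MSE={m['mse']:.1f}  RMSE={m['rmse']:.1f}")
```

Evaluating both models on the same held-out test split keeps the comparison fair; the numbers printed here come from synthetic data and are unrelated to the results reported below.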

Results

The linear regression model had the lowest RMSE (Root Mean Squared Error). Its RMSE of 1,094 means its predictions on the test data are typically off by about 1,094 dollars, while the Random Forest model's predictions are typically off by about 1,122 dollars.
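
RMSE is often read as an "average error in dollars", but it weights large misses more heavily than a plain average of absolute errors does. The small sketch below uses made-up numbers (not the project's data) and two hypothetical helper functions to show the difference.

```python
import math

def rmse(actual, predicted):
    """Root mean squared error: square root of the mean squared difference."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))

def mae(actual, predicted):
    """Mean absolute error: plain average of the absolute differences."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

# Hypothetical sales figures in dollars; the last prediction misses badly.
actual = [1000.0, 2000.0, 3000.0, 4000.0]
predicted = [1100.0, 1900.0, 3100.0, 2800.0]

print(f"RMSE: {rmse(actual, predicted):.1f}")
print(f"MAE:  {mae(actual, predicted):.1f}")
```

Here the single large miss pulls RMSE well above the mean absolute error, which is why an RMSE figure is best read as a typical error size rather than a literal average.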

Visual 1 - Outlet Sales by Outlet Locations

Outlet Sales in dollars by Outlet Location. Tier 2 and Tier 3 locations have substantially higher sales than Tier 1.

Visual 2 - Item Store Sales and Price by Outlet Type

Grocery Stores have lower Sales per Item MRP than the other outlet types. Overall, Supermarket Type 3 has the highest Sales per Item MRP.

Model

Two models (linear regression and Random Forest) were tested on the data set. Of the two, I determined that linear regression would be the best model to implement.

The most important metrics for this model were R2 (the coefficient of determination) and RMSE (Root Mean Squared Error).
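
For reference, the standard definitions of these metrics are given below. The notation is introduced here for illustration: $y_i$ is the actual sales value, $\hat{y}_i$ the predicted value, $\bar{y}$ the mean of the actual values, and $n$ the number of test rows.

```latex
\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2,
\qquad
\mathrm{RMSE} = \sqrt{\mathrm{MSE}},
\qquad
R^2 = 1 - \frac{\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2}{\sum_{i=1}^{n}\left(y_i - \bar{y}\right)^2}
```

An R2 of 1 means the predictions match the actual values exactly, while an R2 of 0 means the model does no better than always predicting the mean.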

Together, the model and these metrics could help address complex business problems, such as how increasing the price of one item can decrease sales of another.

Recommendations:

I recommend adding more features to the model so that predictions on the test data can be somewhat more accurate. This would also help reduce high bias and underfitting.

This model should be used because it explains about 56% of the variation in the sales data, which is a strong result given that the achievable maximum is only around that level due to the correlations in the data.

For further information

For any additional questions, please contact jonnymtesfahun@gmail.com
