This project is the final result of my journey to become Data Scientist. Here, we present a deliverable solution for managing the products of their stores of a company according to its requets.
Requirements:
- To analize the state of each store, the sales trends of the products, and their evolution,possibly by Country.
- Identify similar groups of products; how many exist?
- Forecast of sales.
- Forecast inventory supply.
- We received 3 datasets with information of company(item_prices, item_sales, daily_calendar_with_events)
- Check and clean the datasets, understand the variables and analyze the data.(Worked with
python
using libraries likenumpy
,pandas
,matplotlib
,seaborn
inside ofVisual Studio Code
yJupiter notebooks
) - Preprocessing to build the main dataset; here we created a dataset to show the visualization report and other dataset for clustering.
- Worked in clustering. Used
K-means
. - Choosed the correct model for prediction,
ARIMA
vsskforecast
ofSklearn
.
- Worked in visualization report.
- Make prediction with model choose.
- Worked in storytelling about results.
We develop a dashboard report in Power Bi
showing the stores´ behavior and busines.