This project involves performing exploratory data analysis (EDA) to understand the data and identify patterns in online retail sales. It starts with cleaning and pre-processing the dataset to make it refined.
The EDA component of this project focuses on understanding the structure and distribution of the data, identifying patterns, and visualizing key insights.
For a comprehensive overview of the entire project, including customer retention, purchase trends, CLV, and customer patronage forecast, please refer to the Online Retail Sales Data Analysis.
scripts/
: This directory contains the scripts for data cleaning and pre-processing, as well as the key insights including most valuable items and invoices, monthly revenue, loyal countries and customers, most purchased items and correlation plot- Data: Raw and cleaned data used for the EDA can be found in the
data/
directory of the primary repository.
For detailed documentation on the EDA process, see the EDA Documentation.
The conversation continues on Kaggle