here you can download the cheatsheet 🏬
This data set is created only for the learning purpose of the customer segmentation concepts , also known as market basket analysis . I will demonstrate this by using unsupervised ML technique (KMeans Clustering Algorithm) in the simplest form.
- Language : Python (filetype: .ipynb)
You own the mall and want to understand the customers like who can be easily converge [Target Customers] so that the sense can be given to marketing team and plan the strategy accordingly.
Variable Name | Description |
---|---|
Customer ID |
Unique ID assigned to the customer |
Gender |
Gender of the customer (Male/Female) |
Age |
Age of the customer |
Annual Income (k$) |
Annual Income of the customee |
Spending Score |
Score assigned by the mall based on customer behavior and spending nature |
By the end of this case study , you would be able to answer below questions.
- How to achieve customer segmentation using machine learning algorithm (KMeans Clustering) in Python in simplest way.
- Who are your target customers with whom you can start marketing strategy.
- How the marketing strategy works in real world.
- Exploratory Data Analysis (EDA)
- Clustering (4 Variables)
- Clustering (Age & Annual Income & Spending Score)
- Clustering (Age & Annual Income)
- Clustering (Annual Income & Spending Score)
- Clustering (Age & Spending Score)
- It seems very clear that there is no big difference between male and female customers, so a gender-based audience should not be chosen.
- In addition, it seems that the audience between the ages of 20-40 spend more in this store compared to people in other age groups, making special campaigns for the audience between the ages of 20-40 can increase the profit of the supermarket.
- This is not the optimal strategy, but it could be an alternative. Since the average spending scores of middle-income (40k-70k dollars) customers in this store are also at a medium level, it is difficult to increase their spending to higher levels because their income is not conducive to this, but by making campaigns to increase the number of these customers, the store can increase its profit by acquiring more middle-income customers.
- I think the best strategy would be to target high-income customers. The reason is that some of the high-income customers spend high, while a significant portion of these customers spend low, there may be some things that low-spenders are not satisfied. Improvements to be made in service and quality can increase the spending of high-income customers who come to the store, but do not.
- The distribution of the data was generally good, but the standard deviations were a little high and there was no significant positive correlation between the data, only a negative correlation between age and spending score that could be important, showing us that older people who choose this supermarket spend less money than people in other age groups.