Skip to content

This data set is created only for the learning purpose of the customer segmentation concepts , also known as market basket analysis . I will demonstrate this by using unsupervised ML technique (KMeans Clustering Algorithm) in the simplest form

Notifications You must be signed in to change notification settings

arienugroho050396/Mall-Customer-Segmentation-Unsupervised-ML

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 

Repository files navigation

This is an image here you can download the cheatsheet 🏬

Introduction

This data set is created only for the learning purpose of the customer segmentation concepts , also known as market basket analysis . I will demonstrate this by using unsupervised ML technique (KMeans Clustering Algorithm) in the simplest form.

Technical

  • Language : Python (filetype: .ipynb)

Content

You own the mall and want to understand the customers like who can be easily converge [Target Customers] so that the sense can be given to marketing team and plan the strategy accordingly.

Data Fields :

Variable Name Description
Customer ID Unique ID assigned to the customer
Gender Gender of the customer (Male/Female)
Age Age of the customer
Annual Income (k$) Annual Income of the customee
Spending Score Score assigned by the mall based on customer behavior and spending nature

What We Got From This Program

By the end of this case study , you would be able to answer below questions.

  • How to achieve customer segmentation using machine learning algorithm (KMeans Clustering) in Python in simplest way.
  • Who are your target customers with whom you can start marketing strategy.
  • How the marketing strategy works in real world.

Step Inside The Program

  • Exploratory Data Analysis (EDA)
  • Clustering (4 Variables)
  • Clustering (Age & Annual Income & Spending Score)
  • Clustering (Age & Annual Income)
  • Clustering (Annual Income & Spending Score)
  • Clustering (Age & Spending Score)

Conclusion

  • It seems very clear that there is no big difference between male and female customers, so a gender-based audience should not be chosen.
  • In addition, it seems that the audience between the ages of 20-40 spend more in this store compared to people in other age groups, making special campaigns for the audience between the ages of 20-40 can increase the profit of the supermarket.
  • This is not the optimal strategy, but it could be an alternative. Since the average spending scores of middle-income (40k-70k dollars) customers in this store are also at a medium level, it is difficult to increase their spending to higher levels because their income is not conducive to this, but by making campaigns to increase the number of these customers, the store can increase its profit by acquiring more middle-income customers.
  • I think the best strategy would be to target high-income customers. The reason is that some of the high-income customers spend high, while a significant portion of these customers spend low, there may be some things that low-spenders are not satisfied. Improvements to be made in service and quality can increase the spending of high-income customers who come to the store, but do not.
  • The distribution of the data was generally good, but the standard deviations were a little high and there was no significant positive correlation between the data, only a negative correlation between age and spending score that could be important, showing us that older people who choose this supermarket spend less money than people in other age groups.

About

This data set is created only for the learning purpose of the customer segmentation concepts , also known as market basket analysis . I will demonstrate this by using unsupervised ML technique (KMeans Clustering Algorithm) in the simplest form

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published