Skip to content

Latest commit

 

History

History
73 lines (50 loc) · 4.18 KB

File metadata and controls

73 lines (50 loc) · 4.18 KB

Cohort Analysis and Customer Segmentation

Table of Content

Introduction

An e-commerce giant wants to improve its product offerings, and customer relations and also maximize profit. To achieve this, the company wants to calculate the percentage of customers it's able to retain every month. The company also needs to segment its customers for target marketing and promotions.

Problem Statment

  1. Calculate the percentage of customers the company is able to retain every month.
  2. Segment the customers based on their behaviour. The behaviour under consideration includes customers' recent transactions with the company, the frequency of purchases, and how much each customer spends on the company products.
  3. Analyze customer segments based on Recency, Frequency, and Monetary value.

Objectives

The objectives of the project are:

  1. To calculate customer retention rate.
  2. To segment the customers based on their behaviour.
  3. To analyze customer segments based on Recency, Frequency, and Monetary value with Power BI.

Metrics Used

Recency (R) refers to the days elapsed since the last purchase. It is calculated by deducting the date that the customer made the last purchase from the offset date (the assumed date when the analysis was carried out).

Frequency (F) refers to the total number of transactions a customer has made with the company. It represents how often a customer purchases a product/product from the company. It is calculated by counting the number of complete purchases a customer has made with the company.

Monetary (M) refers to the sum of a customer's monies on the company's products. It is calculated by multiplying the number of products by the unit price of products.

Tools Used

The project was implemented using Python programming language and the following software and packages (libraries) were used:

Softwares

  • Jupyter Notebook
  • Power BI

Libraries

  • Sickit-Learn
  • Pandas
  • NumPy
  • Matplotlib
  • Seaborn

Project Details

  • Cohort Analysis: Time-based cohort analysis was performed to calculate and visualize the retention rate for each cohort. This involves grouping the data frame by "CohortMonth" and "CohortIndex" columns, selecting the first column storing it to cohort_sizes and then dividing the cohort count by cohort sizes along the rows.
  • Customer Segmentation: The customers were segmented based on their behaviour which includes recency, frequency, and monetary value. RFM analysis was performed to determine the customer segments. The data was visualized using a heatmap to show the relationship between recency, frequency, and monetary value. To determine each segment in terms of Platinum, Gold, Bronze, and Silver, the metric values were clustered and visualized using a snake plot.
  • Power BI Customer Segmentation Analysis: With the help of Power BI, the segmented customer base Platinum , Gold , Bronze , and Silver was further analyzed in order to gain a deeper understanding of the interactions between distinct clusters and the company through an interactive dashboard.

Conclusion

The project has been successful in calculating customer retention rate, segmenting customers based on their behaviour, and analyzing customer segments based on Recency, Frequency, and Monetary value. The results obtained from the project can be used to make informed product decisions that will reduce churn and drastically increase revenue.