Skip to content

The objective from this project are to predict customer churn and provide recommendations to the business team

Notifications You must be signed in to change notification settings

archie-cm/Churn-Analysis-for-Bank-Customer

Repository files navigation

Classification - Churn For Bank Customer

Problem Statement

4

Goal & Obejctive

5

Tools: Python, JupyterLab, Git

Libraries: Pandas, Numpy, Feature-engine, Scikit-learn, Imbalanced-learn, SHAP-learn, Gain & Lift Analysis

Dataset: Predicting Churn for Bank Customers [source]

Summary of the analysis

  • This dataset has 10000 observations and 14 variables with 11 numerical variables, 3 categorical variables and one target variable.
  • All numerical variables have a right-skewed distribution and contain a lot of outliers.
  • Exited is the target variable that labels a 0 (not churn) and 1 (churn). The current condition is 20% of customer churn
  • From exploratory data analysis, customer who use num of products > 2 have trend churn, The older the customer, the higher the churn rate
  • Based on data characteristics, the selected algorithm to build a classification model is tree-based or ensemble. The classification model with the xgboost algorithm is able to correctly predict 75% of visitors who make a purchase.
  • Age, NumOfProduct, Gender Male, Geography France and IsActiveMember are the biggest impact on churn rate.
  • Percentage Saving cost with model have 69%

17

18

What I have learned

  • Framing the business problem.
  • Create a machine learning model and extract insight from it to make an actionable recommendation for the business team.
  • Make a business simulation from insights that decrease churn rate.

File Dictionaries

About

The objective from this project are to predict customer churn and provide recommendations to the business team

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published