GitHub - 10-kp/cust_churn_project: This project focuses on analyzing customer churn data from a telecom company. The analysis involves various stages of data manipulation, visualization, and machine learning to predict whether a customer is likely to churn based on their demographic information, service usage, and billing details.

Overview

This project aims to predict customer churn for a telecom company using various machine learning models. Customer churn occurs when customers stop doing business with a company, and predicting churn helps companies take proactive measures to retain customers.

Dataset

The dataset contains customer information such as demographic details, services subscribed, billing information, and whether the customer churned or not. Key columns include:

customerID: Unique identifier for each customer.
gender: Gender of the customer.
SeniorCitizen: Whether the customer is a senior citizen (1 for Yes, 0 for No).
Partner: Whether the customer has a partner.
Dependents: Whether the customer has dependents.
tenure: Number of months the customer has stayed with the company.
PhoneService: Whether the customer has a phone service.
InternetService: Type of internet service (DSL, Fiber optic, No).
Churn: Target variable indicating if the customer churned (Yes/No).

Project Structure

data/: Directory containing the dataset (customer_churn.csv).
notebooks/: Jupyter notebooks with data exploration, preprocessing, and model training.
models/: Directory to save trained models.

Installation

To get started with this project, follow these steps:

Clone the repository:

git clone (https://github.com/10-kp/cust_churn_project)
cd customer-churn-prediction

Install the required packages:

pip install -r requirements.txt

Run the Jupyter notebook:

jupyter notebook notebooks/Customer_Churn_Analysis.ipynb

Usage

Data Preprocessing:
- Handle missing values.
- Convert categorical variables into numeric form using Label Encoding.
- Check for multicollinearity using Variance Inflation Factor (VIF) and remove highly correlated features.
Model Building:
- Split the dataset into training and testing sets.
- Train and evaluate Logistic Regression, Decision Tree, and Random Forest models.
- Measure model performance using accuracy scores.
Feature Engineering:
- Experiment with feature selection and engineering to improve model performance.
Evaluation:
- Evaluate the models using accuracy, confusion matrix, and other relevant metrics.
- Compare the performance of different models.

Results

Logistic Regression: Achieved an accuracy of 76.7%.
Decision Tree: Achieved an accuracy of 73.0%.
Random Forest: Achieved an accuracy of 76.8%.

The Random Forest model performed the best, making it a strong candidate for predicting customer churn.

Future Work

Experiment with more advanced models like XGBoost.
Perform hyperparameter tuning to further improve model performance.
Analyze feature importance to understand which features contribute most to churn prediction.

Contributing

Contributions are welcome! If you have suggestions or improvements, feel free to create a pull request or open an issue.

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Acknowledgements

[Pandas Documentation] (https://pandas.pydata.org/docs/index.html)
[Scikit-learn Documentation] (https://scikit-learn.org/stable/)
[Seaborn Documentation] (https://seaborn.pydata.org/)

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Dataset		Dataset
Cust_Churn_Proj.ipynb		Cust_Churn_Proj.ipynb
MIT-LICENSE.txt		MIT-LICENSE.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Dataset

Project Structure

Installation

Usage

Results

Future Work

Contributing

License

Acknowledgements

About

Releases

Packages

Languages

License

10-kp/cust_churn_project

Folders and files

Latest commit

History

Repository files navigation

Overview

Dataset

Project Structure

Installation

Usage

Results

Future Work

Contributing

License

Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages