Customer Segmentation - Arvato Bertelsmann

Installation

All the necessary libraries to run the code were already available in the Anaconda distribution of Python, except:

scikit-optimize, which can be installed through !pip install scikit-optimize;
lightgbm, which can be installed through !pip install lightgbm.

This script was written using Python version 3.*.

Project Motivation

This was the chosen project to be developed as the Capstone Project for the Udacity Data Scientist Nanodegree. I am passionate about data, and especially data related to people somehow.

Customer Segmentation is all about understanding people, how they behave, what they like, how they think, and so on. I really enjoy putting together the pieces of this puzzle.

Besides that, I felt like it would be a great challenge to work on real data, giving me the chance to overcome the problems that come up in the daily life of any data scientist.

File Descriptions

Notebook - Jupyter Notebook with the script containing the whole solution of the project.
joblib files (azdias, customers, train, and test) - compacted versions of the datasets used in this project.

Results

All the process as well as the results and findings were documented in this Medium post.

Licensing, Authors, Acknowledgements

Credits must be given to the Arvato Bertelsmann company for providing the data, and also Udacity for proposing this amazing Capstone Project.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Customer Segmentation - Arvato Bertelsmann

Table of Contents

Installation

Project Motivation

File Descriptions

Results

Licensing, Authors, Acknowledgements

Files

README.md

Latest commit

History

README.md

File metadata and controls

Customer Segmentation - Arvato Bertelsmann

Table of Contents

Installation

Project Motivation

File Descriptions

Results

Licensing, Authors, Acknowledgements