A comprehensive project for analyzing and estimating the prices of chocolates based on their quality and company ratings. The project includes data preprocessing, price estimation, and identification of top chocolate companies. Ideal for understanding data manipulation, visualization, and analysis using Python.
data_preprocessing.py
: Preprocesses the chocolate dataset, including cleaning column names, converting data types, and visualizing the distribution of cocoa percentages.price_estimation.py
: Estimates the prices of chocolates based on their cocoa content and ratings, and adjusts prices for chocolates with specific bean types.company_analysis.py
: Analyzes the ratings of different chocolate companies and identifies the top companies based on their average ratings from recent years.