Pet Product Subcategorization by Review Analysis
20,403 pet products were subcategorized into the three-level nested categories (Animal category, Big category, and Small category) by hierarchical agglomerative clustering. This project includes data acquisition, data preprocessing, EDA, and clustering.
The result of this project can be used for product classifications on the website of an e-commerce company and for extracting a specific group of products to analyze them closely; for example, the variety, popular products in the group, consumer needs, and others.
☆ Preprocessing: CapstoneProject1_codes_preprocessing.ipynb
☆ Clustering: CapstoneProject1_codes_clustering.ipynb
Project proposal: CapstoneProject1_ProjectProposal.pdf
Data Wrangling: CapstoneProject1_DataWranglingReport.pdf
Statistical Analysis: CapstoneProject1_StatisticalAnalysisReport.pdf
Milestone report: CapstoneProject1_MilestoneReport.pdf
In-Depth Analysis: CapstoneProject1_In-DepthAnalysisReport.pdf
☆ Final report: CapstoneProject1_ConsolidatedReport.pdf
☆ CapstonProject1_PetProductSubcategorization
This project was conducted as a part of the Data Science Career Track Course at Springboard. My deepest appreciation goes to them, especially to my mentor, who gave me constructive comments and warm encouragement.