This is the final Power BI assignment for the Data Visualization course of the Big Data & Analytics Master's at EAE, class of 2021. The assignment uses mock input data provided by the professor:
- *.txt files in comma-separated-values format that mimic Heineken Sales data for the years 2015 and 2016.
- *.txt files in comma-separated-values format that mimic Heineken Sales Budget data for the same time period.
- *.txt files in comma-separated-values format that mimic Nielsen overall market Sales data for the same time period.
- *.xls files that represent Customer, Product and Salesman master data.

The purpose of the assignment is to provide sales insights and recommendations, backed up by visualizations.
Professors:
Team:
- Removed exact duplicates by creating a row-hash identity column.
- Removed rows with alphanumeric Product Codes, as they proved to be invalid.
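
The two cleaning steps above can be sketched outside Power BI as follows. This is only an illustration in Python, not the transformation actually used in the report: the column layout, the sample rows, and the helper names `row_hash` and `clean_rows` are hypothetical, and it assumes that a valid Product Code consists of digits only.

```python
import csv
import hashlib
import io


def row_hash(row):
    """Return a stable SHA-256 digest built from every field of the row."""
    # Join with the ASCII unit separator so ["ab", "c"] and ["a", "bc"] differ.
    joined = "\x1f".join(row)
    return hashlib.sha256(joined.encode("utf-8")).hexdigest()


def clean_rows(rows, product_code_index):
    """Drop exact duplicate rows, then rows whose product code is not numeric."""
    seen = set()
    cleaned = []
    for row in rows:
        digest = row_hash(row)
        if digest in seen:
            continue  # exact duplicate of a row already seen
        seen.add(digest)
        # Assumption: valid product codes contain digits only.
        if not row[product_code_index].strip().isdigit():
            continue
        cleaned.append(row)
    return cleaned


# Hypothetical sample: period, product code, customer code, units sold.
sample = io.StringIO(
    "2015-01,1001,C01,120\n"
    "2015-01,1001,C01,120\n"
    "2015-01,AB12,C02,80\n"
    "2015-02,1002,C01,95\n"
)
rows = list(csv.reader(sample))
cleaned = clean_rows(rows, product_code_index=1)
```

Hashing the whole row gives each record a compact identity that can be compared or stored as a column, which is cheaper than comparing every field of every pair of rows.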
The input data was generic: only the measures in the fact tables and the master data IDs were relevant to us. On top of those master data records, we therefore customized attributes such as codes, names and pictures. The result is an OLAP snowflake schema in which auxiliary tables attached to the dimensions store those customizations, such as:
- Customer Name
- Product Brand, Capacity and Container Type
- Account Manager Name
- Director Name & Pictures
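
As a rough illustration of how a snowflake schema resolves such attributes, the sketch below walks from a fact row to its dimension row and then to an auxiliary table. All table names, keys, and values are hypothetical and are not taken from the actual Power BI model.

```python
from collections import defaultdict

# Hypothetical Product dimension: one row per master data product id,
# holding only keys that point to the auxiliary tables.
product_dim = {
    1001: {"brand_id": 1, "container_id": 1},
    1002: {"brand_id": 1, "container_id": 2},
    1003: {"brand_id": 2, "container_id": 2},
}

# Hypothetical auxiliary tables storing the customized attributes.
brand_aux = {1: "Brand A", 2: "Brand B"}
container_aux = {1: "Bottle", 2: "Can"}

# Hypothetical fact rows: a measure plus the dimension key.
sales_fact = [
    {"product_id": 1001, "units": 120},
    {"product_id": 1002, "units": 80},
    {"product_id": 1003, "units": 50},
]


def units_by(facts, dim_key, aux_table):
    """Sum units per auxiliary attribute: fact -> dimension -> auxiliary table."""
    totals = defaultdict(int)
    for row in facts:
        aux_id = product_dim[row["product_id"]][dim_key]
        totals[aux_table[aux_id]] += row["units"]
    return dict(totals)


by_brand = units_by(sales_fact, "brand_id", brand_aux)
by_container = units_by(sales_fact, "container_id", container_aux)
```

Keeping the customized names in separate tables means the original master data records stay untouched, at the cost of one extra lookup per attribute.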