1 Understand the client and business problem at hand. 2 Identify the requirements that need to be delivered for this project. 3 Identify which tasks you should focus on as a Data Analyst. Client's Problem that Accenture is tasked to address : The client has reached a massive scale within recent years and does not have the resources internally to handle it. Three requirements that Accenture is tasked to fulfill : Audit of big data practice, recommendations for IPO, analysis of popular content brief document from social buzz has been attched
Often you won’t need all these datasets to find what you’re looking for. So, the first step is to use this data model to identify which datasets will be required to answer your business question - which is to to figure out the top 5 categories with the largest popularity. After Analysis we got data sets needed to complete analysis:
- Reaction Score(score is used to quantified the popularity)
- Content ID
- Reaction Types
- Content type
- Category
Clean the data by:
- removing rows that have values which are missing,
- changing the data type of some values within a column, and
- removing columns which are not relevant to this task.
- Think about how each column might be relevant to the business question you’re investigating.
- If you can’t think of why a column may be useful, it may not be worth including it.
Create a final data set by merging 3 tables End result will be one spreadsheet
- A cleaned dataset
- Top 5 categories
So, the cleaned data set after data modelling & data cleaning :Cleaned Dataset.xlsx
Data Visualization and Storytelling: Make the Powerpoint presentation as per the given template Charts Involved :
- Pie Chart
- Bar Chart Powerpoint Presentation : PPT
Present to the Client: Present your powerpoint presentation to the client and deliver the insights of your analysis