The world is a dangerous place. During 2015 and 2016, there was a lot of discussion in the news about police shootings. FiveThirtyEight reported on gun deaths in 2016. As leaders in data journalism, they have posted a clean version of this data in their Github repo called full_data.csv for us to use.
The FiveThirtyEight data is a bit old, and our client would like us to use the most recent data from the CDC. He would also like the parser script to use Python instead of R.
While their visualizations focused on yearly averages, our client wants to create commercials that help reduce gun deaths in the US. They would like to target the commercials in different seasons of the year (think month variable) to audiences that could significantly reduce gun deaths. Our challenge is to summarize and visualize seasonal trends across the other variables in these data.
- R4DS: Chapter 5 Data transformation
- R4DS: Chapter 7 Exploratory Data Analysis
- Py4DS: Chapter 5 Data transformation
- Py4DS: Chapter 7 Exploratory Data Analysis
- Translate FiveThirtyEight's CDC_parser.R script into a Python script.
- Verify that your output matches their results.
- Build new data sets that use data through 2019.
- Provide a brief summary of the FiveThirtyEight article.
- Create one plot that provides similar insight to their visualization in the article. It does not have to look like theirs.
- Write a short paragraph summarizing their article.
- Address the client's need for emphasis areas of their commercials for different seasons of the year.
- Provide plots that help them know the different potential groups (variables) they could address in different seasons (2-4 visualizations seem necessary).
- Write a short paragraph describing each image.