Skip to content

Latest commit

 

History

History
173 lines (103 loc) · 2.85 KB

README.md

File metadata and controls

173 lines (103 loc) · 2.85 KB

Case Studies in Data Science: Civic Data

Math 241 | Reed College | Spring 2018

Week 1: Data Visualization I

Tuesday

  • Course Logistics
  • Principles of Data Graphics
    • John Snow's Cholera Map (link)
    • Napoleon's March by Minard (link)
  • Reading: p. 1 - 14
  • Please join the slack group
  • Please join datacamp

Thursday

  • Visual Cues
  • Reading:
    • p. 14 - 22
    • Tufte (see slack group)
  • Datacamp ggplot2 part I due Monday

Week 2: Data Visualization II

Tuesday

  • A grammar of graphics
  • ggplot2
  • Reading:
    • p. 33 - 48
    • Gelman (see slack group)
  • Homework 1 due by 5 pm on thursday
  • Datacamp ggplot2 part II due by 5 pm on thursday

Thursday

Week 3: Data Wrangling

Tuesday

  • A grammar of wrangling
  • dplyr
  • Reading:
    • p. 63 - 79
  • Homework 2 due by 5 pm on next Tuesday
  • Datacamp dplyr due by 5 pm on next Tuesday

Thursday

  • Working with multiple tables
  • Factors
  • Reading:
    • p. 79 - 88

Week 4: Tidy Data

Tuesday

  • Di Cook: Tidy Data and Visual Inference
  • Homework 3 due by 5 pm on next Thursday

Thursday

  • Tidy Data
  • Reading:
    • p. 91 - 104
    • Tidy Data, Hadley Wickham You're also encouraged to read this paper. It's a bit dated - he has since reorganized these ideas into the tidyr package - but it's useful to see the original formulation.

Week 5: Data Import

Tuesday

  • Tidy Data II
  • Writing functions
  • Data Import
  • Reading:
    • p. 116 - 128

Thursday

  • Data Import
  • Data Types
  • Lubridate
  • Homework 4 due by next 5 pm next Tuesday

Week 6: Ethics and Modeling

Tuesday

  • Discussion: Ethics
  • Reading:
    • p. 131 - 144

Thursday

  • Overview of Modeling
  • Linear Regression
  • Reading:
    • p. 465 - 477

Week 7: Modeling, cont.

Tuesday

  • Guest: Paul Gronke, Q&A about voter registration and participation in Oregon
  • Activity: Data Validation and EDA

Thursday

  • Logistic Regression
  • Reading:
    • p. 477 - 481, 188 - 196

Week 8: Modeling, Projects

Tuesday

  • Project brainstorm (brainstorm due by 4 pm today)
  • Modeling with Logistic Regression

Thursday

  • GitHub Workshop

Week 9: Projects

Tuesday

  • Groups assigned
  • Construct group repo and start on proposal

Thursday

  • No class: group meetings

Week 10: Spatial Data

Tuesday

  • Working with spatial data

Thursday

  • TBA