-
Notifications
You must be signed in to change notification settings - Fork 0
/
0204_datascience.Rmd
161 lines (105 loc) · 5.92 KB
/
0204_datascience.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
# 703: Data Science {-}
```{r setup, include=FALSE}
source("scheduler.R")
knitr::opts_chunk$set(cache=FALSE, dev='pdf')
first_day <- as.Date("2025-01-13")
cache=F
number_of_classes=29
number_of_weeks=16
####
#devtools::install_github("crsh/citr")
library(tidyverse)
library(citr)
library(RefManageR)
bib <- ReadBib("book.bib")
bib <- c(bib, as.BibEntry(citation()))
bib[length(bib)]$key = "Rcite"
topic_vjoe=c(
"Introduction and What is Data Science",
"Graphical Data Analysis",
"Graphical Masterpieces",
"Scatterplots and Correlations",
"Quiz 1",
"Exploratory Data Analysis",# "Spring Break",# "",
"Data Transformations",
"Robustness and Resistence",
"Replication",
"Quiz 2",
"Modern Data Science",
"Archival Data",
"Data mining and CART",
"Special Topics",
"Special Topics" )
topic =c(
"What is Data Science and Meet the Toolkit",
"Data and Visualization",
"Welcome to the Tidyverse",
"Data types and Data Transformations",
"Effective data visualization",
"Scientific Communication and confounding",
"Functions and Automation",# "Spring Break",# "",
"Webscraping",
"Spring Break",
"Ethics",
"Tidy Models",
"Overfitting and Cross validation",
"Quantifying uncertainty",
"Rshiny",
"Machine Learning",
"Presentations",
"NA",
"NA"
)
classschedule <- create_classschedule(first_day = first_day,
number_of_weeks = number_of_weeks,
weekday_due = "m",
topic = topic,
exams_week= 2,
assignments_week= 16,
exams_name = "Contract",
assignments_name = "Presentation")
```
## Course Description {-}
Data Science for Psychologists (DS4P) introduces the principles of data science, including data wrangling, modeling, visualization, and communication. In this class, we link those principles to psychological methods and open science practices by emphasizing exploratory analyses and description, rather than confirmatory analyses and prediction. We'll work our way thru Wickham and Grolemund's R for Data Science text (http://r4ds.had.co.nz/) and develop expertise in tidyverse (https://www.tidyverse.org/). This class emphasizes replication and reproducibility. DS4P is a practical skilled-based class and should be useful to students aiming for academia as well as those interested in industry. Applications of these methods can be applied to a full range of psychological areas, including perception (e.g, eye-tracking data), neuroscience (e.g., visualizing neural networks), and individual differences (e.g., valence analysis).
### Broad goals for the course: {-}
- 1. Reproducibility;
- 2. Replication;
- 3. Robust Methods;
- 4. Really Nice Visualization; and
- 5. R
## Required Materials {-}
All materials for this course are open source, including the multimedia course notes.
- Garrison's Data Science for Psychologists (https://datascience4psych.github.io/DataScience4Psych/)
- Wickham and Grolemund's R for Data Science text (http://r4ds.had.co.nz/)
## Course Modality {-}
This course is designated as a blended course. This designation is because this course's modality changes based on the level of COVID-19 transmission in the community. When COVID-19 community transmission is low or medium, this class will be in-person and masking will be required. However, if COVID-19 transmission is high in the community, this course may be moved entirely online in order to protect the health and safety of students and instructor. In the case that we move online, the in-person sessions will become synchronous online sessions. Any changes will be clearly and promptly communicated via email. More specifics on the mask requirement can be found under my public health policies.
## Course Assignments {-}
You will be formally evaluated in three different ways. Instructions for the assignments can be found in the [course notes](https://datascience4psych.github.io/DataScience4Psych/).
<!--
### Presentation {-}
You will give one presentation. This presentation will be approximately 10-minutes. This presentation is chance to practice a formal presentation in a relatively pressure-free setting).
-->
### Labs {-}
There is (at least) one lab available per module. You can find the modules in the [course notes](https://datascience4psych.github.io/DataScience4Psych/).
### Portfolio {-}
The major semester projects (described on the data science website) is a set of DS4P Projects, which you will summarize in a DS4P Portfolio to be handed in on the last day of class. Instructions for [the portfolio](https://datascience4psych.github.io/DataScience4Psych/guidance.html#portfolio) can be found in the [course notes](https://datascience4psych.github.io/DataScience4Psych/).
### Engagement {-}
This experiential course requires active engagement. There will be few lectures and we will not be building toward an exam. Instead, we will work together to build our facilities for thinking critically about data. You should come to every class having read all of the required reading, watched the required videos, browsed the suggested resources, and so forth. You should enter the classroom prepared to discuss these materials with colleagues and complete both individual and group in-class assignments.
## Grading Policy {-}
```{r child = './common/content/contract.Xmd'}
```
### Contract Grades {-}
```{r child = './common/content/contract_grades_ds4p.Xmd'}
```
## Tentative Module Release Schedule {-}
Note that I'll be using your contracts to guide my tinkering.
```{r message = FALSE, warning = FALSE, echo = FALSE}
classschedule %>%
DT::datatable(rownames = FALSE,
class = 'cell-border stripe',
#filter = list(position = 'top'),
options = list(pageLength = nrow(classschedule$module),
autoWidth = FALSE,
bInfo = FALSE,
paging = FALSE))
```