Welcome to Economics 607: Econometrics III (Spring 2019) at the University of Oregon (taught by Dr. Ed Rubin).
Lecture Monday and Wednesday 10:00pm–11:50pm, Zoom (See Canvas)
Lab Friday 12:00pm–12:50pm, Zoom (See Canvas)
Office hours
- Ed Rubin TBD, Zoom (See Canvas)
- Colleen O'Briant TBD, Zoom (See Canvas)
We will mainly use two books.
Mostly Harmless Econometrics: An Empiricist's Companion (MHE)
by Angrist and Pischke
Your new best friend. Read it.
Microeconometrics (C&T)
by Cameron and Trivedi
Also very readable and accessible.
Runner up (the standard):
Econometric Analysis (Greene)
by Greene
Encyclopedic resource for all (most?) of the questions MHE does not answer.
Note: The linked slides (below) are .html
files that will only work properly if you are connected to the internet. If you're going off grid (camping + metrics?), grab the PDFs. You'll miss out on gifs and interactive plots, but the equations will actually show up. I've removed the within-slide (incremental) pauses in the (no pauses) PDF slides.
The content of the lectures mainly follows MHE and Michael Anderson—with additional inspiration from Max Auffhammer and many other sources.
Another note on the notes: I create the slides with xaringan
in R. Thanks to Grant McDermott for encouraging me to make this switch.
Lecture 01: Research + R + You = 💖
- An introduction to empirical research via applied econometrics.
- R: Light introduction—objects, functions, and help.
Note formats: .html |
.pdf |
.pdf (no pauses) |
.Rmd
Readings: MHE preface + MHE chapter 1
Lecture 02: The Experimental Ideal
- Neyman potential outcomes framework (Rubin causal model)
- Selection bias and experimental variation in treatment
- R: Object types/classes and package management.
Note formats: .html |
.pdf |
.pdf (no pauses) |
.Rmd
Readings: MHE chapter 2
- What's the big deal about least-squares (population) regression?
- What does the CEF tell us?
- How does least-squares regression relate to the CEF?
Note formats: .html |
.pdf |
.pdf (no pauses) |
.Rmd
Readings: MHE chapter 3.1
Lecture 04: Inference and Simulation
- How do we move from populations to samples?
- What matters for drawing basic statistical inferences about the population?
- How can we learn about inference from simulation?
- How do we run (parallelized) simulations in R?
Note formats: .html |
.pdf |
.pdf (no pauses) |
.Rmd
Readings: MHE chapter 3
- Saturated models
- When is regression causal?
- The conditional-independence assumption
Note formats: .html |
.pdf |
.pdf (no pauses) |
.Rmd
Readings: Still MHE chapter 3
- Omitted-variable bias
- Good and bad controls
Note formats: .html |
.pdf |
.pdf (no pauses) |
.Rmd
Readings: Still MHE chapter 3
- Matching estimators: Nearest neighbor and kernel
- Propensity-score methods: Regression control, treatment-effect heterogeneity, blocking, weighting, doubly robust
Note formats: .html |
.pdf |
.pdf (no pauses) |
.Rmd
Readings: MHE chapter 3 + C&T section 25.4
Lecture 08: Instrument Variables
- General research designs
- Instrumental variables
- Two-stage least squares
- Heterogeneous treatment effects and the LATE
Note formats: .html |
.pdf |
.pdf (no pauses) |
.Rmd
Readings: MHE chapter 4 + C&T sections 4.8–4.9
Lecture 09: Regression Discontinuity
- Sharp regression discontinuities
- Fuzzy regression discontinuities
- Graphical analyses
Note formats: .html |
.pdf |
.pdf (no pauses) |
.Rmd
Readings: MHE chapter 6 + C&T sections 25.6
Lecture 10: Inference: Clustering
- General inference
- Moulton
- Cluster-robust standard errors
Note formats: .html |
.pdf |
.pdf (no pauses) |
.Rmd
Readings: MHE chapter 8
Lecture 11: Inference: Resampling and Randomization
- Resampling
- The bootstrap
- Permutation tests (Fisher)
- Randomization inference (Neyman-Pearson)
Note formats: .html |
.pdf |
.pdf (no pauses) |
.Rmd
Readings: MHE chapter 6 + C&T sections 25.6
Lecture 12: Machine learning (in one lecture)
- Prediction basics
- The bias-variance tradeoff
- In-sample vs. out-of-sample performance
- Hold-out methods (including cross validation)
- Ridge regression and lasso
Note formats: .html |
.pdf |
.pdf (no pauses) |
.Rmd
Readings: Introduction to statistical learning
Note: From previous iteration of our class.
- Object types/classes/structures
- Package management
- Math and stat. in R
- Indexing
Note formats: .html | .html (no pause) | .pdf | .pdf (no pause) | .Rmd
Solutions:
.html | .pdf
- Data frames
- Data work with
dplyr
Note formats: .html | .html (no pause) | .pdf | .pdf (no pause) | .Rmd
Lab 03: RStudio + Data i/o with R
- RStudio
- Getting data into and out of R
Note formats: .html | .html (no pause) | .pdf | .pdf (no pause) | .Rmd
lm()
andlm
objectsestimatr
andlm_robust()
- Other regressions, e.g.,
glm()
Note formats: .html | .html (no pause) | .pdf | .pdf (no pause) | .Rmd
- Default
plot()
methods ggplot2
Note formats: .html | .html (no pause) | .pdf | .pdf (no pause) | .Rmd
- General simulation strategies
- Simulating IV in finite samples
Note formats: .html | .html (no pause) | .pdf | .pdf (no pause) | .Rmd
Lab 07: Miscellaneous R Tips and Tricks
- The
apply
family for()
loops- Lists
- Logical vectors and
which()
Note formats: .html | .html (no pause) | .pdf | .pdf (no pause) | .Rmd
2–5 problem sets combining econometric theory and R.
Problem set 1: problems | solutions
Problem set 2: problems | solutions
Problem set 3: problems with dataset 1, dataset 2, dataset 3, and more data | solutions
Building a research project/proposal.
Step 1: Research question (causal relationship of interest) and motivation.
- Assignment: Pitch a project—including the causal question of interest, the motivation, and (optional) how you could answer the question.
- This project should be something you could turn into a legitimate research project.
- Length: Between 2 sentences and 2 paragraphs (think abstract—read abstracts if necessary).
Due 15 April 2020 (Canvas)
Due 27 May 2020 (Canvas)
Step 3: Presentation of project pitch
Due 05 June 2020, sign up on Canvas.
- Inference and simulation
- Matching
- Instrumental variables
- Regression discontinuity
- Inference: Clustering and resampling
The final exam is due Friday, 12 June 2020 by 11:59pm.
Assignments Each assignment is worth 10% of your course grade.
Project The parts of the group project are jointly worth 25% of your course grade.
Exams The exams will cover the remainder of the points for the course.
- If there are multiple exams, then they will split the remainder equally.
- Example: With 3 assignments, the residual = 100% - (3×10% + 25%) = 45%.
- If we only have a final exam, it would be worth 45%.
- If we have a final exam and a midterm exam, each would be worth 22.5%.
Metrics books
- Hayashi's Econometrics
- Kennedy
- Mastering 'Metrics (undergrad version of Mostly Harmless)
- Stock and Waston
- Wooldridge ("Baby")
- Wooldridge (Adult?)
R resources
- RStudio: Finding Your Way To R
- My EC525 Economics & Machine Learning Course at UO
- Grant McDermott's Data Science of Economists course
- DataCamp's Introduction to R
- R for Data Science
- Advanced R
Metrics and R