Prof: Michael Schatz (mschatz @ cs.jhu.edu)
TA: Charlotte Darby (cdarby @ jhu.edu)
Class Hours: Tuesday + Thursday @ 1:30p - 2:45p in Shaffer 304
Schatz Office Hours: Tuesday + Thursday @ 3-4p in Malone 323 and by appointment
Darby Office Hours: Wednesday @ 4pm in Malone 216 and by appointment
The primary goal of the course is for students to be grounded in theory and leave the course empowered to conduct independent genomic analyses. We will study the leading computational and quantitative approaches for comparing and analyzing genomes starting from raw sequencing data. The course will focus on human genomics and human medical applications, but the techniques will be broadly applicable across the tree of life. The topics will include genome assembly & comparative genomics, variant identification & analysis, gene expression & regulation, personal genome analysis, and cancer genomics. The grading will be based on assignments, a midterm exam, class presentations, and a significant class project. There are no formal course prerequisites, although the course will require familiarity with UNIX scripting and/or programming to complete the assignments and course project.
- Online introduction to Unix/Linux. Students are strongly recommended to complete one of the following online tutorials (or both) before class begins.
- Access to a Linux Machine, and/or Install VirtualBox (Unfortuantely, even Mac will not work correctly for some programs)
- Alternatively, you can try to use Docker. This has the tools you need preinstalled: https://github.com/srivathsapv/wga-essentials
- Applied Comparative Genomics by Aaron Quinlan
- Algorithms for DNA Sequencing by Ben Langmead
- Computational Biology by Rob Patro
- HarvardX Biomedical Data Science
- PLOS Computational Biology Translational Bioinformatics
- Biostars Handbook
- Molecular Biology of the Gene (Watson et al)
- Molecular Biology of the Cell (Alberts)
- Biological Sequence Analysis (Durbin et al)