Prof: Michael Schatz (mschatz @ cs.jhu.edu)
TA: Arun Das (arun.das @ jhu.edu)
Class Hours: Monday + Wednesday @ 1:30p - 2:45p on Zoom (see Blackboard for link)
Schatz Office Hours: By appointment
Das Office Hours: Monday 12-1pm and by appointment
The primary goal of the course is for students to be grounded in theory and leave the course empowered to conduct independent genomic analyses. We will study the leading computational and quantitative approaches for comparing and analyzing genomes starting from raw sequencing data. The course will focus on human genomics and human medical applications, but the techniques will be broadly applicable across the tree of life. The topics will include genome assembly & comparative genomics, variant identification & analysis, gene expression & regulation, personal genome analysis, and cancer genomics. The grading will be based on assignments, a midterm exam, class presentations, and a significant class project. There are no formal course prerequisites, although the course will require familiarity with UNIX scripting and/or programming to complete the assignments and course project.
- Online introduction to Unix/Linux. Students are strongly recommended to complete one of the following online tutorials (or both) before class begins.
- Access to a Linux Machine, and/or Install Docker (Unfortuantely, even Mac will not work correctly for some programs)
- Syllabus and Policies
- Piazza Discussion Board
- GradeScope Entry Code: 86KZZP
- Spring 2020 Course Materials
- Applied Comparative Genomics by Aaron Quinlan
- Algorithms for DNA Sequencing by Ben Langmead
- Computational Biology by Rob Patro
- HarvardX Biomedical Data Science
- PLOS Computational Biology Translational Bioinformatics
- Biostars Handbook
- Molecular Biology of the Gene (Watson et al)
- Molecular Biology of the Cell (Alberts)
- Biological Sequence Analysis (Durbin et al)
- Modern Statistics for Modern Biology (Holmes & Huber)