Michael Schatz (mschatz @ cs.jhu.edu)
Class Hours: Tuesday + Thursday @ 1:30p - 2:45p in Shaffer 304
Office Hours: Tuesday + Thursday @ 3-4p in Malone 323 and by appointment
The primary goal of the course is for students to be grounded in theory and leave the course empowered to conduct independent genomic analyses. We will study the leading computational and quantitative approaches for comparing and analyzing genomes starting from raw sequencing data. The course will focus on human genomics and human medical applications, but the techniques will be broadly applicable across the tree of life. The topics will include genome assembly & comparative genomics, variant identification & analysis, gene expression & regulation, personal genome analysis, and cancer genomics. The grading will be based on assignments, a midterm exam, class presentations, and a significant class project. There are no formal course prerequisites, although the course will require familiarity with UNIX scripting and/or programming to complete the assignments and course project.
- Online introduction to Unix/Linux. Students are strongly recommended to complete one of the following online tutorials (or both) before class begins.
- Access to a Linux Machine, or Install VirtualBox (Unfortuantely, even Mac will not work correctly for some programs)
- Applied Comparative Genomics by Aaron Quinlan
- Algorithms for DNA Sequencing by Ben Langmead
- Computational Biology by Rob Patro
- Biomedical Data Science by Mark Gerstein
- Bioinformatics Data Structures and Algorithms by Paul Medvedev
- String Algorithms by Carl Kingsford
- HarvardX Biomedical Data Science
- PLOS Computational Biology Translational Bioinformatics
- Biostars Handbook
- Molecular Biology of the Gene (Watson et al)
- Biological Sequence Analysis (Durbin et al)