In this case study, I have four tasks:
- The first task is to manually download DNA and protein sequence data to my computer.
- The second task is to import the DNA data into Python.
- The third task is to create an algorithm that translates the DNA into a sequence of amino acids using the provided translation table.
- The fourth and final task is to check whether the translated DNA matches the downloaded protein sequence.
I will be using the NCBI (National Center for Biotechnology Information), which is the United States' main public repository of DNA and related information. I will download two files from the NCBI: the first is a strand of DNA, and the second is the corresponding protein sequence of amino acids translated from this DNA.
Link: https://www.ncbi.nlm.nih.gov/nuccore/NM_207618.2
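
The import, translation, and comparison tasks above can be sketched roughly as follows. This is only a minimal illustration, not the course's own solution: the function names, the `"*"` stop symbol, and the `start`/`end` parameters are my assumptions, and the codon table is the standard genetic code built compactly rather than the table supplied with the course. The coding region boundaries are not fixed here; they would have to be read from the CDS annotation of the NCBI record.

```python
from itertools import product

# Standard genetic code. Codons are enumerated with bases in T, C, A, G order
# and paired with one-letter amino acid codes. "*" marks a stop codon
# (an assumption; the course table may use a different symbol).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_seq(text):
    """Remove line breaks from raw sequence text and convert it to upper case."""
    return text.replace("\n", "").replace("\r", "").upper()


def read_seq(path):
    """Read a downloaded sequence file (hypothetical path) as one clean string."""
    with open(path) as f:
        return clean_seq(f.read())


def translate(dna):
    """Translate DNA into amino acids, three bases at a time.

    Only complete codons are translated; leftover bases at the end are ignored.
    """
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def matches_protein(dna, protein, start, end):
    """Check whether dna[start:end] translates to the given protein string.

    start and end are 0-based slice positions of the coding region (assumed
    to be supplied by the caller). A trailing stop symbol is dropped before
    comparing, since protein files do not include it.
    """
    return translate(dna[start:end]).rstrip("*") == protein
```

With the two files saved under names of my choosing, say `dna.txt` and `protein.txt`, the check would be `matches_protein(read_seq("dna.txt"), read_seq("protein.txt"), start, end)`, where `start` and `end` come from the record's annotation.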
This was part of the HarvardX PH526x course, Using Python for Research, offered on edX.