Chromatin-Sequence Hot-encoded Analysis Pipeline with DeepSEA network
This pipeline runs its analysis directly from a summary statistics file of a Genome-Wide Association Study (GWAS). The file must be tab-separated and contain columns with the following labels:
- SNP - single nucleotide variants, given as rsIDs or similar labels
- CHR - chromosome of the SNP
- BP - base pair position of the SNP
- A1 - effect allele of the SNP
- A2 - alternate allele of the SNP
The pipeline uses the DeepSEA model, a deep learning convolutional network implemented in PyTorch, and must be run on a GPU. Download the pickle file with the best hyperparameters from GoogleLink and keep it in the same directory from which you run the script.
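The column requirements listed above can be checked before a run. The following is a minimal sketch, not part of C-SHAL itself: the helper name `missing_columns` and the example row are hypothetical, and it uses only the standard-library `csv` module (a pandas-based check would work similarly).

```python
import csv
import io

# Column labels the summary statistics file is expected to contain.
REQUIRED_COLUMNS = ("SNP", "CHR", "BP", "A1", "A2")


def missing_columns(handle):
    """Return the required column labels that are absent from the header row."""
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader, [])  # an empty file yields an empty header
    present = {name.strip() for name in header}
    return [col for col in REQUIRED_COLUMNS if col not in present]


# Hypothetical one-SNP example; a real file would be opened with
# open(path, newline="") and passed in the same way.
example = io.StringIO(
    "SNP\tCHR\tBP\tA1\tA2\n"
    "rs0000001\t1\t1000000\tA\tG\n"
)
print(missing_columns(example))
```

An empty list means every required label is present; otherwise the list names the labels to add or rename.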
Required packages:
- Deep learning packages - PyTorch, scikit-learn
- Data handling packages - pandas, NumPy, os, pickle
- CLI packages - Click
- Biopython
Clone the Git repository to your own system; a Unix-based OS is required for now. Install the required packages using pip, and download the best-hyperparameters model file into the folder from which you run the script.
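Before the first run, it can help to confirm that the third-party packages are importable. The sketch below is an illustration rather than part of the repository: the helper name `missing_modules` is hypothetical, and the mapping uses the usual import names of the listed packages (`os` and `pickle` ship with Python and are omitted).

```python
from importlib.util import find_spec

# Display name -> import name for the third-party packages listed above.
REQUIRED_MODULES = {
    "PyTorch": "torch",
    "scikit-learn": "sklearn",
    "pandas": "pandas",
    "NumPy": "numpy",
    "Click": "click",
    "Biopython": "Bio",
}


def missing_modules(modules=REQUIRED_MODULES):
    """Return the display names of packages that cannot be found for import."""
    return [label for label, name in modules.items() if find_spec(name) is None]


missing = missing_modules()
if missing:
    print("Install with pip before running:", ", ".join(missing))
else:
    print("All required packages were found.")
```

This check only confirms that the packages are installed; it does not verify versions or GPU availability.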
Run the script with the python command and the parameters described below:
python CSHAL.py --ss='Path_To_Summary_Statistics_File' --w=500 --email='ENtrez_account_email_id' --ak='API KEY' --det=choose among log/all/both
If an option is missing or filled in incorrectly, you should be prompted for it.
- --ss --> A tab-separated summary statistics file with the column headers SNP, CHR, BP, A1, and A2. These columns hold the single nucleotide polymorphism, chromosome, base pair position, primary allele, and secondary allele.
- --w --> The upstream and downstream width around the base pair position.
- --email --> Email of the Entrez account used to obtain sequences.
- --ak --> API key from NCBI for faster processing.
- --det --> Level of detail in the output. log gives only the basic log terms; all provides all 919 labels and values; both provides both files.
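To make the role of the width option and the one-hot encoding in the project title concrete, here is a minimal sketch. It is an assumption-laden illustration, not the code in CSHAL.py: the helper names, the inclusive 1-based window convention, and the A, C, G, T channel order are all hypothetical choices.

```python
BASES = "ACGT"  # assumed channel order; the pipeline's actual order may differ


def window_bounds(bp, width):
    """Return 1-based inclusive (start, end) covering `width` bases on each side of `bp`.

    The start is clipped at 1 so that SNPs near the chromosome start stay valid.
    """
    start = max(1, bp - width)
    return start, bp + width


def one_hot(sequence):
    """Encode a DNA string as rows of four 0/1 values, one row per base.

    Symbols other than A, C, G, T (for example N) become an all-zero row.
    """
    return [[1 if base == b else 0 for b in BASES] for base in sequence.upper()]


start, end = window_bounds(bp=1_000_000, width=500)
print(start, end)
print(one_hot("ACGN"))
```

Under this convention, a width of 500 yields a window of 1001 positions including the SNP itself; whether the pipeline trims this to a fixed model input length is not stated here.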
For a more detailed explanation of the command-line options, run python CSHAL.py --help
See the open issues for a list of proposed features (and known issues).
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
- Fork the Project
- Create your Feature Branch (git checkout -b feature/AmazingFeature)
- Commit your Changes (git commit -m 'Add some AmazingFeature')
- Push to the Branch (git push origin feature/AmazingFeature)
- Open a Pull Request
Distributed under the MIT License. See LICENSE for more information.
PradeepVarathan - @VarathanPradeep - pradeepvarathanpugalenthi@gmail.com
Project Link: https://github.com/PradoVarathan/C-SHAL-DeepSEA