PIpeline for MetaGenomic Analysis of Viruses
The main goal of the PIMGAVir pipeline is to provide the user with a preliminary taxonomic classification of the data under analysis. Three approaches are most commonly used in the literature for this purpose: read-based, assembly-based, and clustering-based. PIMGAVir lets the user analyze the data with one, several, or all of these strategies in parallel.
PIMGAVir runs the chosen method or methods, each of which performs the following steps:
- Read_based performs the taxonomic classification starting from the file produced by the pre-process/reads_filtering task
- Ass_based, starting from the file produced by the pre-process/reads_filtering task, performs the taxonomic classification
- Clust_based clusters the reads produced by the pre-process/reads_filtering task, builds the phylogenetic tree, and performs the taxonomic classification
Note that the user can run the pimgavir.sh script with more than one “strategy” option at the same time.
For example, the following command
pimgavir.sh R1.fq R2.fq SampleName 24 --read_based --ass_based --filter
will run the pipeline with both strategies, --read_based and --ass_based.
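As a minimal sketch, a command line with several strategy options could be assembled by a small wrapper like the one below. The function build_pimgavir_cmd is hypothetical and not part of PIMGAVir; it only prints the command rather than running it. It assumes the argument order of the example (two read files, a sample name, and a fourth positional value passed through unchanged) and accepts only the two strategy flags spelled out in this README.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with PIMGAVir): prints a pimgavir.sh
# command line for the requested strategies without executing it.
build_pimgavir_cmd() {
    # Expect: R1 R2 SampleName <fourth positional value> strategy...
    if [ "$#" -lt 5 ]; then
        echo "usage: build_pimgavir_cmd R1 R2 SAMPLE VALUE STRATEGY..." >&2
        return 2
    fi
    r1=$1; r2=$2; sample=$3; fourth=$4
    shift 4

    cmd="pimgavir.sh $r1 $r2 $sample $fourth"
    for strategy in "$@"; do
        case "$strategy" in
            # Only the flags shown in this README are accepted here.
            read_based|ass_based)
                cmd="$cmd --$strategy" ;;
            *)
                echo "unknown strategy: $strategy" >&2
                return 1 ;;
        esac
    done

    # --filter is appended as in the README example.
    echo "$cmd --filter"
}

build_pimgavir_cmd R1.fq R2.fq SampleName 24 read_based ass_based
```

Printing the command first makes it easy to check the options before launching a long run. A clustering-based option could be added to the case statement once its exact flag spelling is confirmed in the user manual.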
Refer to PIMGAVir_Installation_Manual.pdf for a complete guide on configuring and installing PIMGAVir.
Refer to PIMGAVir_User_Manual.pdf for a quick-start guide to using PIMGAVir.
Refer to PIMGAVir_SYA.pdf for a more detailed description of the PIMGAVir infrastructure.
Table 1 and Table 2 list the packages needed to install PIMGAVir and Vir-MinION.
Refer to PIMGAVIR-CAMISIM.txt for generating synthetic data with CAMISIM and running PIMGAVir on it. If you prefer to use the synthetic data used during the PIMGAVir tests, download the file SyntheticData_PIMGAVir.tgz.
Refer to CreateDBs.txt for a step-by-step guide on installing and configuring the DBs needed by PIMGAVir.
Please refer to the manuscript "Mastriani E, Bienes KM, Wong G, Berthet N. PIMGAVir and Vir-MinION: Two Viral Metagenomic Pipelines for Complete Baseline Analysis of 2nd and 3rd Generation Data. Viruses. 2022 Jun 10;14(6):1260. doi: 10.3390/v14061260. PMID: 35746732; PMCID: PMC9230805." for more information.