This is the code for the analyses performed in the publication ().
Before running the with different datasets the repository needs to be set up accordingly.
- create a file .config.json in the root of the repository with the following structure:
{ "DATAPATH": "/path/to/datasets", "RESULTSPATH": "/path/to/results" }
- Put your data in the folder that is defined in DATAPATH. Each each publication should get a new folder and in this folder each SRA entry is a single .csv file. To generate datasets the DI identification pipeline by Alnaji et al. can be used.
- Install all dependencies. This can be done by conda using the following command:
conda env create --file=env.yml