Below is an overall description of our tool. For details and citations about this work, please check:
- Python 3.11.4
-
Create a virtual environment:
python 3.12.1 -m venv venv_name
-
Activate the virtual environment:
source venv/bin/activate
-
Install required packages:
pip install -r requirements.txt
Run gseconverter.py
by providing the GSE id as input:
input_gseid = "GSE210222"
This code will download soft and gpl files and create a tab-separated .csv file with raw count values.
Execute tpm_converter.py
to convert the counts matrix file to a TPM-normalized matrix.
Run ssgsea.py
by providing the matrix.csv file and genesets.gmt file for GSEA analysis. The code will generate a scatter plot of samples with PCA values.
Finally, execute main.py
to generate plots showcased in the paper.
For RNA-Seq data where the .soft file lacks matrix values, download the matrix file separately.
This project is licensed under the MIT License.