This is part of my Master's Thesis project for the University of Granada (UGR).
miRPIPE is a set of scripts to parallelize the analysis of miRNA in human samples with sRNAtoolbox
(https://bioinfo2.ugr.es/srnatoolbox/). Feel free to use it to parallelize your projects.
-
Download the miRPIPE folder from https://github.com/marromesc/miRPIPE/tree/master/miRPIPE
-
Move it to your opt folder.
mv miRPIPE.zip /home/usr/opt/miRPIPE.zip*
- Unzip
unzip miRPIPE.zip
- Run one by one for each script:
chmod +x script_name.sh
- Creating working directory with these folders:
- data for the downloaded samples
- logs for the log files that you can check if there is any error
- batch for the batch files in .bat format
- results for the results
- Creating batch file for sRNAbench analysis.
USAGE:
batch_prepro_bench.sh /home/user/working_directory /path/to/SraRunTable.txt samples_downloaded adapter_ask adapter_sequence
If you have already downloaded the samples type 1 in samples_downloaded. If not, type 0. If adapter_ask is 0, it means that there is no adapter. Type 1 in adapter_ask to guess the adapter or 2 if there is an adapter.
OUTPUTS:
/home/user/working_directory/experiment/batch/prepro.bat
. An optionally batch file output. Running it you will download the samples parallelly if you have not done before./home/user/working_directory/experiment/batch/bench.bat
. Runnng it you will perfom sRNAbench analysis./home/user/working_directory/experiment/logs/parameter_file.txt
. Samples report. It will be used by other scripts.
Creates a batch file that you can run to analyze the differential expression of your data.
USAGE:
sh DE.sh /home/user/working_directory /path/to/SraRunTable.txt column*
In the column, type the name of the SRA Table's column where it is the phenotypes conditions.
OUTPUT:
/home/user/working_directory/experiment/batch/DE.bat
. Running it you will also run parallely both miRBase and miRGeneDB differential expression analysis.
quality_creator.sh
A simple script to parallelize quality sRNAbench analysis.
USAGE:
sh batch_QUALITY.sh QUALITY_parameter_file.txt
In the parameter file, type 0 for mean analysis, 1 for min analysis or 2 for noMM=0. You have a QUALITY_parameter_file.txt
example in the documentation.
OUTPUTS:
/home/user/working_directory/experiment/batch/quality_analysis_commands.bat
. Running it you will also run parallelly BATCH QUALITY for all the analysis indicated before./home/user/working_directory/experiment/batch/quality_boxplot_commands.bat
. Running it you will also run parallelly BATCH QUALITY BOXPLOT for all the analysis indicated before.
Creates a batch file for quality sRNAbench analysis parallely.
USAGE:
sh batch_QUALITY /home/user/working_directory /path/to/SraRunTable.bat mean_or_min_or_noMM Q
Type 0 for mean analysis, 1 for min analysis or 2 for noMM=0
OUTPUTS:
The output will be generated in the batch folder. Example: /working_dir/experiment/batch/batch_QUALITY_type_20SRP049635.bat*
. Running it you will run BATCH_QUALITY
script parallelly for all the analyses indicated in the parameter file.
Create the input to generate a boxplot to visualize the quality in the isoMiRs profiles with ggplot
package in R.
USAGE:
sh batch_ggplot_quality.sh /home/user/working_directory /path/to/SraRunTable.batt folder_output_name
OUTPUTS:
The output will be generated in the logs folder. Example: /working_dir/experiment/logs/folder_output/input_+.txt
.
Visualization of the quality impact in isomiRs and other variants with ggplot
package in R.
USAGE:
Rscript ggplot_quality.R results_directory path_to_input
OUTPUTS:
The output will be generated in the results_directory
that you have created previously.