This repository contains some looper pipeline_interface
files, so you can run basic utilities on your PEP projects using looper.
You can run any of these utilities on all samples of any PEP-compatible project like so:
- Point your project at one of these pipeline_interface files (I usually use a subproject)
- Specify the required sample attributes (by adding appropriate columns to your annotation)
- Run
looper
(be sure to activate the subproject if you specified one)
For example, to use the rsync_piface.yaml
pipeline interface, add this to your project_config.yaml
:
subprojects:
rsync:
metadata:
pipeline_interfaces: ${CODE}pifaces/rsync_piface.yaml
Then, add src
and dest
columns with filenames to your sample_annotation
(these are what rsync
needs), and then run with:
looper run project_config.yaml --sp rsync
This will run rsync OLD NEW
for each sample in your project, allowing you to sync raw data from one file system to another, including remote syncing via SSH!
Now here's detailed documentation of what each task does and what it requires.
Task: Runs rsync on files in your project for each sample, to sync remote data to a local file system (or vice versa), or to move files from one local filesystem to another.
Required columns: src
specifies the source filename; dest
specifies the destination.
Task: Returns the file size (in bytes) for files for each sample.
Required columns: file
specifying the path and filename to assess.