The Cellenics pipeline project for dependency-managed work processing.
The steps of the pipeline that are run through this project are started automatically on your machine as Docker containers, simulating Kubernetes in local development.
We have included a utility that automatically monitors spawned containers and streams their logs as they execute.
For local development, you should already have Docker and Node.js installed, as well as InfraMock running.
Afterwards, you can install the pipeline dependencies with:

```bash
make install
```
To build and run the pipeline:

```bash
make build && make run
```
You should see a message similar to this:

```
> node src/app.js

Loading CloudFormation for local container launcher...
Creating mock Lambda function on InfraMock...
No previous stack found on InfraMock.
Stack with ARN arn:aws:cloudformation:eu-west-1:000000000000:stack/local-container-launcher/106d1df9 successfully created.
Waiting for Docker events...
Logs from pipelines run through the API will appear here.
```
If you make changes to the pipeline code, rebuild the Docker images with:

```bash
make build
```
First make sure the project library is synchronized with the lockfile:

```R
# inside pipeline-runner folder
renv::restore()
```
NOTE: To restore Bioconductor packages, your R version needs to match the one in the Dockerfile (4.2.0).
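If you are unsure whether your project library already matches the lockfile, renv can report any differences before you restore (a quick check using renv's built-in `status()`):

```R
# reports packages that are out of sync between the project library and renv.lock
renv::status()
```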
Install new packages with `install.packages(...)` and use them (e.g. `dplyr::left_join(...)`) as you normally would. Then, update the lockfile:

```R
renv::snapshot()
```

Commit the changes to the lockfile (it is used to install dependencies in the Dockerfile). See the renv docs for more info.
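As a concrete run-through (assuming, purely for illustration, that the new dependency is `dplyr`):

```R
# inside pipeline-runner folder
install.packages("dplyr")  # installs into the project library managed by renv
# ... use it in the package code, e.g. dplyr::left_join(...) ...
renv::snapshot()           # records dplyr and its version in renv.lock
```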
Packages used for interactive development, such as `devtools`, `usethis`, `roxygen2`, `styler` and the R `languageserver` (to develop R in VS Code!), and their dependencies should not be added to the lockfile, since they are not required at runtime. `renv` has been configured to ignore them.
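You can inspect the ignore list through renv's settings API (a sketch; the exact contents depend on this project's renv configuration):

```R
# returns the character vector of packages renv excludes from snapshots
renv::settings$ignored.packages()
```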
To install these development packages, run the following with no arguments. This installs all packages in the DESCRIPTION file, which includes the development dependencies in the Suggests section.

```R
renv::install()
```
There are several ways to run tests locally. The easiest is the RStudio shortcut `Cmd + Shift + T`.

Other ways to run tests locally:

```R
devtools::test()
testthat::test_local()
```
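To run only a subset of tests while iterating, both helpers accept a narrower target; for example (the filter and file name below are hypothetical):

```R
# run only test files whose names match the filter
devtools::test(filter = "qc")

# or run a single test file directly
testthat::test_file("tests/testthat/test-qc.R")
```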
TLDR: save something inside `/debug` in a data processing or gem2s step to access it later from `./local-runner/debug`.
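For example, a throwaway line you might drop into a step while debugging (hypothetical; assumes `scdata` is in scope inside the step):

```R
# anything written to /debug inside the container shows up in ./local-runner/debug
saveRDS(scdata, "/debug/scdata.rds")
```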
TLDR2: if the pipeline throws an error, `tryCatchLog` will save a dump file in `./local-runner/debug` that can be used for inspecting the workspace and object values along the call stack.
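To inspect such a dump, load it into an R session and step through the call stack with the post-mortem debugger (the file name below is a placeholder; use the one actually written to `./local-runner/debug`):

```R
# the dump contains a `last.dump` object with the frames at the time of the error
load("dump_20230101_120000.rda")
utils::debugger(last.dump)  # pick a frame number to browse its objects
```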
To save the parameters (`config`, `seurat_obj`, etc.) passed to a data processing task function, specify `DEBUG_STEP`. Available tasks include all task names listed in `run_processing_step` in `init.R`, as well as `DEBUG_STEP=all` to save the parameters to all data processing task functions:
```bash
# e.g. DEBUG_STEP=dataIntegration
DEBUG_STEP=task_name make run
```
When the pipeline is run, it will save the parameters passed to the specified `task_name` in `$(pwd)/debug`. You can load these into your R environment:
```R
# clicking the file in RStudio does this for you
load('{task_name}_{sample_id}.RData')

# if you need to load multiple tasks, you can load each into a separate environment
# you would then access objects using e.g. task_env$scdata
task_env <- new.env()
load('{task_name}_{sample_id}.RData', envir = task_env)
```
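Once loaded, you can explore the saved objects as usual (assuming the step saved `config` among its parameters):

```R
ls(task_env)          # list everything the step saved
str(task_env$config)  # inspect the task configuration
```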
```
Error in curl::curl_fetch_memory(url, handle = handle) :
  Timeout was reached: [172.17.0.1:4566] Connection timeout after 60001 ms
Calls: init ... request_fetch -> request_fetch.write_memory -> <Anonymous>
Execution halted
```
Turn off the firewall or allow incoming traffic. This allows AWS to send packets to the pipeline, which would otherwise be blocked by the firewall.

- Open Firewall Configuration from the Start Menu.
- Select Allow in the Incoming dropdown menu (alternatively, set Status to OFF).