data-catalogue-pseudonymisation

This is the repository for the pseudonymisation part of the BBMRI.cz data catalog.

Pseudonymisation

Pseudonymizes predictive numbers, collects clinical data and removes unnecessary files before moving the data to SensitiveCloud at ICS-MUNI.

Supported sequencing types

Miseq, New Miseq, MammaPrint

How to run the scripts

Locally - Development

Using main.py

Install requirements

pip install -r requiremenents.txt

Run main.py

python main.py -s /path/to/runs/for/pseudonymization -d /path/to/sensitive/cloud/destination 
               -t /path/to/pseudonymisation/tables/folder -l /path/to/libraries 
               -lsc /path/to/sensitive/cloud/libraries"

Using docker-compose

docker-compose up -f compose.dev.yml -d --build

In production

Using docker-compose

# connect to seq server
su export
cd /home/export/data-catalogue-pseudonymisation
docker-compose up -f compose.prod.yml -d

Deployment in cron

# connect to seq serve
su export
crontab -e
# setting cron to run every Monday, Wednesday, Friday at 22:00
0 22 * * 1,3,5 /usr/local/bin/docker-compose -f /home/export/data-catalogue-pseudonymisation/compose.prod.yml up -d &>> /home/export/logs/`date +\%Y\%m\%d\%H\%M\%S`.log

Deploying new version in production

su export
cd /home/export/data-catalogue-pseudonymization
git switch main
git pull

The new version shouldthe new version should automatically start in production once the cronjob is run automatically start in production once the cronjob is run.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.github/workflows		.github/workflows
pseudonymization		pseudonymization
tests		tests
.gitignore		.gitignore
Dockerfile.dev		Dockerfile.dev
Dockerfile.prod		Dockerfile.prod
LICENSE		LICENSE
README.md		README.md
compose.dev.yml		compose.dev.yml
compose.prod.yml		compose.prod.yml
dev-config.cfg		dev-config.cfg
main.py		main.py
prod-config.cfg		prod-config.cfg
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

data-catalogue-pseudonymisation

Pseudonymisation

Supported sequencing types

How to run the scripts

Locally - Development

Using main.py

Using docker-compose

In production

Using docker-compose

Deployment in cron

Deploying new version in production

About

Releases 1

Packages

Contributors 2

Languages

License

BBMRI-cz/data-catalogue-pseudonymisation

Folders and files

Latest commit

History

Repository files navigation

data-catalogue-pseudonymisation

Pseudonymisation

Supported sequencing types

How to run the scripts

Locally - Development

Using main.py

Using docker-compose

In production

Using docker-compose

Deployment in cron

Deploying new version in production

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Languages

Packages