Machine learning detection of unstable antibiotic heteroresistance in E. coli

The pipelines are created using Snakemake

Data analysis and modelling are performed using R and tidyverse.

Snakemake pipelines

Main pipeline

File: workflow/snakefile.smk

Purpose: assembling and annotating E. coli genomes (resistance genes, IS elements, direct repeats) from both short and long sequencing reads.

Configuration file: workflow/config.yaml

To run the pipeline short and long reads should be in resources/data_raw/{strain}/short/ and resources/data_raw/{strain}/long/ directories.

DAG:

Phylogeny pipeline

File: workflow/phylogeny.smk

Purpose: phylogenetic analysis of the samples including 27 reference strains.

Configuration file: workflow/config_phylogeny.yaml

Analysis of mutants

File: mutants.smk

Purpose: analysis of the HR mutants.

Configuration file: workflow/config_mutants.yaml

DAG:

Data analysis and machine learning

For feature generation see notebooks/modelling/features.qmd.

For exploratory data analysis of the features, see file notebooks/modelling/EDA.qmd,

For training and validation procedures, see notebooks/modelling/training_and_validation.Rmd,

For analysis of the models, see notebooks/modelling/models_analysis.Rmd

Features table: notebooks/modelling/data/features_strain.csv

Strains will be available from SRA under BioProject PRJNA1165464

Name		Name	Last commit message	Last commit date
Latest commit History 1,515 Commits
images		images
notebooks		notebooks
workflow		workflow
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine learning detection of unstable antibiotic heteroresistance in E. coli

Snakemake pipelines

Main pipeline

Phylogeny pipeline

Analysis of mutants

Data analysis and machine learning

About

Releases

Packages

Contributors 2

Languages

andrewgull/HeteroR

Folders and files

Latest commit

History

Repository files navigation

Machine learning detection of unstable antibiotic heteroresistance in E. coli

Snakemake pipelines

Main pipeline

Phylogeny pipeline

Analysis of mutants

Data analysis and machine learning

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages