Skip to content
/ graphamr Public

Antibiotic prediction from metagenome assembly graphs

Notifications You must be signed in to change notification settings

ablab/graphamr

Repository files navigation

Antimicrobial Resistance Gene Detection from Metagenome Assembly Graphs.

Nextflow

install with bioconda

Introduction

ablab/graphamr is a bioinformatics best-practise analysis pipeline for recovery and identification of antibiotic resistance genes from fragmented metagenomic assemblies. The pipeline involves the alignment of profile hidden Markov models of target genes directly to the assembly graph of a metagenome with further dereplication and annotation of the results using state-of-the art tools. The pipeline supports reads or assembly graph as input. For reads the pipeline does quality control and assembles metagenome and builds graph.

The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker containers making installation trivial and results highly reproducible. The Nextflow DSL2 implementation of this pipeline uses one container per process which makes it much easier to maintain and update software dependencies.

Quick Start

  1. Install nextflow

  2. Install any of Conda for full pipeline reproducibility

  3. Download the pipeline and test it on a minimal dataset with a single command:

    nextflow run ablab/graphamr -profile test,<conda>
  4. Start running your own analysis!

    Typical command for analysis using reads:

    nextflow run ablab/graphamr -profile <conda> --reads '*_R{1,2}.fastq.gz' --hmm '*.HMM'

    Typical command for analysis using graph:

    nextflow run ablab/graphamr -profile <conda> --graph '*.gfa' --hmm '*.HMM'

    The pipeline has the ability to download the NCBI AMR HMMs or CARD databases.

    Typical command for analysis with downloaded database:

    nextflow run ablab/graphamr -profile <conda> --graph '*.gfa' --db ['ncbi_AMR_HMM', 'card_AA']

See usage docs for all of the available options when running the pipeline.

Pipeline Summary

Optionally, if raw reades are used:

  • Sequencing quality control (FastQC)
  • Assembly metagenome and building graph (metaSPAdes)

By default, the pipeline currently performs the following:

  • Aligning HMM profile to graph (Pathracer)
  • Detection and clustering ORFs (MMseqs2)
  • Annotation representative sequences (Abricate, RGI, sraX)
  • Summarizing results (hAMRonization)

Documentation

The nf-core/graphamr pipeline comes with documentation about the pipeline: usage and output.

Credits

graphamr was originally written by Daria Shafranskaya, Anton Korobeynikov.