Skip to content

contigs_taxonomic_classification

Description

Getting per contig taxonomic annotations for metagenomic assembly based on taxonomic classification of predicted proteins with DIAMOND and CAT.

Installation

ebi-metagenomics/contigs_taxonomic_classification

nf-core modules -g https://www.github.com/ebi-metagenomics/nf-modules install contigs_taxonomic_classification

Keywords: taxonomic classification, catpack, protein classification, metagenomics

Components

This subworkflow uses the following components:

Input

Name Type Description Pattern
contigs file The input channel containing file with metagenomic assembly Structure: [ val(meta), path(fasta) ] *.{fasta, fasta.gz, fa, fa.gz, fna, fna.gz}
proteins file The input channel containing proteins predicted for the given metagenome CAT assumes that predicted proteins have ids in format _ Structure: [ val(meta), path(fasta) ] *.{fasta, fasta.gz, fa, fa.gz, faa, faa.gz}
cat_db path Directory containing the CAT database produced with catpack/prepare The folder must contain DIAMOND db .dmnd, .fastaid2LCAtaxid, *.taxids_with_multiple_offspring Structure: [ val(meta), path(db_folder) ] -
cat_taxonomy_db path Directory containing the tax/ folder of the CAT database produced with catpack/prepare The folder must contain names.dmp and nodes.dmp Structure: [ val(meta), path(tax_db_folder) ] -

Output

Name Type Description Pattern
diamond_output file Channel containing a TSV file with matches of predicted proteins to a database Structure: [ val(meta), path(file) ] *.{txt, txt.gz}
cat_output file Channel containing a TSV file with taxonomic classification of contigs Structure: [ val(meta), path(file) ] *.contig2classification.txt
krona_html file Channel containing a HTML file with Krona plot of taxonomic classification of contigs Structure: [ val(meta), path(file) ] *.html
versions file File containing software versions Structure: [ path(versions.yml) ] versions.yml

People

Authors