nf-core/mag
Assembly and binning of metagenomes
1.2.0
). The latest
stable release is
4.0.0
.
Define where the pipeline should find input data and save output data.
Input FastQ files or TSV file.
string
Paths to input FastQ files for tests.
string
Specifies that the input is single-end reads.
boolean
The output directory where the results will be saved.
string
./results
Email address for completion summary.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Options for the reference genome indices used to align reads.
Directory / URL base for iGenomes references.
string
s3://ngi-igenomes/igenomes/
Do not load the iGenomes reference config.
boolean
Less common options for the pipeline, typically set in a config file.
Display help text.
boolean
Method used to save pipeline results to output directory.
string
Workflow name.
string
Email address for completion summary, only when pipeline fails.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Send plain-text email instead of HTML.
boolean
File size limit when attaching MultiQC reports to summary emails.
string
25.MB
Do not use coloured log outputs.
boolean
Custom config file to supply to MultiQC.
string
Directory to keep pipeline Nextflow logs and reports.
string
${params.outdir}/pipeline_info
Set the top limit for requested resources for any single job.
Maximum number of CPUs that can be requested for any single job.
integer
16
Maximum amount of memory that can be requested for any single job.
string
128.GB
Maximum amount of time that can be requested for any single job.
string
240.h
Parameters used to describe centralised config profiles. These should not be edited.
Git commit id for Institutional configs.
string
master
Base directory for Institutional configs.
string
https://raw.githubusercontent.com/nf-core/configs/master
Institutional configs hostname.
string
Institutional config description.
string
Institutional config contact information.
string
Institutional config URL link.
string
Use these parameters to also enable reproducible results from the individual assembly and binning tools .
Fix number of CPUs for MEGAHIT to 1. Not increased with retries.
boolean
Fix number of CPUs used by SPAdes. Not increased with retries.
integer
Fix number of CPUs used by SPAdes hybrid. Not increased with retries.
integer
RNG seed for MetaBAT2.
integer
1
Sequence of 3’ adapter to remove in the forward reads.
string
AGATCGGAAGAGCACACGTCTGAACTCCAGTCA
Sequence of 3’ adapter to remove in the reverse reads.
string
AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT
Mean qualified quality value for keeping read.
integer
15
Trimming quality value for the sliding window.
integer
15
Name of iGenomes reference for host contamination removal.
string
Fasta reference file for host contamination removal.
string
Use the --very-sensitive
instead of the--sensitive
setting for Bowtie 2 to map reads against the host genome.
boolean
Save the read IDs of removed host reads.
boolean
Keep reads similar to the Illumina internal standard PhiX genome.
boolean
Genome reference used to remove Illumina PhiX contaminant reads.
string
${baseDir}/assets/data/GCA_002596845.1_ASM259684v1_genomic.fna.gz
Skip removing adapter sequences from long reads.
boolean
Discard any read which is shorter than this value.
integer
1000
Keep this percent of bases.
integer
90
The higher the more important is read length when choosing the best reads.
integer
10
Keep reads similar to the ONT internal standard Escherichia virus Lambda genome.
boolean
Genome reference used to remove ONT Lambda contaminant reads.
string
${baseDir}/assets/data/GCA_000840245.1_ViralProj14204_genomic.fna.gz
Taxonomic classification is disabled by default. You have to specify one of the options below to activate it.
Database for taxonomic binning with centrifuge.
string
Database for taxonomic binning with kraken2.
string
Skip creating a krona plot for taxonomic binning.
boolean
Database for taxonomic classification of metagenome assembled genomes.
string
Co-assemble samples within one group, instead of assembling each sample separately.
boolean
Additional custom options for SPAdes.
string
""
Additional custom options for MEGAHIT.
string
""
Skip Illumina-only SPAdes assembly.
boolean
Skip SPAdes hybrid assembly.
boolean
Skip MEGAHIT assembly.
boolean
Skip metaQUAST.
boolean
Defines mapping strategy to compute co-abundances for binning, i.e. which samples will be mapped against the assembly.
string
group
Skip metagenome binning.
boolean
Minimum contig size to be considered for binning and for bin quality check.
integer
1500
Minimal length of contigs that are not part of any bin but treated as individual genome.
integer
1000000
Maximal number of contigs that are not part of any bin but treated as individual genome.
integer
100
Disable bin QC with BUSCO.
boolean
Download path for BUSCO database.
string
https://busco-data.ezlab.org/v4/data/lineages/bacteria_odb10.2020-03-06.tar.gz
Save BUSCO reference.
boolean
string