Taxonomic binning of metagenome samples generated by next-generation sequencing technologies

  title={Taxonomic binning of metagenome samples generated by next-generation sequencing technologies},
  author={Johannes Dr{\"o}ge and Alice Mchardy},
  journal={Briefings in bioinformatics},
  volume={13 6},
Metagenome research uses random shotgun sequencing of microbial community DNA to study the genetic sequences of its members without cultivation. This development has been strongly supported by improvements in sequencing technologies, which have rendered sequencing cheaper than before. As a consequence, downstream computational analysis of metagenome sequence samples is now faced with large amounts of complex data. One of the essential steps in metagenome analysis is reconstruction of draft… 

Tables from this paper

Taxator-tk: precise taxonomic assignment of metagenomes by fast approximation of evolutionary neighborhoods

An algorithm and the accompanying software, taxator-tk, which performs taxonomic sequence assignment by fast approximate determination of evolutionary neighbors from sequence similarities, which is well suited for profiling microbial communities from metagenome samples.

Metagenomic approaches in microbial ecology: an update on whole-genome and marker gene sequencing analyses

The primary workflows and software used for both approaches are reviewed, including whole-genome shotgun sequencing and marker gene, and the current challenges in the field are discussed.

Taxator-tk: Fast and Precise Taxonomic Assignment of Metagenomes by Approximating Evolutionary Neighborhoods

An algorithm and the accompanying software, taxator-tk, which performs taxonomic sequence assignments by fast approximate determination of evolutionary neighbors from sequence similarities and was precise in its taxonomic assignment across all ranks and taxa for a range of evolutionary distances and for short sequences.

Sequencing genomes from mixed DNA samples - evaluating the metagenome skimming approach in lichenized fungi

The metagenome skimming approach, i.e. low coverage shotgun sequencing of multi-species assemblages and subsequent reconstruction of individual genomes, is increasingly used for in-depth genomic characterization of ecological communities and was able to reconstruct fungal genomes from uncultured lichen thalli and cover most of the gene space.

Reconstruction of Bacterial and Viral Genomes from Multiple Metagenomes

The Binning-Assembly approach has been proposed and demonstrated for the reconstruction of bacterial and viral genomes from 72 human gut metagenomic datasets and it was demonstrated to be useful in improving the draft assembly of a bacterial genome.

Metagenomic Profiling, Interaction of Genomics with Meta-genomics

Metagenomics is about the sequencing and characterization of genomic DNA of uncultured microbes sampled directly from their habitats. Next-generation sequencing (NGS) technologies and the ability of

Strain- and plasmid-level deconvolution of a synthetic metagenome by sequencing proximity ligation products

Hi-C sequencing data provide valuable information for metagenome analyses that are not currently obtainable by other methods, and it is demonstrated that Hi-C data provides a long-range signal of strain-specific genotypes, indicating such data may be useful for high-resolution genotyping of microbial populations.

Metagenome and Metatranscriptome Analyses Using Protein Family Profiles

A novel homology detection algorithm that integrates banded Viterbi algorithm for profile HMM parsing with an iterative simultaneous alignment and assembly computational framework that accurately estimates the abundances of the antimicrobial resistance (AMR) gene families and enables accurate characterization of the resistome profiles of these microbial communities.



Genovo: De Novo Assembly for Metagenomes

Genovo is presented, a novel de novo sequence assembler that discovers likely sequence reconstructions under the model and its reconstructions cover more bases and recover more genes than the other methods, even for low-abundance sequences, and yield a higher assembly score.

Accurate phylogenetic classification of variable-length DNA fragments

PhyloPythia is presented, a composition-based classifier that combines higher-level generic clades from a set of 340 completed genomes with sample-derived population models and allows the accurate classification of most sequence fragments across all considered taxonomic ranks, even for unknown organisms.

A Bioinformatician's Guide to Metagenomics

The chain of decisions accompanying a metagenomic project from the viewpoint of the bioinformatic analysis step by step is described, with recommendations for sampling and data generation including sample and metadata collection, community profiling, construction of shotgun libraries, and sequencing strategies.

Phymm and PhymmBL: Metagenomic Phylogenetic Classification with Interpolated Markov Models

Phymm, a classifier for metagenomic data, is presented that has been trained on 539 complete, curated genomes and can accurately classify reads as short as 100 base pairs, a substantial improvement over previous composition-based classification methods.

Taxonomic classification of metagenomic shotgun sequences with CARMA3

CARMA3 is presented, a new method for the taxonomic classification of assembled and unassembled metagenomic sequences that has been adapted to work with both BLAST and HMMER3 homology searches and it is shown that the method makes fewer wrong taxonomic predictions than other BLAST-based methods.

A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea

The results strongly support the need for systematic ‘phylogenomic’ efforts to compile a phylogeny-driven ‘Genomic Encyclopedia of Bacteria and Archaea’ in order to derive maximum knowledge from existing microbial genome data as well as from genome sequences to come.

Meta-IDBA: a de Novo assembler for metagenomic data

Comparison of the performances of Meta-IDBA and existing assemblers, such as Velvet and Abyss for different metagenomic datasets shows that Meta- IDBA can reconstruct longer contigs with similar accuracy.

Bambus 2: scaffolding metagenomes

A new scaffolder is presented, Bambus 2, to address some of the challenges encountered when analyzing metagenomes and demonstrates that the repeat detection algorithms have higher sensitivity than current approaches without sacrificing specificity.

The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes

The open-source metagenomics RAST service provides a new paradigm for the annotation and analysis of metagenomes that is stable, extensible, and freely available to all researchers.

Metagenomic Analyses: Past and Future Trends

The employment of next-generation sequencing techniques for metagenomics resulted in the generation of large sequence data sets derived from various environments, such as soil, the human body, and ocean water, which opened a window into the enormous taxonomic and functional diversity of environmental microbial communities.