Parallel-META 3: Comprehensive taxonomical and functional analysis platform for efficient comparison of microbial communities

@article{Jing2017ParallelMETA3C,
  title={Parallel-META 3: Comprehensive taxonomical and functional analysis platform for efficient comparison of microbial communities},
  author={Gongchao Jing and Zheng Sun and Honglei Wang and Yanhai Gong and Shi Huang and Kang Ning and Jian Xu and Xiaoquan Su},
  journal={Scientific Reports},
  year={2017},
  volume={7}
}
The number of metagenomes is increasing rapidly. However, current methods for metagenomic analysis are limited by their capability for in-depth data mining among a large number of microbiome each of which carries a complex community structure. Moreover, the complexity of configuring and operating computational pipeline also hinders efficient data processing for the end users. In this work we introduce Parallel-META 3, a comprehensive and fully automatic computational toolkit for rapid data… 
MetaGenomic analysis of short and long reads
TLDR
Evaluation of MetaG’s performance showed that it makes nearly perfect classifications of viral isolates using simulated short and long reads, and outperformed current state-of-the-art algorithms on data from targeted sequencing of the 16S and 28S rRNA genes.
Analysis of sequencing strategies and tools for taxonomic annotation: Defining standards for progressive metagenomics
TLDR
An impartial comparison is performed, using statistical measures of classification for eight bioinformatic tools and four taxonomic databases, defining a benchmark framework to evaluate each tool in a standardized context and stressing out the importance of gold standards, database curation and manual inspection of taxonomic profiling results, for a better and more accurate microbial diversity description.
Reply to Sun et al., “Identifying Composition Novelty in Microbiome Studies: Improvement of Prediction Accuracy”
TLDR
To quantitatively measure the beta diversities between microbiomes, Microbiome Search Engine (MSE) calculates phylogeny similarity using operational taxonomy unit (OTU) profiles and the speed is optimized by nonrecursive computing to enable real-time responses.
Meta-Apo improves accuracy of 16S-amplicon-based prediction of microbiome function
TLDR
Meta-Apo can produce a reliable, high-resolution view of microbiome function equivalent to that offered by shotgun WGS, and suggests that large-scale, function-oriented microbiome sequencing projects can probably benefit from the lower cost of 16S-amplicon strategy, without sacrificing the precision in functional reconstruction that otherwise requires WGS.
Meta-network: optimized species-species network analysis for microbial communities
TLDR
The Meta-Network framework is proposed, which has consistently constructed more meaningful networks with biologically important clusters, hubs, and provides a general approach towards deciphering the species-species co-occurrence networks.
Marine biofilms constitute a bank of hidden microbial diversity and functional potential
TLDR
This work analyzes 2.5 terabases of newly sequenced datasets and the Tara Oceans metagenomes to study biofilm-forming marine microbes, increasing the known microbial diversity in the oceans by more than 20% and revealing new biosynthetic gene clusters and CRISPR-Cas systems.
Fueling ab initio folding with marine metagenomics enables structure and function predictions of new protein families
TLDR
These results demonstrate a new avenue to improve the capacity of protein structure and function modeling through marine metagenomics, especially for difficult proteins with few homologous sequences.
Correlation-Centric Network (CCN) representation for microbial co-occurrence patterns: new insights for microbial ecology
TLDR
Investigating the Correlation-Centric Network (CCN) and their corresponding SCNs on two large cohorts of microbiome showed that CCNs not only retained the characteristics of SCNs, but also contained information that cannot be detected by SCNs.
...
...

References

SHOWING 1-10 OF 26 REFERENCES
Parallel-META 2.0: Enhanced Metagenomic Data Analysis with Functional Annotation, High Performance Computing and Advanced Visualization
TLDR
Parallel-META 2.0 is an extended and improved version of parallel computing software package, which enhances the taxonomical analysis using multiple databases, improves computation efficiency by optimized parallel computing, and supports interactive visualization of results in multiple views and enables functional analysis for metagenomic samples.
Parallel-META: efficient metagenomic data analysis based on high-performance computation
TLDR
This paper proposed Parallel-META, a GPU- and multi-core-CPU-based open-source pipeline for metagenomic data analysis, which enabled the efficient and parallel analysis of multiple meetagenomic datasets and the visualization of the results for multiple samples.
Computational meta'omics for microbial community studies
TLDR
The technological and computational meta'omics approaches that are already available, those that are under active development, their success in biological discovery, and several outstanding challenges are reviewed.
Meta-Storms: efficient search for similar microbial communities based on a novel indexing scheme and similarity score for metagenomic data
TLDR
The experimental results show that Meta-Storms is capable of database creation and effective searching for a large number of metagenomic samples, and it could achieve similar accuracies compared with the current popular significance testing-based methods.
A Guide to Enterotypes across the Human Body: Meta-Analysis of Microbial Community Structures in Human Microbiome Datasets
TLDR
This work tested how the following factors influenced the detection of enterotypes: clustering methodology, distance metrics, OTU-picking approaches, sequencing depth, data type (whole genome shotgun (WGS) vs.16S rRNA gene sequence data), and 16S r RNA region.
SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB
TLDR
SILVA (from Latin silva, forest), was implemented to provide a central comprehensive web resource for up to date, quality controlled databases of aligned rRNA sequences from the Bacteria, Archaea and Eukarya domains.
Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities
TLDR
M mothur is used as a case study to trim, screen, and align sequences; calculate distances; assign sequences to operational taxonomic units; and describe the α and β diversity of eight marine samples previously characterized by pyrosequencing of 16S rRNA gene fragments.
Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences
TLDR
The results demonstrate that phylogeny and function are sufficiently linked that this 'predictive metagenomic' approach should provide useful insights into the thousands of uncultivated microbial communities for which only marker gene surveys are currently available.
KEGG for integration and interpretation of large-scale molecular data sets
TLDR
KEGG Mapper, a collection of tools for KEGG PATHWAY, BRITE and MODULE mapping, enabling integration and interpretation of large-scale data sets and recent enhancements to the K EGG content, especially the incorporation of disease and drug information used in practice and in society, to support translational bioinformatics.
IMG: the integrated microbial genomes database and comparative analysis system
TLDR
The Integrated Microbial Genomes system serves as a community resource for comparative analysis of publicly available genomes in a comprehensive integrated context and provides tools and viewers for analyzing and reviewing the annotations of genes and genomes inA comparative context.
...
...