Analysing Microbial Community Composition through Amplicon Sequencing: From Sampling to Hypothesis Testing

@article{Hugerth2017AnalysingMC,
  title={Analysing Microbial Community Composition through Amplicon Sequencing: From Sampling to Hypothesis Testing},
  author={Luisa Warchavchik Hugerth and Anders F. Andersson},
  journal={Frontiers in Microbiology},
  year={2017},
  volume={8}
}
Microbial ecology as a scientific field is fundamentally driven by technological advance. The past decade's revolution in DNA sequencing cost and throughput has made it possible for most research groups to map microbial community composition in environments of interest. However, the computational and statistical methodology required to analyse this kind of data is often not part of the biologist training. In this review, we give a historical perspective on the use of sequencing data in… 

Figures from this paper

Enhancing diversity analysis by repeatedly rarefying next generation sequencing data describing microbial communities
TLDR
Repeated rarefying is proposed as a tool to normalize library sizes for diversity analyses to enable proportionate representation of all observed sequences and characterization of the random variation introduced to diversity analyses by rarefied to a smaller library size shared by all samples.
Microbial community analysis using high-throughput sequencing technology: a beginner’s guide for microbiologists
TLDR
This review presents microbiologists who are unfamiliar with in silico analyses with a basic and practical SMS data analysis protocol that covers all the bioinformatics processes of the SMS analysis in terms of data preprocessing, taxonomic profiling, functional annotation, and visualization.
DNA Metabarcoding for the Characterization of Terrestrial Microbiota—Pitfalls and Solutions
TLDR
The aim of this review is to provide state-of-the-art background knowledge needed to make appropriate decisions at each step of a DNA metabarcoding workflow, highlighting crucial steps that, if considered, ensures an accurate and standardized characterization of microbiota in environmental studies.
To rarefy or not to rarefy: Enhancing microbial community analysis through next-generation sequencing
TLDR
It is proposed that repeatedly rarefying enables characterization of the variation introduced to diversity analyses by this random subsampling and selection of smaller library sizes where necessary to incorporate all samples in the analysis.
To rarefy or not to rarefy: Enhancing diversity analysis of microbial communities through next-generation sequencing and rarefying repeatedly
TLDR
Repeated rarefying is proposed as a tool for diversity analyses to normalize library sizes to enable proportionate representation of all observed sequences and characterization of the random variation introduced to diversity analyses by rarefied to a smaller library size shared by all samples.
A Bioinformatics Guide to Plant Microbiome Analysis
TLDR
This work discusses sequencing and “omics” techniques with a focus on the requirements needed to adapt these methods to individual research approaches and highlights data integration and complementarity which should help to widen the understanding of the interactions among individual players of the plant holobiont in the future.
Best practices in metabarcoding of fungi: From experimental design to results
TLDR
It is demonstrated that operational taxonomic units (OTUs) outperform amplified sequence variants (ASVs) in recovering fungal diversity and specific methods for compositional data analyses provide more reliable estimates of shifts in community structure.
Acid Mine Drainage as Habitats for Distinct Microbiomes: Current Knowledge in the Era of Molecular and Omic Technologies
TLDR
A critical synopsis of the current applications of advanced molecular techniques in probing microbial and functional diversity in AMD environments is presented.
Sequencing Depth Has a Stronger Effect than DNA Extraction on Soil Bacterial Richness Discovery
TLDR
An optimisation of the soil metagenomics workflow is needed in order to sequence at a sufficient and equal depth to improve the accuracy of metagenomic comparisons and soil microbiome profiles.
...
...

References

SHOWING 1-10 OF 224 REFERENCES
Which sequencing depth is sufficient to describe patterns in bacterial α- and β-diversity?
TLDR
Diversity estimates varied with sequencing depth, yet, trends in diversity among samples were less sensitive; it was found that 1000 denoised sequences per sample explained to 90% the trends in β-diversity (Bray-Curtis index) among samples observed for 15’000-20 000 sequences.
Fungal community analysis by high-throughput sequencing of amplified markers – a user's guide
TLDR
Taking advantage of the collective experience of six research groups, the different stages involved in fungal community analysis are reviewed, from field sampling via laboratory procedures to bioinformatics and data interpretation, and potential pitfalls, alternatives, and solutions are discussed.
phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data
TLDR
The phyloseq project for R is a new open-source software package dedicated to the object-oriented representation and analysis of microbiome census data in R, which supports importing data from a variety of common formats, as well as many analysis techniques.
Exploring the composition and diversity of microbial communities at the Jan Mayen hydrothermal vent field using RNA and DNA.
TLDR
How different methods can provide complementary ecological insights is illustrated by the composition and diversity of microbial communities from the recently discovered Jan Mayen vent field, located on Mohn's Ridge in the Norwegian-Greenland Sea.
Inferring Correlation Networks from Genomic Survey Data
TLDR
It is shown that community diversity is the key factor that modulates the acuteness of such compositional effects, and a new approach is developed, called SparCC, which is capable of estimating correlation values from compositional data.
Application of multivariate statistical techniques in microbial ecology
TLDR
This review describes and compares the most widely used multivariate statistical techniques including exploratory, interpretive and discriminatory procedures, and presents examples of how these approaches have been utilized in recent studies to provide insight into the ecology of the microbial world.
Tax4Fun: predicting functional profiles from metagenomic 16S rRNA data
TLDR
The results indicate that Tax4Fun provides a good approximation to functional profiles obtained from metagenomic shotgun sequencing approaches, and is a software package that predicts the functional capabilities of microbial communities based on 16S rRNA datasets.
Multiple marker parallel tag environmental DNA sequencing reveals a highly complex eukaryotic community in marine anoxic water
TLDR
It is suggested that current understanding of the ecological complexity of protist communities, genetic diversity, and global species richness are severely limited by the sequence data hitherto available, and long‐tailed rank abundance curves suggest that the 454 sequencing approach provides improved access to rare genotypes.
Microbial community resemblance methods differ in their ability to detect biologically relevant patterns
TLDR
It is found that many diversity patterns were evident with severely undersampled communities and that methods varied widely in their ability to detect gradients and clusters, including Chi-squared distances and Pearson correlation distances, whereas Gower and Canberra distances performed especially well for detecting clusters.
Reagent and laboratory contamination can critically impact sequence-based microbiome analyses
TLDR
It is demonstrated that contaminating DNA is ubiquitous in commonly used DNA extraction kits and other laboratory reagents, varies greatly in composition between different kits and kit batches, and that this contamination critically impacts results obtained from samples containing a low microbial biomass.
...
...