IFAA: Robust Association Identification and Inference for Absolute Abundance in Microbiome Analyses

  title={IFAA: Robust Association Identification and Inference for Absolute Abundance in Microbiome Analyses},
  author={Z. Li and L. Tian and A. J. O’Malley and M. Karagas and A. Hoen and B. Christensen and J. Madan and Quran Wu and R. Gharaibeh and C. Jobin and Hongzhe Li},
  journal={arXiv: Applications},
The target of inference in microbiome analyses is usually relative abundance (RA) because RA in a sample (e.g., stool) can be considered as an approximation of RA in an entire ecosystem (e.g., gut). However, inference on RA suffers from the fact that RA are calculated by dividing absolute abundances (AA) over the common denominator (CD), the summation of all AA (i.e., library size). Because of that, perturbation in one taxon will result in a change in the CD and thus cause false changes in RA… Expand
2 Citations

Figures and Tables from this paper

Correlation and association analyses in microbiome study integrating multiomics in health and disease.
  • Yinglin Xia
  • Computer Science, Medicine
  • Progress in molecular biology and translational science
  • 2020
An overall view of longitudinal methods in analysis of microbiome and omics data, which cover standard, static, regression-based time series methods, principal trend analysis, and newly developed univariate overdispersed and zero-inflated as well as multivariate distance/kernel-based longitudinal models are provided. Expand
Infant Feeding Alters the Longitudinal Impact of Birth Mode on the Development of the Gut Microbiota in the First Year of Life
It is observed that breastfeeding modified the longitudinal impact of delivery mode on the taxonomic composition of the microbiota by 1 year of age, with an observed increase in abundance of Bacteroides fragilis and Lactobacillus with longer duration of breastfeeding among CD infants while there was an increase in Faecalibacterium for VD infants. Expand


Analysis of composition of microbiomes: a novel method for studying microbial composition
The performance of ANCOM is illustrated using two publicly available microbial datasets in the human gut, demonstrating its general applicability to testing hypotheses about compositional differences in microbial communities and accounting for compositionality using log-ratio analysis results in significantly improved inference in microbiota survey data. Expand
Waste Not, Want Not: Why Rarefying Microbiome Data Is Inadmissible
It is advocated that investigators avoid rarefying altogether and supported statistical theory is provided that simultaneously accounts for library size differences and biological variability using an appropriate mixture model. Expand
Normalization and microbial differential abundance strategies depend upon data characteristics
These findings guide which normalization and differential abundance techniques to use based on the data characteristics of a given study. Expand
Microbiome, Metagenomics, and High-Dimensional Compositional Data Analysis
Important issues include normalization and quantification of relative taxa, bacterial genes, and metabolic abundances; incorporation of phylogenetic information into analysis of metagenomics data; and multivariate analysis of high-dimensional compositional data. Expand
Zero-inflated generalized Dirichlet multinomial regression model for microbiome compositional data analysis.
A new probability distribution is developed, zero-inflated generalized Dirichlet multinomial (ZIGDM), that overcomes limitations in modeling multivariate taxon counts and develops a fast expectation-maximization algorithm to efficiently estimate parameters in the model. Expand
Conditional Regression Based on a Multivariate Zero-Inflated Logistic-Normal Model for Microbiome Relative Abundance Data
This work proposes a multivariate two-part zero-inflated logistic-normal model to analyze the association of disease risk factors with individual microbial taxa and overall microbial community composition and shows that this model outperforms existing methods. Expand
Batch effects correction for microbiome data with Dirichlet‐multinomial regression
A new method, Bayesian Dirichlet‐multinomial regression meta‐analysis (BDMMA), to simultaneously model the batch effects and detect the microbial taxa associated with phenotypes and is robust to the high dimensionality of the microbiome and their association sparsity. Expand
Strains, functions and dynamics in the expanded Human Microbiome Project
A second wave of data from the National Institutes of Health Human Microbiome Project is introduced, comprising 1,631 new metagenomes (2,355 total) targeting diverse body sites with multiple time points in 265 individuals to provide new characterizations of microbiome personalization. Expand
Correcting for batch effects in case-control microbiome studies
Overall, it is shown that percentile-normalization is a simple, non-parametric approach for correcting batch effects and improving sensitivity in case-control meta-analyses. Expand
MedZIM: Mediation analysis for Zero-Inflated Mediators with applications to microbiome data
It is shown that the mediation effect of the microbiome can be decomposed into two components that are inherent to the two-part nature of zero-inflated distributions, and this approach outperforms existing mediation analysis approaches. Expand