ZERO-INFLATED NEGATIVE BINOMIAL REGRESSION FOR DIFFERENTIAL ABUNDANCE TESTING IN MICROBIOME STUDIES

@inproceedings{Zhang2016ZEROINFLATEDNB,
  title={ZERO-INFLATED NEGATIVE BINOMIAL REGRESSION FOR DIFFERENTIAL ABUNDANCE TESTING IN MICROBIOME STUDIES},
  author={Xinyan Zhang and Himel Mallick and N. Yi},
  year={2016}
}
Motivation: The human microbiome plays an important role in human health and disease. The composition of the human microbiome is influenced by multiple factors and understanding these factors is critical to elucidate the role of the microbiome in health and disease and for development of new diagnostics or therapeutic targets based on the microbiome. 16S ribosomal RNA (rRNA) gene targeted amplicon sequencing is a commonly used approach to determine the taxonomic composition of the bacterial… Expand

Figures from this paper

An omnibus test for differential distribution analysis of microbiome sequencing data
TLDR
A novel test for differential distribution analysis of microbiome sequencing data by jointly testing the abundance, prevalence and dispersion is introduced, built on a zero-inflated negative binomial regression model and winsorized count data to account for zero- inflation and outliers. Expand
Bayesian Modeling of Microbiome Data for Differential Abundance Analysis.
The advances of next-generation sequencing technology have accelerated study of the microbiome and stimulated the high throughput profiling of metagenomes. The large volume of sequenced data hasExpand
An Adaptive Multivariate Two-Sample Test With Application to Microbiome Differential Abundance Analysis
TLDR
A novel Adaptive multivariate two-sample test for Microbiome Differential Analysis (AMDA) to examine whether the composition of a taxa-set are different between two conditions and demonstrated that the AMDA test was often more powerful than several competing methods while preserving the correct type I error rate. Expand
mbImpute: an accurate and robust imputation method for microbiome data
TLDR
This work proposes the first imputation method, mbImpute, to identify and recover likely non-biological zeros by borrowing information jointly from similar samples, similar taxa, and optional metadata including sample covariates and taxon phylogeny. Expand
Powerful and robust non-parametric association testing for microbiome data via a zero-inflated quantile approach (ZINQ)
Background Identification of bacterial taxa associated with diseases, exposures, and other variables of interest offers a more comprehensive understanding of the role of microbes in many conditions.Expand
NBZIMM: negative binomial and zero-inflated mixed models, with application to microbiome/metagenomics data analysis
TLDR
The main functions are built on top of the commonly used R packages nlme and MASS, allowing us to incorporate the well-developed analytic procedures into the framework for analyzing over-dispersed and zero-inflated count or proportion data with multilevel structures. Expand
Correlation and association analyses in microbiome study integrating multiomics in health and disease.
  • Yinglin Xia
  • Computer Science, Medicine
  • Progress in molecular biology and translational science
  • 2020
TLDR
An overall view of longitudinal methods in analysis of microbiome and omics data, which cover standard, static, regression-based time series methods, principal trend analysis, and newly developed univariate overdispersed and zero-inflated as well as multivariate distance/kernel-based longitudinal models are provided. Expand
Statistical methods for the analysis of microbiome compositional data in HIV studies
TLDR
This thesis is focused on the study of the link between the gut microbiome and HIV infection and understanding how gut microbiome alterations after HIV infection are related to immune dysregulation is of importance. Expand
Statistical analysis of microbiome data: The challenge of sparsity
TLDR
The unique feature and complexity of 16S ribosomal RNA gene sequence data, especially the sparsity of the data, present challenges to statistical analysis and interpretation and proper normalization is critical in ensuring the validity of downstream analysis. Expand
Identifying Gut Microbiota Associated With Colorectal Cancer Using a Zero-Inflated Lognormal Model
TLDR
The view that individual microbes as well as the overall structure of gut microbiota were co-evolving with CRC was strengthened, as the microbial diversity of healthy controls is significantly higher than that of the CRC patients, which indicated a significant negative correlation between gut microbiota diversity and the stage of CRC. Expand
...
1
2
...

References

SHOWING 1-10 OF 49 REFERENCES
Waste Not, Want Not: Why Rarefying Microbiome Data Is Inadmissible
TLDR
It is advocated that investigators avoid rarefying altogether and supported statistical theory is provided that simultaneously accounts for library size differences and biological variability using an appropriate mixture model. Expand
Data mining the human gut microbiota for therapeutic targets
TLDR
How computational methods could lead to a systems-level understanding of the global physiology of the host-microbiota superorganism in health and disease will provide a platform for the identification and development of new therapeutic strategies for chronic diseases possibly involving microbial as well as human-host targets that improve upon existing probiotics, prebiotics or antibiotics. Expand
Zero-Inflated Beta Regression for Differential Abundance Analysis with Metagenomics Data
TLDR
A zero-inflated beta regression approach (ZIBSeq) for identifying differentially abundant features between multiple clinical conditions and demonstrates better performance with large AUC values for most simulation studies. Expand
A core gut microbiome in obese and lean twins
TLDR
The faecal microbial communities of adult female monozygotic and dizygotic twin pairs concordant for leanness or obesity, and their mothers are characterized to address how host genotype, environmental exposure and host adiposity influence the gut microbiome. Expand
A robust approach for identifying differentially abundant features in metagenomic samples
TLDR
A novel method, Ratio Approach for Identifying Differential Abundance (RAIDA), which utilizes the ratio between features in a modified zero-inflated lognormal model and greatly surpasses other existing methods. Expand
Unravelling the effects of the environment and host genotype on the gut microbiome
TLDR
How host genetics and the environment shape the microbiota, and how these three factors may interact in the context of chronic disease are described. Expand
EM Adaptive LASSO—A Multilocus Modeling Strategy for Detecting SNPs Associated with Zero-inflated Count Phenotypes
TLDR
This article investigated the performance of several state-of-the-art approaches for handling zero-inflated count data along with a novel penalized regression approach with an adaptive LASSO penalty by simulating data under a variety of disease models and linkage disequilibrium patterns. Expand
Application of Two-Part Statistics for Comparison of Sequence Variant Counts
TLDR
The objective of this paper is to examine the application of a two-part statistic to identify taxa that differ between two groups, and shows superior performance of the two- part statistic for analysis of sequence data. Expand
Structural segregation of gut microbiota between colorectal cancer patients and healthy volunteers
TLDR
Reduction of butyrate-producing bacteria in the gut microbiota of CRC patients and increase of opportunistic pathogens may constitute a major structural imbalance of gut microbiota in CRC patients. Expand
The human microbiome: at the interface of health and disease
TLDR
The large-scale dynamics of the microbiome can be described by many of the tools and observations used in the study of population ecology, andiphering the metagenome and its aggregate genetic information can also be used to understand the functional properties of the microbial community. Expand
...
1
2
3
4
5
...