• Corpus ID: 55840950

Bayesian Modeling and MCMC Computation in Linear Logistic Regression for Presence-only Data

  title={Bayesian Modeling and MCMC Computation in Linear Logistic Regression for Presence-only Data},
  author={Fabio Divino and Natalia Golini and Giovanna Jona Lasinio and Antti Penttinen},
  journal={arXiv: Computation},
Presence-only data are referred to situations in which, given a censoring mechanism, a binary response can be observed only with respect to on outcome, usually called \textit{presence}. In this work we present a Bayesian approach to the problem of presence-only data based on a two levels scheme. A probability law and a case-control design are combined to handle the double source of uncertainty: one due to the censoring and one due to the sampling. We propose a new formalization for the logistic… 

Figures and Tables from this paper

Predicting the Geographical Distribution of Two Invasive Termite Species from Occurrence Data
The Bayesian logistic approach outperformed Maxent consistently in all models according to evaluation criteria such as model sensitivity and ecological realism.
A Probabilistic Rating System for Team Competitions with Individual Contributions
This work begins with a simplistic naive Bayes approach which is this case reduces to logistic regression and then develops it into a more complex model with latent variables trained by expectation–maximization.


Case-control studies with contaminated controls☆
A Bayesian Model for Presence-Only Semicontinuous Data, With Application to Prediction of Abundance of Taxus Baccata in Two Italian Regions
In studies about the potential distribution of ecological niches, only the presence of the species of interest is usually recorded. Pseudo-absences are sampled from the study area in order to avoid
Poisson point process models solve the “pseudo-absence problem” for presence-only data in ecology
It is shown that as the number of pseudo-absences increases (in a regular or uni-form random arrangement), logistic regression slope parameters and standard errors converge to those of the corresponding Poisson point process model.
Presence-only data and the em algorithm.
An expectation-maximization algorithm is proposed to estimate the underlying presence-absence logistic model for presence-only data and it is shown that the population prevalence of a species is only identifiable when there is some unrealistic constraint on the structure of theLogistic model.
Parameter Expansion for Data Augmentation
A parameter expanded data augmentation (PX-DA) algorithm is rigorously defined and a new theory for iterative conditional sampling under the tra… to understand the role of the expansion parameter.
Modelling distribution and abundance with presence‐only data
A synthesis and critique of statistical methods currently in use to both estimate and evaluate these models are provided, and the critical importance of study design in models where only presence can be identified is discussed.
Summary Which of the two model-based inference arguments in survey sampling is the “correct” one? The answer seems simple once the goal of the estimation procedure has been clearly defined, but
Data Augmentation Approach in Bayesian Modelling of Presence-only Data
Predicting the Geographic Distribution of a Species from Presence‐Only Data Subject to Detection Errors
Analytical results ofalyses of presence-only data of three avian species observed in a survey of landbirds in western Montana and northern Idaho are compared with site-occupancy analyses of detections and nondetections of these species.
Point pattern modelling for degraded presence‐only data over large regions
A hierarchical model is used to treat the presence data as a realization of a spatial point process, whose intensity is governed by the set of environmental covariates and the opportunity to infer about biodiversity through species richness, i.e. the number of distinct species in an areal unit is investigated.