• Corpus ID: 244463089

Population based change-point detection for the identification of homozygosity islands

  title={Population based change-point detection for the identification of homozygosity islands},
  author={Lucas Henrique Figueiredo Prates and Renan Barbosa Lemes and T{\'a}bita Hunemeier and Florencia G. Leonardi},
In this paper, we propose a new method for offline change-point detection on some parameters of the distribution of a random vector. We introduce a penalized maximum likelihood approach that can be efficiently computed by a dynamic programming algorithm or approximated by a fast greedy binary splitting algorithm. We prove both algorithms converge almost surely to the set of change-points under very general assumptions on the distribution and independent sampling of the random vector. In… 

Figures from this paper



A model selection approach for multiple sequence segmentation and dimensionality reduction

Multiple Change-point Detection: a Selective Overview

This article provides an in-depth discussion on a normal mean change-point model from aspects of regression analysis, hypothesis testing, consistency and inference, and presents a strategy to gather and aggregate local information for change- point detection that has become the cornerstone of several emerging methods.

Inbreeding estimates in human populations: Applying new approaches to an admixed Brazilian isolate

The results suggest that the amount of ROH <2Mb of admixed populations should be somehow proportional to the genetic contribution from each parental population.

A Nonparametric Approach for Multiple Change Point Analysis of Multivariate Data

The divisive method is shown to provide consistent estimates of both the number and the location of change points under standard regularity assumptions, and methods from cluster analysis are applied to assess performance and to allow simple comparisons of location estimates, even when the estimated number differs.

Optimal detection of changepoints with a linear computational cost

This work considers the problem of detecting multiple changepoints in large data sets and introduces a new method for finding the minimum of such cost functions and hence the optimal number and location of changepoints that has a computational cost which is linear in the number of observations.

Multiscale change point inference

A new estimator, the simultaneous multiscale change point estimator SMUCE, is introduced, which achieves the optimal detection rate of vanishing signals as n→∞, even for an unbounded number of change points.

Runs of homozygosity in European populations.

Independent block identification in multivariate time series

This work proposes a model selection criterion to estimate the points of independence of a random vector, producing a decomposition of the vector distribution function into independent blocks, and proves the consistency of the approach under general conditions on the estimator of the distribution function.