Circular binary segmentation for the analysis of array-based DNA copy number data.

  title={Circular binary segmentation for the analysis of array-based DNA copy number data.},
  author={Adam B. Olshen and E. S. Venkatraman and Robert Lucito and Michael Wigler},
  volume={5 4},
DNA sequence copy number is the number of copies of DNA at a region of a genome. Cancer progression often involves alterations in DNA copy number. Newly developed microarray technologies enable simultaneous measurement of copy number at thousands of sites in a genome. We have developed a modification of binary segmentation, which we call circular binary segmentation, to translate noisy intensity measurements into regions of equal copy number. The method is evaluated by simulation and is… 

Figures and Tables from this paper

Genome analysis Detection of DNA copy number alterations using penalized least squares regression

This paper incorporates the spatial dependence of DNA copy number data into a regression model and formalizes the detection of DNAcopy number alterations as a penalized least squares regression problem.

A Linear-Time Algorithm for Analyzing Array CGH Data Using Log Ratio Triangulation

This work proposes a linear-time algorithm to identify chromosomal change points using array comparative genomic hybridization (aCGH) data that treats log-2 ratio values as points in a triangle and segments the genome into regions of equal copy number by exploiting the properties of log- 2 ratio values often seen at segment boundaries.

A Novel Algorithm For the Analysis Of Array Cgh Data

A novel algorithm is introduced to automatically identify the regions of DNA copy number gain and loss from array CGH data through a multi-scale edge detection algorithm and shows that this method can be successfully applied for the analysis of array C GH biological data.

A Novel Approach to DNA Copy Number Data Segmentation

A novel model-based method using the minimum description length (MDL) principle for DCN data segmentation that can output underlying DCN for each chromosomal segment, and at the same time, infer the underlying tumor proportion in the test samples.


There is a need for algorithms that can identify gains and losses in the number of copies based on statistical considerations, rather than merely detect trends in the data, and the proposed study introduces three different approaches; Circular binary segmentation, Bayesian approach, and circle-based comparative genomic hybridization.

Reconstructing DNA copy number by joint segmentation of multiple sequences

A segmentation method named generalized fused lasso (GFL) to reconstruct copy number variant regions is presented, based on penalized estimation and is capable of processing multiple signals jointly, and is particularly useful in the initial screening stages of large data sets.

Computational Analysis of Genome-Wide DNA Copy Number Changes

Two definitions of chromosome instability (CIN) index are introduced to mathematically and quantitatively characterize genome-wide genomic instability and are expected to be predictors in tumors subtype diagnosis and to be a useful tool in cancer research.

A hierarchical clustering method for estimating copy number variation.

A marker clustering method which consists of a "circular clustering tree test statistic" attaches a statistic to each marker that measures the likelihood that it is a change point, and provides a new way to build up a binary tree that can accurately capture change-point signals and is easy to perform.

DNA Copy Number Profiling in Normal and Tumor Genomes

This chapter reviews the computational and statistical problems that arise in DNA copy number data and surveys recent advances in their treatment.

Preprocessing and downstream analysis of microarray DNA copy number profiles

The focus narrows to applications of DNA copy number in cancer, in particular, several downstream analyses of multi-sample data sets such as testing, clustering and classification.

Genome-wide analysis of DNA copy-number changes using cDNA microarrays

This work describes a cDNA microarray-based CGH method, and its application to DNA copy-number variation analysis in breast cancer cell lines and tumours, and identifies gene amplifications and deletions genome-wide and with high resolution.

Detecting gene copy number fluctuations in tumor cells by microarray analysis of genomic representations.

It is demonstrated that arrays of DNA probes derived from low-complexity representations can be used to detect amplifications, deletions, and polymorphic differences when hybridized to representations of genomic DNA.

High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays

The implementation of array CGH is demonstrated to be able to measure copy number with high precision in the human genome, and to analyse clinical specimens by obtaining new information on chromosome 20 aberrations in breast cancer.

Assembly of microarrays for genome-wide measurement of DNA copy number

The arrays provide precise measurement in cell lines and clinical material, so that they can reliably detect and quantify high-level amplifications and single-copy alterations in diploid, polyploid and heterogeneous backgrounds.

Representational oligonucleotide microarray analysis: a high-resolution method to detect genome copy number variation.

ROMA (representational oligonucleotide microarray analysis) will assist in the discovery of genes and markers important in cancer, and theiscovery of loci that may be important in inherited predisposition to disease.

Microarray analysis reveals a major direct role of DNA copy number alteration in the transcriptional program of human breast tumors

Evidence is provided that widespread DNA copy number alteration can lead directly to global deregulation of gene expression, which may contribute to the development or progression of cancer.

Comparative genomic hybridization for molecular cytogenetic analysis of solid tumors.

Comparative genomic hybridization produces a map of DNA sequence copy number as a function of chromosomal location throughout the entire genome, which identified 16 different regions of amplification, many in loci not previously known to be amplified.

Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation.

This article proposes normalization methods that are based on robust local regression and account for intensity and spatial dependence in dye biases for different types of cDNA microarray experiments.

Determination of amplicon boundaries at 20q13.2 in tissue samples of human gastric adenocarcinomas by high‐resolution microarray comparative genomic hybridization

An amplicon at 20q13.2 has been narrowed down to 800 kb which is likely to harbour one or more putative oncogenes relevant to gastric carcinogenesis, for which ZNF217 and CYP24 are good candidates.

Genome scanning with array CGH delineates regional alterations in mouse islet carcinomas

Compar genomic hybridization to DNA microarrays (array CGH) overcomes limitations by allowing efficient, genome-wide analyses of relative genome copy number and suggests that CYP24, PFDN4, STMN1, CDKN1B, PPP2R3 and FSTL1 are candidate oncogenes or tumor-suppressor genes.