Quantifying tumor heterogeneity in whole-genome and whole-exome sequencing data

  title={Quantifying tumor heterogeneity in whole-genome and whole-exome sequencing data},
  author={Layla Oesper and Gryte Satas and Benjamin J. Raphael},
  volume={30 24},
MOTIVATION Most tumor samples are a heterogeneous mixture of cells, including admixture by normal (non-cancerous) cells and subpopulations of cancerous cells with different complements of somatic aberrations. This intra-tumor heterogeneity complicates the analysis of somatic aberrations in DNA sequencing data from tumor samples. RESULTS We describe an algorithm called THetA2 that infers the composition of a tumor sample-including not only tumor purity but also the number and content of tumor… 

Figures and Tables from this paper

Accurate quantification of copy-number aberrations and whole-genome duplications in multi-sample tumor sequencing data

A new algorithm, Holistic Allele-specific Tumor Copy-number Heterogeneity (HATCHet), that infers allele and clone-specific CNAs and WGDs jointly across multiple tumor samples from the same patient, and that leverages the relationships between clones in these samples.

Clonality Inference from Single Tumor Samples Using Low Coverage Sequence Data

CTPsingle is shown to be able to infer the purity and the clonality of single-sample tumors with high accuracy even restricted to a coverage depth of \(\sim \)30x.

Clonality Inference from Single Tumor Samples Using Low-Coverage Sequence Data

CTPsingle is shown to be able to infer the purity and the clonality of single-sample tumors with high accuracy, even restricted to a coverage depth of ∼30 × .

Quantification of Multiple Tumor Clones Using Gene Array and Sequencing Data.

This article proposes a general Bayesian method for estimating copy number aberrations when there are normal cells and potentially more than one tumor clones, which is flexible and can work for both SNP array and next-generation sequencing data.

CLImAT-HET: detecting subclonal copy number alterations and loss of heterozygosity in heterogeneous tumor samples from whole-genome sequencing data

CLImAT-HET, a novel algorithm is introduced to infer CNA/LOH segments from heterogeneous tumor samples by taking into account the intra-tumor heterogeneity issue, in the case where a reference or matched normal sample is absent.

Computational Methods for Characterizing Cancer Mutational Heterogeneity

Computational methods that use somatic alterations measured through next-generation DNA sequencing technologies for characterizing tumor heterogeneity and its association with clinical variables are reviewed.

Reconstruction of clone- and haplotype-specific cancer genome karyotypes from bulk tumor samples

This work introduces a novel algorithm, Reconstructing Cancer Karyotypes (RCK), to reconstruct haplotype-specific karyotypes of one or more rearranged cancer genomes, or clones, that best explain the read alignments from a bulk tumor sample.

Deciphering intratumor heterogeneity using cancer genome analysis

The ability to identify and monitor the subclonal architecture of a tumor is valuable for the development of precise cancer therapeutic methods and single-cell sequencing will enable the identification of every minor population within a tumor microenvironment.

CloneCNA: detecting subclonal somatic copy number alterations in heterogeneous tumor samples from whole-exome sequencing data

CloneCNA, a novel algorithm developed to efficiently and accurately identify somatic copy number alterations from heterogeneous tumor samples, is introduced and the statistical framework of CloneCNA represents a remarkable advance for tumor whole-exome sequencing data.



THetA: inferring intra-tumor heterogeneity from high-throughput DNA sequencing data

THetA (Tumor Heterogeneity Analysis), an algorithm that infers the most likely collection of genomes and their proportions in a sample, for the case where copy number aberrations distinguish subpopulations, is introduced.

Absolute quantification of somatic DNA alterations in human cancer

A computational method that infers tumor purity and malignant cell ploidy directly from analysis of somatic DNA alterations is described, revealing that genome-doubling events are common in human cancer, likely occur in cells that are already aneuploid, and influence pathways of tumor progression.

A combinatorial approach for analyzing intra-tumor heterogeneity from high-throughput sequencing data

The binary tree partition (BTP) is introduced, a novel combinatorial formulation of the problem of constructing the subpopulations of tumor cells from the variant allele frequencies of somatic mutations, and it is shown that finding a BTP is an NP-complete problem.

Intratumor heterogeneity and branched evolution revealed by multiregion sequencing.

Intratumor heterogeneity can lead to underestimation of the tumor genomics landscape portrayed from single tumor-biopsy samples and may present major challenges to personalized-medicine and biomarker development.

Correcting for cancer genome size and tumour cell content enables better estimation of copy number alterations from next-generation sequence data

A method that corrects contamination with normal cells and adjusts for genomes of different sizes so that the actual copy number of each region can be estimated, and proves a powerful tool when analysing publicly available data from two cell lines.

Analysis of next-generation genomic data in cancer: accomplishments and challenges.

The challenge will be to intelligently integrate data types and genomes to produce a coherent picture of the genetic basis of cancer.

Copy number variation detection in whole-genome sequencing data using the Bayesian information criterion

This study developed an algorithm to detect CNVs from whole-genome sequencing data and applied it to a newly sequenced glioblastoma genome with a matched control, which can accurately and efficiently identify CNVs via minimizing the Bayesian information criterion.

An assessment of computational methods for estimating purity and clonality using genomic data derived from heterogeneous tumor tissue samples

Several computational tools, which enable deconvolution of genomic and transcriptomic data from heterogeneous samples, are discussed and their potentials to complement single-cell genomics and immunoFISH analyses, and provide novel insights into tumor heterogeneity are discussed.

EXPANDS: expanding ploidy and allele frequency on nested subpopulations

The results inform about the extent of subclonal diversity in primary glioblastoma, subpopulation dynamics during recurrence and provide a set of candidate genes mutated in the most well-adapted subpopulations.

Comprehensive molecular portraits of human breast tumors

The ability to integrate information across platforms provided key insights into previously defined gene expression subtypes and demonstrated the existence of four main breast cancer classes when combining data from five platforms, each of which shows significant molecular heterogeneity.