Fast and scalable inference of multi-sample cancer lineages

@article{Popic2015FastAS,
  title={Fast and scalable inference of multi-sample cancer lineages},
  author={Victoria Popic and Raheleh Salari and Iman Hajirasouliha and Dorna Kashef Haghighi and Robert B. West and Serafim Batzoglou},
  journal={Genome Biology},
  year={2015},
  volume={16}
}
Somatic variants can be used as lineage markers for the phylogenetic reconstruction of cancer evolution. Since somatic phylogenetics is complicated by sample heterogeneity, novel specialized tree-building methods are required for cancer phylogeny reconstruction. We present LICHeE (Lineage Inference for Cancer Heterogeneity and Evolution), a novel method that automates the phylogenetic inference of cancer progression from multiple somatic samples. LICHeE uses variant allele frequencies of… 
Deconvolution and phylogeny inference of structural variations in tumor genomic samples
TLDR
A novel method for reconstructing evolutionary trajectories of SVs from bulk whole-genome sequence data via joint deconvolution and phylogenetics, to infer clonal subpopulations and reconstruct their ancestry is presented.
Phylogenetic Tree Inference: A Top-Down Approach to Track Tumor Evolution
TLDR
A novel and easy-to-use method, PTI (Phylogenetic Tree Inference), which use an iterative top-down approach to infer the phylogenetic tree structure of multiple tumor biopsies from same patient using just the presence or absence of somatic mutations without their allele frequencies.
Clonality Inference from Single Tumor Samples Using Low-Coverage Sequence Data
TLDR
CTPsingle is shown to be able to infer the purity and the clonality of single-sample tumors with high accuracy, even restricted to a coverage depth of ∼30 × .
Clonality Inference from Single Tumor Samples Using Low Coverage Sequence Data
TLDR
CTPsingle is shown to be able to infer the purity and the clonality of single-sample tumors with high accuracy even restricted to a coverage depth of \(\sim \)30x.
Using LICHeE and BAMSE for Reconstructing Cancer Phylogenetic Trees
TLDR
This unit provides extensive guidelines for installing and running both LICHeE and BAMSE, two computational tools designed and implemented recently for the reconstruction of cancer phylogeny trees and quantifying the evolution of the disease.
Abstract 2374: Reconstructing the evolutionary history of metastatic cancers
TLDR
The new approach described here efficiently reconstructs the evolutionary history of metastases, detects potential artifacts in noisy high-throughput sequencing data, and finds subclones of distinct origin and shows that classical distance-based phylogenetic methods can produce evolutionarily implausible results.
BAMSE: Bayesian model selection for tumor phylogeny inference among multiple samples
TLDR
BAMSE (BAyesian Model Selection for tumor Evolution), a new probabilistic method for inferring subclonal history and lineage tree reconstruction of heterogeneous tumor samples accurately and efficiently is presented.
Tumor phylogeny inference using tree-constrained importance sampling
TLDR
On real data from a chronic lymphocytic leukemia (CLL) patient, it is demonstrated on simulated data that PASTRI outperforms other cancer phylogeny algorithms in terms of runtime and accuracy, and a simple linear phylogeny better explains the data the complex branching phylogeny that was previously reported.
PyClone-VI: Scalable inference of clonal population structures using whole genome data
TLDR
The proposed PyClone-VI, a computationally efficient Bayesian statistical method for inferring the clonal population structure of cancers is described, which is 10–100× times faster than existing methods, while providing results which are as accurate.
...
...

References

SHOWING 1-10 OF 51 REFERENCES
Inference of Tumor Phylogenies with Improved Somatic Mutation Discovery
TLDR
A novel algorithm is presented that uses somatic single-nucleotide variations in multiple, related tissue samples as lineage markers for phylogenetic tree reconstruction and leverages the inferred phylogeny to improve the accuracy of SNV discovery.
Clonality inference in multiple tumor samples using phylogeny
TLDR
A novel combinatorial method, named clonality inference in tumors using phylogeny (CITUP), that infers clonal populations and their frequencies while satisfying phylogenetic constraints and is able to exploit data from multiple samples.
Inferring clonal evolution of tumors from single nucleotide somatic mutations
TLDR
A new statistical model, PhyloSub, is introduced that infers the phylogeny and genotype of the major subclonal lineages represented in the population of cancer cells and can be applied to frequencies of any “binary” somatic mutation, including SNVs as well as small insertions and deletions.
Reconstructing Mutational History in Multiply Sampled Tumors Using Perfect Phylogeny Mixtures
TLDR
The Perfect Phylogeny Mixture problem of inferring a perfect phylogeny given somatic mutation data from multiple tumor samples, each of which is a superposition of cells, or “species,” is formulated and proved to be NP-hard.
PhyloWGS: Reconstructing subclonal composition and evolution from whole-genome sequencing of tumors
TLDR
A principled phylogenic correction for VAFs in loci affected by copy number alterations is introduced and it is shown that this correction greatly improves subclonal reconstruction compared to existing methods.
Subclonal phylogenetic structures in cancer revealed by ultra-deep sequencing
TLDR
The potential for ultra-deep resequencing to recapitulate the dynamics of clonal evolution in cancer cell populations is demonstrated and an algorithm to differentiate genuine haplotypes of somatic hypermutations from such artifacts is developed.
THetA: inferring intra-tumor heterogeneity from high-throughput DNA sequencing data
TLDR
THetA (Tumor Heterogeneity Analysis), an algorithm that infers the most likely collection of genomes and their proportions in a sample, for the case where copy number aberrations distinguish subpopulations, is introduced.
PyClone: statistical inference of clonal population structure in cancer
TLDR
PyClone is a Bayesian clustering method for grouping sets of deeply sequenced somatic mutations into putative clonal clusters while estimating their cellular prevalences and accounting for allelic imbalances introduced by segmental copy-number changes and normal-cell contamination.
TITAN: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data.
TLDR
A novel probabilistic model is presented, TITAN, to infer CNA and LOH events while accounting for mixtures of cell populations, thereby estimating the proportion of cells harboring each event.
SubcloneSeeker: a computational framework for reconstructing tumor clone structure for cancer variant interpretation and prioritization
TLDR
It is demonstrated how subclone reconstruction provides crucial information about tumorigenesis and relapse mechanisms; guides functional study by variant prioritization, and has the potential as a rational basis for informed therapeutic strategies for the patient.
...
...