Genome-wide mapping and assembly of structural variant breakpoints in the mouse genome.

@article{Quinlan2010GenomewideMA,
  title={Genome-wide mapping and assembly of structural variant breakpoints in the mouse genome.},
  author={Aaron R. Quinlan and Royden A. Clark and S S Sokolova and Mitchell L. Leibowitz and Yujun Zhang and Matthew E. Hurles and Joshua Chang Mell and Ira M. Hall},
  journal={Genome research},
  year={2010},
  volume={20},
  number={5},
  pages={623--635}
}
Structural variation (SV) is a rich source of genetic diversity in mammals, but due to the challenges associated with mapping SV in complex genomes, basic questions regarding their genomic distribution and mechanistic origins remain unanswered. We have developed an algorithm (HYDRA) to localize SV breakpoints by paired-end mapping, and a general approach for the genome-wide assembly and interpretation of breakpoint sequences. We applied these methods to two inbred mouse strains: C57BL/6J and… 
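To make the paired-end mapping idea in the abstract concrete, here is a minimal sketch of how a single read pair might be labeled by the SV signature it suggests. It is not HYDRA's implementation. The `ReadPair` type, the `classify_pair` function, the `expected_span` and `tolerance` parameters, and the label strings are hypothetical names chosen for illustration. The sketch assumes a library whose concordant pairs map to the same chromosome in +/- orientation, with leftmost coordinates roughly `expected_span` apart.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ReadPair:
    """One mapped read pair (hypothetical structure for illustration)."""
    chrom1: str
    pos1: int      # leftmost mapped coordinate of read 1
    strand1: str   # "+" or "-"
    chrom2: str
    pos2: int      # leftmost mapped coordinate of read 2
    strand2: str


def classify_pair(pair, expected_span=500, tolerance=100):
    """Label a pair by the SV signature its mapping suggests.

    Assumes concordant pairs are +/- oriented on one chromosome with a
    span within expected_span +/- tolerance (illustrative defaults).
    """
    if pair.chrom1 != pair.chrom2:
        return "interchromosomal"

    # Order the two ends by genomic position.
    (left_pos, left_strand), (right_pos, right_strand) = sorted(
        [(pair.pos1, pair.strand1), (pair.pos2, pair.strand2)]
    )

    if left_strand == right_strand:
        return "inversion"        # both ends on the same strand
    if (left_strand, right_strand) == ("-", "+"):
        return "duplication"      # everted orientation

    span = right_pos - left_pos
    if span > expected_span + tolerance:
        return "deletion"         # ends map farther apart than expected
    if span < expected_span - tolerance:
        return "insertion"        # ends map closer together than expected
    return "concordant"


pairs = [
    ReadPair("chr1", 1000, "+", "chr1", 1500, "-"),
    ReadPair("chr1", 1000, "+", "chr1", 3000, "-"),
    ReadPair("chr1", 1000, "+", "chr1", 1500, "+"),
]
labels = [classify_pair(p) for p in pairs]
```

In practice, a caller would cluster many discordant pairs that support the same candidate breakpoint before reporting a variant. The sketch labels pairs one at a time only to show the orientation and span logic.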

The fine-scale architecture of structural variants in 17 mouse genomes

TLDR
An integrative approach using experimental analyses to train computational SV calling is essential for the accurate resolution of the architecture of SVs, finding considerable complexity in SV formation.

Sequence-based characterization of structural variation in the mouse genome

TLDR
It is inferred that insertion of retrotransposons causes more than half of SVs, and 24 SVs are identified that disrupt coding exons, acting as rare variants of large effect on gene function.

Local sequence assembly reveals a high-resolution profile of somatic structural variations in 97 cancer genomes

TLDR
It is discovered that non-allelic homologous recombination is the primary mechanism for generating somatic SVs in acute myeloid leukemia, which contrasts with results for the other five types of solid tumors, in which non-homologous end joining and microhomology end joining are the predominant mechanisms.

Detection and interpretation of genomic structural variation in mammals.

TLDR
The current state of knowledge of SV in mammals is summarized, with a focus on the similarities and differences between domesticated species and human, and methods to identify SV breakpoints from next-generation sequence (NGS) data by paired-end mapping, split-read mapping, and local assembly are presented.

Mapping and characterization of structural variation in 17,795 deeply sequenced human genomes

TLDR
A cloud-based pipeline is used to map and characterize SV in 17,795 deeply sequenced human genomes from common disease trait mapping studies and exploit this resource to infer the dosage sensitivity of genes and non-coding elements, revealing strong trends related to regulatory element class, conservation and cell-type specificity.

Detection of Structural Variants Involving Repetitive Regions in the Reference Genome

TLDR
This work has developed an algorithmic approach for read mapping and SV detection based on the framework of A-Bruijn graphs that is able to identify transposition events of insertion sequence (IS) elements, a class of simple mobile genetic elements (MGEs), in E. coli by using simulated and real paired-end sequence data acquired from E. coli mutation accumulation lines.

The genomic landscape shaped by selection on transposable elements across 18 mouse strains

TLDR
It is shown that past TE insertions have often been highly deleterious, and help to prioritize TE variants according to their likely contribution to gene expression or phenotype variation, and most TE variants that cause gene expression changes appear to be purged rapidly by purifying selection.

Whole-genome resequencing allows detection of many rare LINE-1 insertion alleles in humans.

TLDR
This article presents evidence for 1016 L1 insertions across all studies to date that are not represented in the reference human genome assembly, many of which appear to be specific to populations or groups of populations, particularly Africans.

Mapping and Characterization of Structural Variation in 17,795 Human Genomes

TLDR
A scalable pipeline is used to map and characterize structural variants in 17,795 deeply sequenced human genomes to create the largest, to the authors' knowledge, whole-genome-sequencing-based structural variant resource so far and infer the dosage sensitivity of genes and noncoding elements.

References

Paired-End Mapping Reveals Extensive Structural Variation in the Human Genome

TLDR
High-throughput and massive paired-end mapping (PEM) was used to map SVs in an African and in a putatively European individual and identified shared and divergent SVs relative to the reference genome, documenting that the number of SVs among humans is much larger than initially hypothesized; many of the SVs potentially affect gene function.

Mouse segmental duplication and copy number variation

TLDR
The authors' data suggest that different mouse strains show comparable, if not greater, copy number polymorphism when compared to human; however, such variation is more locally restricted.

Mobile elements create structural variation: analysis of a complete human genome.

TLDR
This study presents the first comprehensive analysis of mobile element-related structural variants in the complete DNA sequence of an individual and demonstrates that mobile elements play an important role in generating inter-individual structural variation.

Hotspots of mammalian chromosomal evolution

TLDR
This analysis supports a nonrandom model of chromosomal evolution that implicates specific regions within the mammalian genome as having been predisposed to both recurrent small-scale duplication and large-scale evolutionary rearrangements.

Extensive variation between inbred mouse strains due to endogenous L1 retrotransposition.

TLDR
Recent endogenous L1 retrotransposition has diversified genomic structures and transcripts extensively, distinguishing mouse lineages and driving a major portion of natural genetic variation.

Sequencing human-gibbon breakpoints of synteny reveals mosaic new insertions at rearrangement sites.

TLDR
Analysis of 24 synteny breakpoints in the white-cheeked gibbon provides a model for a replication-dependent repair mechanism for double-strand breaks (DSBs) at rearrangement sites and insights into the structure and formation of primate segmental duplications at sites of genomic rearrangements during evolution.

An initial map of insertion and deletion (INDEL) variation in the human genome.

TLDR
An initial map of human INDEL variation that contains 415,436 unique INDEL polymorphisms, which range from 1 bp to 9989 bp in length and are split almost equally between insertions and deletions, relative to the chimpanzee genome sequence.

Mapping and sequencing of structural variation from eight human genomes

TLDR
This work employs a clone-based method to interrogate intermediate structural variation in eight individuals of diverse geographic ancestry and provides the first high-resolution sequence map of human structural variation—a standard for genotyping platforms and a prelude to future individual genome sequencing projects.

BreakDancer: An algorithm for high resolution mapping of genomic structural variation

TLDR
The algorithm BreakDancer predicts a wide variety of structural variants including insertion-deletions (indels), inversions and translocations and sensitively and accurately detected indels ranging from 10 base pairs to 1 megabase pair that are difficult to detect via a single conventional approach.

A Comparison of Whole-Genome Shotgun-Derived Mouse Chromosome 16 and the Human Genome

TLDR
Comparison of the structure and protein-coding potential of Mmu 16 with that of the homologous segments of the human genome identifies regions of conserved synteny with human chromosomes (Hsa) 3, 8, 12, 16, 21, and 22.