HGT-ID: an efficient and sensitive workflow to detect human-viral insertion sites using next-generation sequencing data

@article{Baheti2018HGTIDAE,
  title={HGT-ID: an efficient and sensitive workflow to detect human-viral insertion sites using next-generation sequencing data},
  author={Saurabh Baheti and Xiaojia Tang and Daniel R. O'Brien and Nicholas Chia and Lewis Rowland Roberts and Heidi Nelson and Judy C. Boughey and Liewei Wang and Matthew P. Goetz and Jean-Pierre A. Kocher and Krishna R. Kalari},
  journal={BMC Bioinformatics},
  year={2018},
  volume={19}
}
Background: Transfer of genetic material from microbes or viruses into the host genome is known as horizontal gene transfer (HGT). The integration of viruses into the human genome is associated with multiple cancers, and these can now be detected using next-generation sequencing methods such as whole genome sequencing and RNA-sequencing. Results: We designed a novel computational workflow, HGT-ID, to identify the integration of viruses into the human genome using the sequencing data. The HGT-ID…
Exogene: A performant workflow for detecting viral integrations from paired-end next-generation sequencing data
TLDR
Exogene’s read filtering and breakpoint detection strategies yield integration coordinates that are highly concordant with long read validation, and the workflow is applicable to multiple types of paired-end sequence data, including genome, exome, RNA-Seq and targeted capture.
Exogene: A performant workflow for detecting viral integrations from paired-end next-generation sequencing data
TLDR
Exogene’s read filtering and breakpoint detection strategies yield integration coordinates that are highly concordant with those found in long read validation sets, and the workflow is applicable to multiple types of paired-end sequence data, including genome, exome, RNA-Seq or targeted capture.
ViR: a tool to solve intrasample variability in the prediction of viral integration sites using whole genome sequencing data
TLDR
ViR is a pipeline that resolves the dispersion of reads caused by intrasample variability in sequencing data from both single and pooled DNA samples, thereby improving the detection of integration sites; it will open new avenues to explore the biology of EVEs, especially in non-model organisms.
ViR: a tool to account for intrasample variability in the detection of viral integrations
TLDR
ViR is a pipeline that improves the detection of integration sites by resolving the dispersion of reads in genome assemblies that are rich in repetitive DNA, and it can be readily adopted to detect any LT event, provided that ad hoc non-host sequences are supplied to interrogate.
A comprehensive analysis of RHOA mutation positive
TLDR
The results demonstrate that there are specific cell biological and morphological manifestations of RHOA mutations in cases of AITL, and computational analysis of single cell morphological parameters revealed that RHOA mutated cells have decreased eccentricity compared with RHOA non-mutated AITL cells.
Causes and Consequences of HPV Integration in Head and Neck Squamous Cell Carcinomas: State of the Art
TLDR
The literature of the most recent five years focusing on the impact of HPV integration in HNSCCs is reviewed, covering aspects of detection techniques used, integration loci identified, and associations with genomic and clinical data.
Mutations in JAK/STAT and NOTCH1 Genes Are Enriched in Post-Transplant Lymphoproliferative Disorders
TLDR
It is suggested that immune regulatory pathways play an essential role in PTLD, with the JAK/STAT pathway affected in many PTLDs.
Indolent In Situ B-Cell Neoplasms With MYC Rearrangements Show Somatic Mutations in MYC and TNFRSF14 by Next-generation Sequencing
TLDR
Two cases of in situ B-cell neoplasms with MYC rearrangements (IS-BCN, MYC+) are identified, both focally confined to ≤4 lymphoid follicles, occurring in otherwise healthy individuals and showing no clinical progression despite minimal intervention.
Expansion and persistence of antibiotic-specific resistance genes following antibiotic treatment
TLDR
It is suggested that antibiotic treatment can lead to a persistent expansion of antibiotic resistance genes in the human gut microbiota, and further data are provided in support of good antibiotic stewardship.

References

VirusFinder: Software for Efficient and Accurate Detection of Viruses and Their Integration Sites in Host Genomes through Next Generation Sequencing Data
TLDR
VirusFinder’s unique features include the characterization of insertion loci of virus of arbitrary type in the host genome and high accuracy and computational efficiency as a result of its well-designed pipeline.
VirusSeq: software to identify viruses and their integration sites using next-generation sequencing of human cancer tissue
TLDR
VirusSeq is a new algorithmic method that accurately detects the known viruses and their integration sites in the human genome using next-generation sequencing data, and it can also perform this function using whole-genome sequencing data of human tissue.
VERSE: a novel approach to detect virus integration in host genomes through reference genome customization
TLDR
This work presents a new approach that detects Virus intEgration sites through iterative Reference SEquence customization (VERSE), which is the first approach to improve detection through customizing reference genomes.
BATVI: Fast, sensitive and accurate detection of virus integrations
TLDR
The performance of BatVI was compared with existing methods VirusFinder and VirusSeq using both simulated and real-life datasets of liver cancer patients and it was able to predict almost twice the number of true positives compared to other methods while maintaining a false positive rate less than 1%.
ViralFusionSeq: accurately discover viral integration events and reconstruct fusion transcripts at single-base resolution
TLDR
VFS is presented, which combines soft-clipping information, read-pair analysis and targeted de novo assembly to discover and annotate viral–human fusions, and is demonstrated to be both sensitive and highly accurate.
Bacteria-Human Somatic Cell Lateral Gene Transfer Is Enriched in Cancer Samples
TLDR
Evidence is presented that bacterial DNA integrates into the human somatic genome through an RNA intermediate, and that such integrations are detected more frequently in tumors than normal samples, RNA than DNA samples, and the mitochondrial genome than the nuclear genome.
No significant viral transcription detected in whole breast cancer transcriptomes
TLDR
The results show that no viral sequences are expressed in significant amounts in the BC investigated, and the presence of non-transcribed viral DNA cannot be excluded.
Landscape of DNA Virus Associations across Human Malignant Cancers: Analysis of 3,775 Cases Using RNA-Seq
TLDR
While further validation is necessary for specific cancer types, the findings highlight the utility of RNA-Seq in detecting tumor-associated DNA viruses and identifying viral integration sites that may unravel novel mechanisms of cancer pathogenesis.
The landscape of viral expression and host gene fusion and adaptation in human cancer
TLDR
A virus‐tumour map of unprecedented scale is provided that constitutes a reference for future studies of tumour‐associated viruses using transcriptome sequencing data and argues strongly against viral aetiology in several cancers where this has frequently been proposed.
BEDTools: a flexible suite of utilities for comparing genomic features
TLDR
A new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format, which allows the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks.