Common integration sites of published datasets identified using a graph-based framework

Abstract

Next-generation sequencing has dramatically increased the genomic data available for characterizing integration sites (IS). A single experiment can now investigate several thousand viral integration targets in the genome to define genomic hot spots. In a previous article, we reformulated the formal analysis of common integration sites (CIS), replacing a rigid fixed-window demarcation with a more flexible definition grounded on graphs. Here, we present a selection of supporting data related to the graph-based framework (GBF) from that article, in which a collection of CIS was identified in six published datasets. In this work, we focus on two datasets, ISRTCGD and ISHIV, which have been previously discussed. Moreover, we show in more detail the workflow design that generated the datasets.
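
To make the contrast with a fixed-window analysis concrete, the following is a minimal sketch of one possible graph-style grouping. It is not the GBF procedure from the article. It assumes that integration sites are nodes, that two sites on the same chromosome are linked when they lie within a distance threshold, and that each connected component with enough sites is a candidate CIS. The function name `candidate_cis` and the parameters `max_gap` and `min_size` are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def candidate_cis(sites, max_gap=50_000, min_size=3):
    """Group integration sites into candidate clusters.

    sites: iterable of (chromosome, position) tuples.
    max_gap, min_size: illustrative thresholds (assumptions, not
    values taken from the article).

    Sites are treated as graph nodes; an edge joins two sites on the
    same chromosome when their distance is <= max_gap. On a linear
    coordinate, the connected components of that graph are found by
    sorting positions and chaining neighbours closer than max_gap.
    """
    by_chrom = defaultdict(list)
    for chrom, pos in sites:
        by_chrom[chrom].append(pos)

    clusters = []
    for chrom, positions in sorted(by_chrom.items()):
        positions.sort()
        current = [positions[0]]
        for pos in positions[1:]:
            if pos - current[-1] <= max_gap:
                current.append(pos)  # same component
            else:
                if len(current) >= min_size:
                    clusters.append((chrom, current))
                current = [pos]  # start a new component
        if len(current) >= min_size:
            clusters.append((chrom, current))
    return clusters
```

Unlike a fixed window, a cluster built this way has no preset width: it grows for as long as consecutive sites stay within the threshold, which is the kind of flexibility a graph-based definition allows.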

DOI: 10.1016/j.csbj.2015.11.004

Cite this paper

@inproceedings{Vasciaveo2016CommonIS, title={Common integration sites of published datasets identified using a graph-based framework}, author={Alessandro Vasciaveo and Ivana Velevska and Gianfranco Politano and Alessandro Savino and Manfred Schmidt and Raffaele Fronza}, booktitle={Computational and structural biotechnology journal}, year={2016} }