ERNE-BS5: aligning BS-treated sequences by multiple hits on a 5-letters alphabet

  • Nicola Prezza, Cristian Del Fabbro, Francesco Vezzi, Emanuele De Paoli, Alberto Policriti
  • Published 7 October 2012
  • Computer Science, Biology
  • Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine
Cytosine methylation is a DNA modification that has a great impact on the regulation of gene expression and important implications for the biology and health of many living organisms, including humans. Bisulfite conversion followed by next-generation sequencing (BS-seq) of DNA is the gold-standard technique for detecting DNA methylation at single-base resolution on a genome scale through the identification of 5-methylcytosine (5-mC). However, by converting unmethylated cytosines into thymines…
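
As an illustration of the conversion mentioned in the abstract, the following is a minimal Python sketch of in-silico bisulfite treatment: unmethylated cytosines are read as thymines, while methylated cytosines stay as C. The function names, the example sequence, and the set of methylated positions are hypothetical and chosen only for illustration; this is not ERNE-BS5's alignment method, only a toy model of the input it has to deal with.

```python
def bisulfite_convert(seq, methylated_positions=frozenset()):
    """Simulate bisulfite treatment of one strand (illustrative only).

    Unmethylated C is read as T after conversion and sequencing;
    a C at a methylated position (0-based index) is left unchanged.
    """
    out = []
    for i, base in enumerate(seq.upper()):
        if base == "C" and i not in methylated_positions:
            out.append("T")
        else:
            out.append(base)
    return "".join(out)


def count_mismatches(a, b):
    """Count positions where two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


# Hypothetical reference fragment; the C at index 1 is assumed methylated.
reference = "ACGTCCGA"
read = bisulfite_convert(reference, methylated_positions={1})

print(read)
print(count_mismatches(reference, read))
```

Every unmethylated C in the sketch shows up as a mismatch against the unconverted reference, even though the read comes from exactly that location, so an aligner for BS-treated reads has to tolerate or model these substitutions.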

Fast, accurate, and lightweight analysis of BS-treated reads with ERNE 2

The new functionalities of the bisulfite aligner and caller make it a fast and memory efficient tool, useful to analyze big datasets with little computational resources, to easily process target enrichment data, and produce statistics such as protocol efficiency and coverage as a function of the distance from target regions.

Comprehensive benchmarking of software for mapping whole genome bisulfite data: from read alignment to DNA methylation analysis

Precision-recall analysis of simulated alignments reveals that, on balance, BWA-meth and BSMAP make the best use of the data during mapping; the influence of difficult-to-map regions, characterised by deviations in sequencing depth over repeat annotations, is also evaluated.

Evaluating and Improving Performance of Bisulfite Short Reads Alignment and the Identification of Differentially Methylated Sites

A strategy to improve the mapping efficiencies of the existing bisulfite short reads software by finding unique locations for multireads and modifying the default settings in both WFMM and methylKit to be more tailored to a given methylation profile, thus improving the accuracy of detecting DMCs.

Objective and Comprehensive Evaluation of Bisulfite Short Read Mapping Tools

This work compared five bisulfite short read mapping tools, BSMAP, Bismark, BS-Seeker, BiSS, and BRAT-BW, representing two classes of mapping algorithms (hash table and suffix/prefix tries), and examined their mapping efficiency, usability, running time, and effects of changing default parameter settings.

Evaluation of preprocessing, mapping and postprocessing algorithms for analyzing whole genome bisulfite sequencing data

It is confirmed that paired-end sequencing reduced error rate and enhanced sensitivity for both read mapping and methylation level estimation, especially for short reads and in repetitive regions of the human genome.

Alignment of Short Reads: A Crucial Step for Application of Next-Generation Sequencing Data in Precision Medicine

Currently available alignment algorithms and their major strategies, such as seed-and-extend and q-gram filtering, are reviewed, and the challenges facing current alignment algorithms are discussed, including alignment in repeated regions, long-read alignment, and alignment facilitated by known genetic variants.

A Rank-Based Sequence Aligner with Applications in Phylogenetic Analysis

The empirical results show that the LRD aligner proposed in this work is highly accurate from a biological point of view, and can be considered as a good alternative to standard alignment tools, especially when the accuracy of the aligner is of high importance.

Novel computational techniques for mapping and classifying Next-Generation Sequencing data. (Nouvelles techniques informatiques pour la localisation et la classification de données de séquençage haut débit)

This thesis presents novel computational techniques for read mapping and taxonomic classification of NGS reads and provides the first comprehensive overview of this method and demonstrates its qualities using Dynamic Mapping Simulator, a pipeline that compares various dynamic mapping scenarios to static mapping and iterative referencing.

An improved filtering algorithm for big read datasets and its application to single-cell assembly

This paper presents Bignorm, a faster and quality-conscious read filtering algorithm, which allows assemblies of competitive quality in comparison to Diginorm, while being much faster.



Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications

Bismark is a flexible tool for the time-efficient analysis of BS-Seq data which performs both read mapping and methylation calling in a single convenient step and enables bench scientists to visualize and interpret their methylation data soon after the sequencing run is completed.

BS Seeker: precise mapping for bisulfite sequencing

BS Seeker provides fast and accurate mapping of bisulfite-converted reads; it can work with BS reads generated from the two different experimental protocols and is able to efficiently map reads to large mammalian genomes.

An alignment algorithm for bisulfite sequencing using the Applied Biosystems SOLiD System

SOCS-B is presented, a reference-based, un-gapped alignment algorithm for the SOLiD System that is tolerant of both bisulfite-induced nucleotide substitutions and a parametric number of sequencing errors, facilitating bisulfite sequencing on this platform.

Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning

A map at single-base-pair resolution of methylated cytosines for Arabidopsis is generated by combining bisulphite treatment of genomic DNA with ultra-high-throughput sequencing using the Illumina 1G Genome Analyser and Solexa sequencing technology.

BSMAP: whole genome bisulfite sequence MAPping program

This work has developed an efficient bisulfite reads mapping algorithm, BSMAP, which is able to map high-throughput bisulfite reads at whole genome level with feasible memory and CPU usage.

DNA methylome analysis using short bisulfite sequencing data

The challenges of BS-seq mapping as they apply to both base and color-space data are summarized, and the effect of sequencing errors and contaminants on inferred methylation levels is explored.

B-SOLANA: an approach for the analysis of two-base encoding bisulfite sequencing data

B-SOLANA, an approach for the analysis of two-base encoding (colorspace) bisulfite sequencing data on the SOLiD platform of Life Technologies, enables a fast and accurate analysis of large raw sequence datasets.

BRAT-BW: efficient and accurate mapping of bisulfite-treated reads

BRAT-BW is a fast, accurate and memory-efficient tool that maps bisulfite-treated short reads to a reference genome using the FM-index (Burrows-Wheeler transform) and is significantly more memory efficient and faster on longer reads than current state-of-the-art tools for BS-seq data, without compromising on accuracy.

Spontaneous epigenetic variation in the Arabidopsis thaliana methylome

In a comparison of genome-wide DNA methylation among 10 A. thaliana lines, differentially methylated sites were farther from transposable elements and showed less association with short interfering RNA expression than invariant positions, a finding with important implications for the potential contribution of sequence-independent epialleles to plant evolution.