Torrents of sequence

  title={Torrents of sequence},
  author={Nicole Rusk},
  journal={Nature Methods},
It is hard to overstate the positive changes that high-throughput sequencing has brought to biological research in the last few years. Each year developments with the potential to change the sequencing landscape emerge, and 2010 was no different. Ion Torrent launched in February 2010 with a sequencer based not on dye-labeled oligonucleotides and expensive optics but on ion detection (the machine has been casually referred to as ‘a pH meter that sequences’). The principle is straightforward: DNA… 

Efficient approximate string matching techniques for sequence alignment

This thesis tackles the problem of sequence alignment as a step within the analysis of HTS data, and introduces state-of-the-art index structures and approximate string matching algorithms.

Improving sequencing by tunneling with multiplexing and cross-correlations

Sequencing by tunneling is a next-generation approach to read single-base information using electronic tunneling transverse to the single-stranded DNA (ssDNA) backbone while the latter is

Discovery of posttranscriptional regulatory RNAs using next generation sequencing technologies.

This chapter describes the use of these technologies as a sequencing technique for transcriptome analysis and discovery of regulatory RNA elements in the context of three main platforms: Illumina HiSeq, 454 pyrosequencing, and SOLiD sequencing.

MinION TM : New, Long Read, Portable Nucleic Acid Sequencing Device

The background to the technology is reviewed, the current device is focussed on DNA sequencing, directly sequencing RNA is a likely development and it is compared to current next generation sequencing.

DNA Sequencing Technologies

In this review, a historical snapshot of DNA sequencing is taken with an emphasis on data manipulation and tools, and the existing DNA sequencing technologies are critically discussed for their pitfalls and potential solutions.

Approximate string matching for high-throughput sequencing

This thesis presents novel methods for the mapping of high-throughput sequencing DNA reads, based on state of the art approximate string matching algorithms and data structures, and provides all implementations within SeqAn, the generic C++ template library for sequence analysis, which is freely available under

Compression of Short Read data sets

This thesis looks at using differential compression both for raw unmapped reads and reads that been mapped to a reference genome using a reference-based solution and an intra-frame differencing similar to frame-to-frame approach in video compression.

Next-generation sequencing data analysis

The aim of this PhD project was to investigate and develop methods and parameters to identify pathogenic variants and genes from large amounts of exome sequencing data to reduce runtime, memory usage, required disk space and hands-on time, as well as to increase flexibility and allow easier adaptation and extension.

A survey of genome sequence assembly techniques and algorithms using high-performance computing

This paper walks through the entire streamlined process while describing, analyzing, and commenting on the algorithms and techniques that have been designed and implemented for each of the stages, streamlining all the individual tasks related to the whole genome shotgun sequencing into a sequence of loosely coupled stages.

Pathoscope: species identification and strain attribution with unassembled sequencing data.

The method, Pathoscope, capitalizes on a Bayesian statistical framework that accommodates information on sequence quality, mapping quality, and provides posterior probabilities of matches to a known database of target genomes to accurately discriminate between very closely related strains of the same species with very little coverage of the genome.