Torbjørn Rognes

Learn More
The publication of a complete genome sequence is usually accompanied by annotations of its genes. In contrast to protein coding genes, genes for ribosomal RNA (rRNA) are often poorly or inconsistently annotated. This makes comparative studies based on rRNA genes difficult. We have therefore created computational predictors for the major rRNA species from(More)
The Smith-Waterman algorithm for local sequence alignment is more sensitive than heuristic methods for database searching, but also more time-consuming. The fastest approach to parallelisation with SIMD technology has previously been described by Farrar in 2007. The aim of this study was to explore whether further speed could be gained by other approaches(More)
MOTIVATION Sequence database searching is among the most important and challenging tasks in bioinformatics. The ultimate choice of sequence-search algorithm is that of Smith-Waterman. However, because of the computationally demanding nature of this method, heuristic programs or special-purpose hardware alternatives have been developed. Increased speed has(More)
Base excision repair (BER) is the primary DNA repair pathway that corrects base lesions that arise due to oxidative, alkylation, deamination, and depurinatiation/depyrimidination damage. BER facilitates the repair of damaged DNA via two general pathways – short-patch and long-patch. The shortpatch BER pathway leads to a repair tract of a single nucleotide.(More)
Endonuclease III from Escherichia coli is the prototype of a ubiquitous DNA repair enzyme essential for the removal of oxidized pyrimidine base damage. The yeast genome project has revealed the presence of two genes in Saccharomyces cerevisiae, NTG1 and NTG2, encoding proteins with similarity to endonuclease III. Both contain the highly conserved(More)
Repeated sequence signatures are characteristic features of all genomic DNA. We have made a rigorous search for repeat genomic sequences in the human pathogens Neisseria meningitidis, Neisseria gonorrhoeae and Haemophilus influenzae and found that by far the most frequent 9-10mers residing within coding regions are the DNA uptake sequences (DUS) required(More)
The functions of several SOS regulated genes in Escherichia coli are still unknown, including dinQ. In this work we characterize dinQ and two small RNAs, agrA and agrB, with antisense complementarity to dinQ. Northern analysis revealed five dinQ transcripts, but only one transcript (+44) is actively translated. The +44 dinQ transcript translates into a(More)
Popular de novo amplicon clustering methods suffer from two fundamental flaws: arbitrary global clustering thresholds, and input-order dependency induced by centroid selection. Swarm was developed to address these issues by first clustering nearly identical amplicons iteratively using a local threshold, and then by using clusters' internal structure and(More)
BACKGROUND VSEARCH is an open source and free of charge multithreaded 64-bit tool for processing and preparing metagenomics, genomics and population genomics nucleotide sequence data. It is designed as an alternative to the widely used USEARCH tool (Edgar, 2010) for which the source code is not publicly available, algorithm details are only rudimentarily(More)
Several methods exist for predicting non-coding RNA (ncRNA) genes in Escherichia coli (E.coli). In addition to about sixty known ncRNA genes excluding tRNAs and rRNAs, various methods have predicted more than thousand ncRNA genes, but only 95 of these candidates were confirmed by more than one study. Here, we introduce a new method that uses automatic(More)