The human genome is pervasively transcribed, producing thousands of non-coding RNA transcripts. The majority of these transcripts are long non-coding RNAs (lncRNAs), and novel lncRNA genes are being identified at a rapid pace. To streamline these efforts, we created LNCipedia, an online repository of lncRNA transcripts and annotation. Here, we present…
An increasing number of studies involve integrative analysis of gene and protein expression data, taking advantage of new technologies such as next-generation transcriptome sequencing and highly sensitive mass spectrometry (MS) instrumentation. Recently, a strategy, termed ribosome profiling (or RIBO-seq), based on deep sequencing of ribosome-protected mRNA…
In the search for new methods of pest control, the potential of RNA interference (RNAi) is being explored. Because the gut is the first barrier for the uptake of double-stranded (ds)RNA, pyrosequencing of the gut transcriptome is a powerful tool for obtaining the necessary sequences for specific dsRNA-mediated pest control. In the present study, a dataset…
Epigenetics, and more specifically DNA methylation, is a fast-evolving research area. In almost every cancer type, new publications each month confirm the differentiated regulation of specific genes due to methylation and report the discovery of novel methylation markers. Therefore, it would be extremely useful to have an annotated, reviewed, sorted and…
An increasing number of studies integrate mRNA sequencing data into MS-based proteomics to complement the translation product search space. However, several factors, including extensive regulation of mRNA translation and the need for three- or six-frame translation, impede the use of mRNA-seq data for the construction of a protein sequence search database…
Usage of presumed 5'UTR or downstream in-frame AUG codons, next to non-AUG codons, as translation start codons contributes to the diversity of a proteome, as protein isoforms harboring different N-terminal extensions or truncations can serve different functions. Recent ribosome profiling data revealed a highly underestimated occurrence of database…
The term peptidomics for a new, promising "omics" field was not introduced until the beginning of 2000. The approach has proven successful in several domains such as neuroendocrine research and biomarker or drug discovery. This review reports on bioinformatics tools and methodologies within the peptidomics field and the application thereof. Obviously, a…
With the advent of ribosome profiling, a next-generation sequencing technique providing a "snapshot" of translated mRNA in a cell, many short open reading frames (sORFs) with ribosomal activity were identified. Follow-up studies revealed the existence of functional peptides, so-called micropeptides, translated from these sORFs, indicating a new class of…
Peptidomics is the identification and study of the in vivo biologically active peptide profile. A combination of high-performance liquid chromatography, mass spectrometry, and bioinformatics tools such as database search engines is commonly used to perform the analysis. We report a methodology based on a database system holding the completed translated…
It was long assumed that proteins are at least 100 amino acids (AAs) long. Moreover, the detection of short translation products (e.g. encoded by small open reading frames, sORFs) is very difficult, as the short length makes it hard to distinguish true coding ORFs from ORFs occurring by chance. Nevertheless, over the past few years many such non-canonical…