With the maturation of next-generation DNA sequencing (NGS) technologies, the throughput of DNA sequencing reads has soared to over 600 gigabases from a single instrument run. General-purpose computing on graphics processing units (GPGPU) extracts the computing power from hundreds of parallel stream processors within graphics processing cores and provides(More)
SUMMARY: Here we introduce catRAPID omics, a server for large-scale calculations of protein-RNA interactions. Our web server allows (i) predictions at proteomic and transcriptomic level; (ii) use of protein and RNA sequences without size restriction; (iii) analysis of nucleic acid binding regions in proteins; and (iv) detection of RNA motifs involved in(More)
Motivation: Recent technological advances revealed that an unexpectedly large number of proteins interact with transcripts even if the RNA-binding domains are not annotated. We introduce catRAPID signature to identify ribonucleoproteins based on physico-chemical features instead of sequence similarity searches. The algorithm, trained on human proteins and(More)
Increasing evidence indicates that RNA plays an active role in a number of neurodegenerative diseases. We recently introduced a theoretical framework, catRAPID, to predict the binding ability of protein and RNA molecules. Here, we use catRAPID to investigate ribonucleoprotein interactions linked to inherited intellectual disability, amyotrophic lateral(More)
MOTIVATION: The recent shift towards high-throughput screening is posing new challenges for the interpretation of experimental results. Here we propose the cleverSuite approach for large-scale characterization of protein groups. DESCRIPTION: The central part of the cleverSuite is the cleverMachine (CM), an algorithm that performs statistics on protein(More)
It has been reported that genes up-regulated in cancer are often down-regulated in neurodegenerative disorders and vice versa. The fact that apparently unrelated diseases share functional pathways suggests a link between their etiopathogenesis and the properties of molecules involved. Are there specific features that explain the exclusive association of(More)
MOTIVATION: Recent technological advances revealed that an unexpectedly large number of proteins interact with transcripts even if they lack annotated RNA-binding domains. We introduce catRAPID signature to identify ribonucleoproteins based on physico-chemical features instead of sequence similarity searches. The algorithm, trained on human proteins and(More)
Comparison between multiple protein datasets requires the choice of an appropriate reference system and a number of variables to describe their differences. Here we introduce an innovative approach to discriminate multiple protein datasets (multiCM) and to measure enrichments in gene ontology terms (cleverGO) using semantic similarities. We illustrate the(More)
The biological function of multiple repetitions of single amino acids, or homo-repeats, is largely unknown, but their occurrence in proteins has been associated with more than 20 hereditary diseases. Analysing 122 bacterial and eukaryotic genomes, we observed that the number of proteins containing homo-repeats is significantly larger than expected from(More)