Learn More
The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human(More)
SUMMARY INFERNAL builds consensus RNA secondary structure profiles called covariance models (CMs), and uses them to search nucleic acid sequence databases for homologous RNAs, or to create new sequence- and structure-based multiple sequence alignments. AVAILABILITY Source code, documentation and benchmark downloadable from http://infernal.janelia.org.(More)
Rfam is a collection of RNA sequence families, represented by multiple sequence alignments and covariance models (CMs). The primary aim of Rfam is to annotate new members of known RNA families on nucleotide sequences, particularly complete genomes, using sensitive BLAST filters in combination with CMs. A minority of families with a very broad taxonomic(More)
The Rfam database aims to catalogue non-coding RNAs through the use of sequence alignments and statistical profile models known as covariance models. In this contribution, we discuss the pros and cons of using the online encyclopedia, Wikipedia, as a source of community-derived annotation. We discuss the addition of groupings of related RNA families into(More)
Hearing loss is the most common sensory deficit in humans, affecting 1 in 500 newborns. Due to its genetic heterogeneity, comprehensive diagnostic testing has not previously been completed in a large multiethnic cohort. To determine the aggregate contribution inheritance makes to non-syndromic hearing loss, we performed comprehensive clinical genetic(More)
Whole exome sequencing (WES) has revolutionized the way we think about and diagnose epileptic encephalopathies. Multiple recent review articles discuss the benefits of WES and suggest various algorithms to follow for determining the etiology of epileptic encephalopathies. Incorporation of WES in these algorithms is leading to the discovery of new genetic(More)
There is tremendous potential for genome sequencing to improve clinical diagnosis and care once it becomes routinely accessible, but this will require formalizing research methods into clinical best practices in the areas of sequence data generation, analysis, interpretation and reporting. The CLARITY Challenge was designed to spur convergence in methods(More)
Copy number variants (CNVs) are a well-recognized cause of genetic disease; however, methods for their identification are often gene-specific, excluded as ‘routine’ in screens of genetically heterogeneous disorders, and not implemented in most next-generation sequencing pipelines. For this reason, the contribution of CNVs to non-syndromic hearing loss(More)
OBJECTIVES In this report, we present a male patient with no family history of hearing loss, in whom we identified a novel de novo mutation in the POU3F4 gene. METHODS One hundred ninety-four (194) Japanese subjects from unrelated and nonconsanguineous families were enrolled in this study. We used targeted genomic enrichment and massively parallel(More)
MOTIVATION Homology search for RNAs can use secondary structure information to increase power by modeling base pairs, as in covariance models, but the resulting computational costs are high. Typical acceleration strategies rely on at least one filtering stage using sequence-only search. RESULTS Here we present the multi-segment CYK (MSCYK) filter, which(More)