Learn More
RNA-binding proteins are key regulators of gene expression, yet only a small fraction have been functionally characterized. Here we report a systematic analysis of the RNA motifs recognized by RNA-binding proteins, encompassing 205 distinct genes from 24 diverse eukaryotes. The sequence specificities of RNA-binding proteins display deep evolutionary(More)
SUMMARY pybedtools is a flexible Python software library for manipulating and exploring genomic datasets in many common formats. It provides an intuitive Python interface that extends upon the popular BEDTools genome arithmetic tools. The library is well documented and efficient, and allows researchers to quickly develop simple, yet powerful scripts that(More)
Gene dosage change is a mild perturbation that is a valuable tool for pathway reconstruction in Drosophila. While it is often assumed that reducing gene dose by half leads to two-fold less expression, there is partial autosomal dosage compensation in Drosophila, which may be mediated by feedback or buffering in expression networks. We profiled expression in(More)
The principles underlying the architectural landscape of chromatin beyond the nucleosome level in living cells remains largely unknown despite its potential to play a role in mammalian gene regulation. We investigated the three-dimensional folding of a 1 Mbp region of human chromosome 11 containing the β-globin genes by integrating looping interactions of(More)
Chromatin insulators organize the genome into distinct transcriptional domains and contribute to cell type-specific chromatin organization. However, factors regulating tissue-specific insulator function have not yet been discovered. Here we identify the RNA recognition motif-containing protein Shep as a direct interactor of two individual components of the(More)
Chromatin insulators are functionally conserved DNA-protein complexes situated throughout the genome that organize independent transcriptional domains. Previous work implicated RNA as an important cofactor in chromatin insulator activity, although the precise mechanisms are not yet understood. Here we identify the exosome, the highly conserved major(More)
Here we introduce metaseq, a software library written in Python, which enables loading multiple genomic data formats into standard Python data structures and allows flexible, customized manipulation and visualization of data from high-throughput sequencing studies. We demonstrate its practical use by analyzing multiple datasets related to chromatin(More)
This peer-reviewed article was published immediately upon acceptance. It can be downloaded, printed and distributed freely for any purposes (see copyright notice below). which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Background
  • 1