Paulo P. Amaral

Learn More
Studies of the transcriptional output of the human and mouse genomes have revealed that there are many more transcripts produced than can be accounted for by predicted protein-coding genes. Using a custom microarray, we have identified 184 non-coding RNAs that exhibit more than twofold up- or down-regulation upon differentiation of C2C12 myoblasts into(More)
The transcriptional networks that regulate embryonic stem (ES) cell pluripotency and lineage specification are the subject of considerable attention. To date such studies have focused almost exclusively on protein-coding transcripts. However, recent transcriptome analyses show that the mammalian genome contains thousands of long noncoding RNAs (ncRNAs),(More)
The past few years have revealed that the genomes of all studied eukaryotes are almost entirely transcribed, generating an enormous number of non-protein-coding RNAs (ncRNAs). In parallel, it is increasingly evident that many of these RNAs have regulatory functions. Here, we highlight recent advances that illustrate the diversity of ncRNA control of genome(More)
Large numbers of long RNAs with little or no protein-coding potential [long noncoding RNAs (lncRNAs)] are being identified in eukaryotes. In parallel, increasing data describing the expression profiles, molecular features and functions of individual lncRNAs in a variety of systems are accumulating. To enable the systematic compilation and updating of this(More)
The Sox2 gene is a key regulator of pluripotency embedded within an intron of a long noncoding RNA (ncRNA), termed Sox2 overlapping transcript (Sox2ot), which is transcribed in the same orientation. However, this ncRNA remains uncharacterized. Here we show that Sox2ot has multiple transcription start sites associated with genomic features that indicate(More)
RNAs transcribed from intronic regions of genes are involved in a number of processes related to post-transcriptional control of gene expression. However, the complement of human genes in which introns are transcribed, and the number of intronic transcriptional units and their tissue expression patterns are not known. A survey of mRNA and EST public(More)
Non-protein-coding sequences increasingly dominate the genomes of multicellular organisms as their complexity increases, in contrast to protein-coding genes, which remain relatively static. Most of the mammalian genome and indeed that of all eukaryotes is expressed in a cell- and tissue-specific manner, and there is mounting evidence that much of this(More)
1 Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia, 2 Watson School of Biological Sciences, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America, 3 Broad Institute, Cambridge, Massachusetts, United States of America, 4 MRC Functional Genomics Unit, Department of Physiology,(More)
Genome-wide analyses of the eukaryotic transcriptome have revealed that the majority of the genome is transcribed, producing large numbers of non-protein-coding RNAs (ncRNAs). This surprising observation challenges many assumptions about the genetic programming of higher organisms and how information is stored and organized within the genome. Moreover, the(More)