High-throughput mRNA sequencing (RNA-Seq) promises simultaneous transcript discovery and abundance estimation. However, this would require algorithms that are not restricted by prior gene annotations and that account for alternative transcription and splicing. Here we introduce such algorithms in an open-source software program called Cufflinks. To test…
The DJ-1 gene encodes a ubiquitous, highly conserved protein. Here, we show that DJ-1 mutations are associated with PARK7, a monogenic form of human parkinsonism. The function of the DJ-1 protein remains unknown, but evidence suggests its involvement in the oxidative stress response. Our findings indicate that loss of DJ-1 function leads to…
High-throughput mRNA sequencing (RNA-Seq) holds the promise of simultaneous transcript discovery and abundance estimation [1-3]. We introduce an algorithm for transcript assembly coupled with a statistical model for RNA-Seq experiments that produces estimates of abundances. Our algorithms are implemented in an open-source software program called Cufflinks. To…
To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties…
Drosophila melanogaster cell lines are important resources for cell biologists. Here, we catalog the expression of exons, genes, and unannotated transcriptional signals for 25 lines. Unannotated transcription is substantial (typically 19% of euchromatic signal). Conservatively, we identify 1405 novel transcribed regions; 684 of these appear to be new exons…
Preaxial polydactyly (PPD) is a common limb malformation in humans. A number of polydactylous mouse mutants indicate that misexpression of Shh is a common requirement for generating extra digits. Here we identify a translocation breakpoint in a PPD patient and a transgenic insertion site in the polydactylous mouse mutant sasquatch (Ssq). The genetic lesions…
Phytochrome photosensors control a vast gene network in streptophyte plants, acting as master regulators of diverse growth and developmental processes throughout the life cycle. In contrast with their absence in known chlorophyte algal genomes and most sequenced prasinophyte algal genomes, a phytochrome is found in Micromonas pusilla, a widely distributed…
Correct gene prediction is impaired by the presence of processed pseudogenes: nonfunctional, intronless copies of real genes found elsewhere in the genome. Gene prediction programs frequently mistake processed pseudogenes for real genes or exons, leading to biologically irrelevant gene predictions. While methods exist to identify processed pseudogenes in…
Since its start, the Mammalian Gene Collection (MGC) has sought to provide at least one full-protein-coding sequence cDNA clone for every human and mouse gene with a RefSeq transcript, and at least 6200 rat genes. The MGC cloning effort initially relied on random expressed sequence tag screening of cDNA libraries. Here, we summarize our recent progress…
Preaxial polydactyly is a congenital hand malformation that includes duplicated thumbs, various forms of triphalangeal thumbs, and duplications of the index finger. A locus for preaxial polydactyly has been mapped to a region of 1.9 cM on chromosome 7q36 between polymorphic markers D7S550 and D7S2423. We constructed a detailed physical map of the preaxial…