Learn More
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned(More)
The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organise biology around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of the human genome sequence, with confirmed gene predictions that have been integrated with external data sources, and is available as either an(More)
The Bioperl project is an international open-source collaboration of biologists, bioinformaticians, and computer scientists that has evolved over the past 7 yr into the most comprehensive library of Perl modules available for managing and manipulating life-science information. Bioperl provides an easy-to-use, stable, and consistent programming interface for(More)
This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and(More)
The compact genome of Fugu rubripes has been sequenced to over 95% coverage, and more than 80% of the assembly is in multigene-sized scaffolds. In this 365-megabase vertebrate genome, repetitive DNA accounts for less than one-sixth of the sequence, and gene loci occupy about one-third of the genome. As with the human genome, gene loci are not evenly(More)
Most of the mammalian genome is transcribed. This generates a vast repertoire of transcripts that includes protein-coding messenger RNAs, long non-coding RNAs (lncRNAs) and repetitive sequences, such as SINEs (short interspersed nuclear elements). A large percentage of ncRNAs are nuclear-enriched with unknown function. Antisense lncRNAs may form(More)
Evolution is typically thought to proceed through divergence of genes, proteins and ultimately phenotypes. However, similar traits might also evolve convergently in unrelated taxa owing to similar selection pressures. Adaptive phenotypic convergence is widespread in nature, and recent results from several genes have suggested that this phenomenon is(More)
Molecular phylogenetics has rapidly established the evolutionary positions of most major mammal groups, yet analyses have repeatedly failed to agree on that of bats (order Chiroptera). Moreover, the relationship among the major bat lineages has proven equally contentious, with ongoing disagreements about whether echolocating bats are paraphyletic or a true(More)
Metachromatic leukodystrophy (MLD) is an inherited lysosomal storage disease caused by arylsulfatase A (ARSA) deficiency. Patients with MLD exhibit progressive motor and cognitive impairment and die within a few years of symptom onset. We used a lentiviral vector to transfer a functional ARSA gene into hematopoietic stem cells (HSCs) from three(More)
Barrett's esophagus is an increasingly common disease that is strongly associated with reflux of stomach acid and usually a hiatus hernia, and it strongly predisposes to esophageal adenocarcinoma (EAC), a tumor with a very poor prognosis. We report the first genome-wide association study on Barrett's esophagus, comprising 1,852 UK cases and 5,172 UK(More)