Lancelets ('amphioxus') are the modern survivors of an ancient chordate lineage, with a fossil record dating back to the Cambrian period. Here we describe the structure and gene content of the highly polymorphic approximately 520-megabase genome of the Florida lancelet Branchiostoma floridae, and analyse it in the context of chordate evolution. Whole-genome ...
Despite the known existence of distant-acting cis-regulatory elements in the human genome, only a small fraction of these elements has been identified and experimentally characterized in vivo. This paucity of enhancer collections with defined activities has thus hindered computational approaches for the genome-wide prediction of enhancers and their ...
Coronary heart disease (CHD) is a major cause of death in Western countries. We used genome-wide association scanning to identify a 58-kilobase interval on chromosome 9p21 that was consistently associated with CHD in six independent samples (more than 23,000 participants) from four Caucasian populations. This interval, which is located near the CDKN2A and ...
The paucity of enzymes that efficiently deconstruct plant polysaccharides represents a major bottleneck for industrial-scale conversion of cellulosic biomass into biofuels. Cow rumen microbes specialize in degradation of cellulosic plant material, but most members of this complex community resist cultivation. To characterize biomass-degrading genes and ...
Nonalcoholic fatty liver disease (NAFLD) is a burgeoning health problem of unknown etiology that varies in prevalence among ancestry groups. To identify genetic variants contributing to differences in hepatic fat content, we carried out a genome-wide association scan of nonsynonymous sequence variations (n = 9,229) in a population comprising Hispanic, ...
A major yet unresolved quest in decoding the human genome is the identification of the regulatory sequences that control the spatial and temporal expression of genes. Distant-acting transcriptional enhancers are particularly challenging to uncover because they are scattered among the vast non-coding portion of the genome. Evolutionary sequence constraint ...
Agenesis of the corpus callosum (AgCC) is a congenital brain malformation that occurs in approximately 1:1,000-1:6,000 births. Several syndromes associated with AgCC have been traced to single gene mutations; however, the majority of AgCC causes remain unidentified. We investigated a mother and two children who all shared complete AgCC and a chromosomal ...
Identifying the sequences that direct the spatial and temporal expression of genes and defining their function in vivo remains a significant challenge in the annotation of vertebrate genomes. One major obstacle is the lack of experimentally validated training sets. In this study, we made use of extreme evolutionary sequence conservation as a filter to ...
Extended perfect human-rodent sequence identity of at least 200 base pairs (ultraconservation) is potentially indicative of evolutionary or functional uniqueness. We used a transgenic mouse assay to compare the embryonic enhancer activity of 231 noncoding ultraconserved human genome regions with that of 206 extremely conserved regions lacking ...
Comparison of genomic DNA sequences from human and mouse revealed a new apolipoprotein (APO) gene (APOAV) located proximal to the well-characterized APOAI/CIII/AIV gene cluster on human 11q23. Mice expressing a human APOAV transgene showed a decrease in plasma triglyceride concentrations to one-third of those in control mice; conversely, knockout mice ...