Learn More
Lancelets ('amphioxus') are the modern survivors of an ancient chordate lineage, with a fossil record dating back to the Cambrian period. Here we describe the structure and gene content of the highly polymorphic approximately 520-megabase genome of the Florida lancelet Branchiostoma floridae, and analyse it in the context of chordate evolution. Whole-genome(More)
Despite the known existence of distant-acting cis-regulatory elements in the human genome, only a small fraction of these elements has been identified and experimentally characterized in vivo. This paucity of enhancer collections with defined activities has thus hindered computational approaches for the genome-wide prediction of enhancers and their(More)
The paucity of enzymes that efficiently deconstruct plant polysaccharides represents a major bottleneck for industrial-scale conversion of cellulosic biomass into biofuels. Cow rumen microbes specialize in degradation of cellulosic plant material, but most members of this complex community resist cultivation. To characterize biomass-degrading genes and(More)
A major yet unresolved quest in decoding the human genome is the identification of the regulatory sequences that control the spatial and temporal expression of genes. Distant-acting transcriptional enhancers are particularly challenging to uncover because they are scattered among the vast non-coding portion of the genome. Evolutionary sequence constraint(More)
Nonalcoholic fatty liver disease (NAFLD) is a burgeoning health problem of unknown etiology that varies in prevalence among ancestry groups. To identify genetic variants contributing to differences in hepatic fat content, we carried out a genome-wide association scan of nonsynonymous sequence variations (n = 9,229) in a population comprising Hispanic,(More)
Coronary heart disease (CHD) is a major cause of death in Western countries. We used genome-wide association scanning to identify a 58-kilobase interval on chromosome 9p21 that was consistently associated with CHD in six independent samples (more than 23,000 participants) from four Caucasian populations. This interval, which is located near the CDKN2A and(More)
The discovery of rare genetic variants is accelerating, and clear guidelines for distinguishing disease-causing sequence variants from the many potentially functional variants present in any human genome are urgently needed. Without rigorous standards we risk an acceleration of false-positive reports of causality, which would impede the translation of(More)
Extended perfect human-rodent sequence identity of at least 200 base pairs (ultraconservation) is potentially indicative of evolutionary or functional uniqueness. We used a transgenic mouse assay to compare the embryonic enhancer activity of 231 noncoding ultraconserved human genome regions with that of 206 extremely conserved regions lacking(More)
The functional consequences of genetic variation in mammalian regulatory elements are poorly understood. We report the in vivo dissection of three mammalian enhancers at single-nucleotide resolution through a massively parallel reporter assay. For each enhancer, we synthesized a library of >100,000 mutant haplotypes with 2-3% divergence from the wild-type(More)
Comparison of genomic DNA sequences from human and mouse revealed a new apolipoprotein (APO) gene (APOAV) located proximal to the well-characterized APOAI/CIII/AIV gene cluster on human 11q23. Mice expressing a human APOAV transgene showed a decrease in plasma triglyceride concentrations to one-third of those in control mice; conversely, knockout mice(More)