Xiaodong Fang

Learn More
Next-generation massively parallel DNA sequencing technologies provide ultrahigh throughput at a substantially lower unit data cost; however, the data are very short read length sequences, making de novo assembly extremely challenging. Here, we describe a novel method for de novo assembly of large genomes from short read sequences. We successfully assembled(More)
Next-generation massively parallel sequencing technologies provide ultrahigh throughput at two orders of magnitude lower unit cost than capillary Sanger sequencing technology. One of the key applications of next-generation sequencing is studying genetic variation between individuals using whole-genome or target region resequencing. Here, we have developed a(More)
Cucumber is an economically important crop as well as a model system for sex determination studies and plant vascular biology. Here we report the draft genome sequence of Cucumis sativus var. sativus L., assembled using a novel combination of traditional Sanger and next-generation Illumina GA sequencing technologies to obtain 72.2-fold genome coverage. The(More)
The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress response and the proteome of the shell. The oyster genome is(More)
Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human(More)
The causes of amyotrophic lateral sclerosis (ALS), a devastating human neurodegenerative disease, are poorly understood, although the protein TDP-43 has been suggested to have a critical role in disease pathogenesis. Here we show that ataxin 2 (ATXN2), a polyglutamine (polyQ) protein mutated in spinocerebellar ataxia type 2, is a potent modifier of TDP-43(More)
Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality(More)
The organized societies of ants include short-lived worker castes displaying specialized behavior and morphology and long-lived queens dedicated to reproduction. We sequenced and compared the genomes of two socially divergent ant species: Camponotus floridanus and Harpegnathos saltator. Both genomes contained high amounts of CpG, despite the presence of DNA(More)
TDP-43 and FUS are RNA-binding proteins that form cytoplasmic inclusions in some forms of amyotrophic lateral sclerosis (ALS) and frontotemporal lobar degeneration (FTLD). Moreover, mutations in TDP-43 and FUS are linked to ALS and FTLD. However, it is unknown whether TDP-43 and FUS aggregate and cause toxicity by similar mechanisms. Here, we exploit a(More)
Understanding the dynamics of eukaryotic transcriptome is essential for studying the complexity of transcriptional regulation and its impact on phenotype. However, comprehensive studies of transcriptomes at single base resolution are rare, even for modern organisms, and lacking for rice. Here, we present the first transcriptome atlas for eight organs of(More)