Marcelo Bento Soares

Learn More
We present a draft sequence of the genome of Aedes aegypti, the primary vector for yellow fever and dengue fever, which at approximately 1376 million base pairs is about 5 times the size of the genome of the malaria vector Anopheles gambiae. Nearly 50% of the Ae. aegypti genome consists of transposable elements. These contribute to a factor of approximately(More)
The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high(More)
Large-scale sequencing of cDNAs randomly picked from libraries has proven to be a very powerful approach to discover (putatively) expressed sequences that, in turn, once mapped, may greatly expedite the process involved in the identification and cloning of human disease genes. However, the integrity of the data and the pace at which novel sequences can be(More)
To accelerate the molecular analysis of behavior in the honey bee (Apis mellifera), we created expressed sequence tag (EST) and cDNA microarray resources for the bee brain. Over 20,000 cDNA clones were partially sequenced from a normalized (and subsequently subtracted) library generated from adult A. mellifera brains. These sequences were processed to(More)
The neuronal ceroid lipofuscinoses (NCLs) are a genetically heterogeneous group of progressive neurodegenerative disorders characterized by the accumulation of autofluorescent lipopigment in various tissues. Progressive epilepsy with mental retardation (EPMR, MIM 600143) was recently recognized as a new NCL subtype (CLN8). It is an autosomal recessive(More)
The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues.(More)
Schistosoma mansoni is the primary causative agent of schistosomiasis, which affects 200 million individuals in 74 countries. We generated 163,000 expressed-sequence tags (ESTs) from normalized cDNA libraries from six selected developmental stages of the parasite, resulting in 31,000 assembled sequences and 92% sampling of an estimated 14,000 gene(More)
Dinoflagellates are important marine primary producers and grazers and cause toxic "red tides". These taxa are characterized by many unique features such as immense genomes, the absence of nucleosomes, and photosynthetic organelles (plastids) that have been gained and lost multiple times. We generated EST sequences from non-normalized and normalized cDNA(More)
Using the data set of 180,000 expressed sequence tags (ESTs) of the blood fluke Schistosoma mansoni generated recently by our group, we identified three novel long-terminal-repeat (LTR)- and one novel non-LTR-expressed retrotransposon, named Saci-1, -2, and -3 and Perere, respectively. Full-length sequences were reconstructed from ESTs and have deduced open(More)