Marc E. Colosimo

Learn More
Biology has now become an information science, and researchers are increasingly dependent on expert-curated biological databases to organize the findings from the published literature. We report here on a series of experiments related to the application of natural language processing to aid in the curation process for FlyBase. We focused on listing the(More)
Neuronal identities are specified by the combinatorial functions of activators and repressors of gene expression. Members of the well-conserved Olf/EBF (O/E) transcription factor family have been shown to play important roles in neuronal and non-neuronal development and differentiation. O/E proteins are highly expressed in the olfactory epithelium, and O/E(More)
BACKGROUND We prepared and evaluated training and test materials for an assessment of text mining methods in molecular biology. The goal of the assessment was to evaluate the ability of automated systems to generate a list of unique gene identifiers from PubMed abstracts for the three model organisms Fly, Mouse, and Yeast. This paper describes the(More)
Most C. elegans sensory neuron types consist of a single bilateral pair of neurons, and respond to a unique set of sensory stimuli. Although genes required for the development and function of individual sensory neuron types have been identified in forward genetic screens, these approaches are unlikely to identify genes that when mutated result in subtle or(More)
We have developed a challenge task for the second BioCreAtIvE (Critical Assessment of Information Extraction in Biology) that requires participating systems to provide lists of the EntrezGene (formerly LocusLink) identifiers for all human genes and proteins mentioned in a MEDLINE abstract. We are distributing 281 annotated abstracts and another 5,000(More)
BACKGROUND Phylogenetic trees are widely used to visualize evolutionary relationships between different organisms or samples of the same organism. There exists a variety of both free and commercial tree visualization software available, but limitations in these programs often require researchers to use multiple programs for analysis, annotation, and the(More)
BACKGROUND Current sequencing technology makes it practical to sequence many samples of a given organism, raising new challenges for the processing and interpretation of large genomics data sets with associated metadata. Traditional computational phylogenetic methods are ideal for studying the evolution of gene/protein families and using those to infer the(More)
Nuclear receptors regulate numerous critical biological processes. The C. elegans genome is predicted to encode approximately 270 nuclear receptors of which >250 are unique to nematodes. ODR-7 is the only member of this large divergent family whose functions have been defined genetically. ODR-7 is expressed in the AWA olfactory neurons and specifies AWA(More)
  • 1