Nineteen teams presented results for the Gene Mention Task at the BioCreative II Workshop. In this task, participants designed systems to identify substrings in sentences corresponding to gene name mentions. A variety of methods were used, and the results varied, with a highest achieved F1 score of 0.8721. Here we present brief descriptions of all…
Background: Genome sciences have experienced an increasing demand for efficient text-processing tools that can extract biologically relevant information from the growing amount of published literature. In response, a range of text-mining and information-extraction tools have recently been developed specifically for the biological domain. Such tools are only…
We present a part-of-speech tagger that achieves over 97% accuracy on MEDLINE citations. Availability: Software, documentation, and a corpus of 5,700 manually tagged sentences are available at … MEDLINE is a bibliographic database of publications in health sciences, biology, and related fields. It currently contains over 12 million records, and nearly 7…
Morphological analysis as applied to English has generally involved the study of rules for inflections and derivations. Recent work has attempted to derive such rules from automatic analysis of corpora. Here we study similar issues, but in the context of the biological literature. We introduce a new approach which allows us to assign probabilities of the…
The PubMed search engine displays query results in reverse chronological order, which is appropriate for users interested in the latest publications. The purpose of this paper is to use machine learning to order documents by popularity, that is, by the predicted frequency with which an article is viewed by the average PubMed user. Other research on general search engine…
Intel Virtualization Technology: Virtualizing a computing system's physical resources to achieve improved sharing and utilization has been well established for decades [1]. Full virtualization of all system resources (including processors, memory, and I/O devices) makes it possible to run multiple operating systems on a single physical platform. In a…