William J. Black

Learn More
INTRODUCTION In this paper, we describe the system used by the UMIST team as members of the FACILE consortium, to undertake the NE task in MUC-7. The main characteristics of this system employed are as follows: it is rule-based its rule formalism supports context-sensitive partial parsing rules may use pattern-matching-style iteration operators the notation(More)
UK PubMed Central (UKPMC) is a full-text article database that extends the functionality of the original PubMed Central (PMC) repository. The UKPMC project was launched as the first 'mirror' site to PMC, which in analogy to the International Nucleotide Sequence Database Collaboration, aims to provide international preservation of the open and free-access(More)
Text Mining is a relatively new area of research, very interesting for both computational linguists and data miners. It involves collecting and analyzing quantities of textual data by domain experts, whose main task is the manual revision of markup. We describe a suite of tools used to simplify the process: the Parmenides System that consists of data(More)
Successfully managing information means being able to find relevant new information and to correctly integrate it with pre-existing knowledge. Much information is nowadays stored as multilingual textual data; therefore advanced classification systems are currently considered as strategic components for effective knowledge management. We describe an(More)
We describe our last results at the CoNLL2002 shared task of Named Entity Recognition and Classiication using two approaches that we rst applied to other NLL problems. We have been developing our own modiied TBL learner initially to tackle the Part-of-Speech tagging problem, for integration in a hybrid NLL and rule-based system for information extraction(More)
This paper describes an advanced system for multilingual text classification adaptable to different user needs. The system has been initially developed as an applied research project involving both research centres, industrial bodies and end-user organizations. The project is a considerable success story in the financial field. Three different successful(More)
Strategic decision making, especially in the areas of business intelligence and competitive intelligence, requires the acquisition of decision-relevant information pieces like market trends, fusions and company values. This information is extracted by pre-processing and querying multiple sources, combining and condensing the findings. It is characteristic(More)
Research on specialized biological systems is often hampered by a lack of consistent terminology, especially across species. In bacterial Type IV secretion systems genes within one set of orthologs may have over a dozen different names. Classifying research publications based on biological processes, cellular components, molecular functions, and(More)