The future of natural language text processing is examined in the SCISOR prototype. Drawing on artificial intelligence techniques, and applying them to financial news items, this powerful tool illustrates some of the future benefits of natural language analysis through a combination of bottom-up and top-down processing.
Ordinarily, one thinks of the problem of natural language understanding as one of making a single, left-to-right pass through an input, producing a progressively refined and detailed interpretation. In text interpretation, however, the constraints of strict left-to-right processing are an encumbrance. Multi-pass methods , especially by interpreting words… (More)
Recent work in text analysis has suggested that data on words that frequently occur together reveal important information about text content. Co-occurrence relations can serve two main purposes in language processing. First, the statistics of co-occurrence have been shown to produce accurate results in syntactic analysis. Second, the way that words appear… (More)
This is the first and final collection of a noteworthy set of research papers, which is both historical and timely. PLNLP, the Programming Language for Natural Language Processing, which emerged at IBM's Thomas J. Watson Research Center in Yorktown Heights about 15 years ago, is ostensibly the glue that holds the volume's 22 chapters together. But it is as… (More)
Neither natural language processing nor information retrieval is any longer a young field, but the two areas have yet to achieve a graceful interaction. Mainly, the reason for this incompatibility is that information retrieval technology depends upon relatively simple but robust methods, while natural language processing involves complex knowledge-based… (More)
Wc discuss ~r method for usi,,g ~ut,om~tted cor pllS ana.1'o111 li(!ws stories with brewed coverage in 7h~p~mv.~(~md English. Our al> pi'o~tch focuses Oll tyiil~ l.otAel.her word SOllS(~s> ilS-ing a. (:olribin~t~ion of workl kuowledge (ontology) with word knowledge (corpus da.t~t). ~;e expla.in l, he approach a.nd it;s results in SI[O(',UN.
This paper presents an overview of the TIPSTER/SHOGUN project, the major results, and the SHOGUN data extraction system. TIP-STER/SHOGUN was a joint effort of Management and Data Systems (formerly GE Aerospace), part of the ARPA TIPSTER Text program. Two of the main technical thrusts of the project were: (1) the development of a model of finite-state… (More)
PHRED (PHRasal English Diction is a natural language generator designed for use in a variety of domains. It was constructed to share a knowledge base with PHRAN (PHRasal ANalyzer) as part of a real-time user-friendly interface. The knowledge base consists of pattern-concept pairs, i.e., associations between linguistic structures and conceptual templates.… (More)
Language acquisition addresses two important text processing issues. The immediate problem is understanding a text in spite of the existence of lexical gaps. The long term issue is that the understander must incorporate new words into its lexicon for future use. This paper describes an approach to constructing new lexical entries in a gradual process by… (More)