• Corpus ID: 10237124

Information Extraction as a core language technology What is IE ?

  title={Information Extraction as a core language technology What is IE ?},
  author={Yorick Wilks},
Information Extraction (IE) technology is now coming on to the market and is of great significance to information enduser industries of all kinds, especially finance companies, banks, publishers and governments. For instance, finance companies want to know facts of the following sort and on a large scale: what company takeovers happened in a given time span; they want widely scattered text information reduced to a simple data base. Lloyds of London need to know of daily ship sinkings throughout… 
Information extraction on the semantic web: utilizing the resource description framework in information extraction
This work presents the extension of existing Information Extraction approaches by incorporating the graph-based nature of RDF, and contributes approaches for computing customizable and adaptable RDF views on the natural language content of Web pages.
The Annual Report Algorithm: Retrieval of Financial Statements and Extraction of Textual Information
Which financial statements are publicly available by analyzing the entire SEC EDGAR database since its implementation in 1993 and showing how to retrieve financial statements in a fast and efficient way fromEDGAR.
Below the Surface : Synthesising Technology for TheInformation
The problem of information overload to some extent obscures the rapid increase in specialised and novel information domains. Though the content of the web continues to expand the manner in which the
Enhancing knowledge acquisition with constraint technology
(Dis)Prove Goal (using Constraint Satisfaction) (using Karlsruhe Interactive Verifier) Properties Properties


Evaluation of an Algorithm for the Recognition and Classification of Proper Names
We describe an information extraction system in which four classes of naming expressions - organisation, person, location and time names - are recognised and classified with nearly 92% combined
Making Preferences More Active
  • Y. Wilks
  • Computer Science
    Artif. Intell.
  • 1978
It is argued that this would be a good context in which to place a language understander (rather than that of question-answering about a limited area of the real world, as is normal) and that the sense projection mechanisms suggested would provide a test-bed on which the usefulness of frames for language understanding could be realistically assessed.
Automatically Acquiring Conceptual Patterns without an Annotated Corpus
It is proposed that conceptual patterns for information extraction can be acquired automatically using only a preclassified training corpus and no text annotations, and a system called AutoSlog-TS is described, which is a variation of the previous AutoS Log system that runs exhaustively on an untagged text corpus.
Role of Word Sense Disalnbiguation in Lexical Acquisition: Predicting Semantics from Syntactic Cues
This paper addresses the issue of word-sense ambiguity in extraction from machine-readable resources for the construction of large-scale knowledge sources and shows that a 15-fold improvement can be achieved in deriving semantic information from syntactic cues if the authors first divide the Syntactic cues into distinct groupings that correlate with different word senses.
Dictionary word sense distinctions: An enquiry into their nature
The two studies described here look into their grounds for making distinctions, developing a classification scheme to describe the commonly occurring distinction types and a view of the ontological status of dictionary word senses.
Electric Words: Dictionaries, Computers, and Meanings
A short history of meaning symbolic accounts of definitional meaning primitives in meaning definition wordbooks as human artifacts and tasks and tools text analysis and its relationship to dictionaries.
FOUL-UP: A Program that Figures Out Meanings of Words from Context
The inferencing task of figuring out words from context is implemented in the presence of a large database of world knowledge to figure out likely definitions for unknown words, and to create context-specific definitions for such words.
On The Semantic Interpretation of Nominals
A structured semantic representation is proposed, the Lexical Conceptual Paradigm (LCP) which groups nouns into paradigmatic classes exhibiting like behavior, and argues that many cases of polysemy have well-defined calculi, which interact with the grammar in predictable and determinate ways for disambiguation.
Information Extraction, in (Y. Wilks, ed
  • Computers and the Humanities,
  • 1996
Information Extraction, in (Y. Wilks, ed
  • Computers and the Humanities,
  • 1996