Automatic Extraction of Facts from Press Releases to Generate News Stories

Peggy M. Andersen, Philip J. Hayes, Steven P. Weinstein, Alison K. Huettner, Linda M. Schmandt, Irene B. Nirenburg
While complete understanding of arbitrary input text remains in the future, it is currently possible to construct natural language processing systems that provide a partial understanding of text with limited accuracy. Moreover, such systems can provide cost-effective solutions to commercially significant business problems. This paper describes one such system: JASPER. JASPER is a fact extraction system recently developed and deployed by Carnegie Group for Reuters Ltd. JASPER uses a template…
Information Extraction: Past, Present and Future
Although the targeted nature of this task makes it more tractable than some of the more open-ended tasks in NLP, it is replete with challenges as the information landscape evolves, which also makes it an exciting research subject.
Chapter 2 Information Extraction: Past, Present and Future
In this chapter we present a brief overview of Information Extraction, which is an area of natural language processing that deals with finding factual information in free text. In formal terms, facts…
Template Mining for Information Extraction from Digital Documents
This article briefly reviews template mining research and shows how templates are used in Web search engines, such as Alta Vista, and in meta-search engines, such as Ask Jeeves, to help end users generate natural language search expressions.
Information Extraction: Beyond Document Retrieval
In this paper we give a synoptic view of the growth of information extraction (IE), a text processing technology whose function is to extract information about a pre-specified set of entities,…
Iterative joint extraction of entities, relationships and coreferences from text sources
  • Slavko Zitnik, M. Bajec
  • Computer Science
  • 2015 IEEE 9th International Conference on Research Challenges in Information Science (RCIS)
  • 2015
A novel iterative, joint information extraction system interconnects all three tasks using iterative feature functions that take advantage of intermediate extractions, and obtains a 15% error reduction on named entity recognition over individual models.
Advances in Information Extraction
This chapter gives a comprehensive introduction to information extraction technology including design, processing natural language, and evaluation issues of IE systems, and demonstrates an enormous indexing potential of lightweight linguistic text processing techniques in other areas of information technology closely related to IE.
EReXS: Event and Relations Extraction for SWHi by Proscovia Olango Submitted to the Department of Information
Automatic event extraction from full-text resources is a combination of human language technology (HLT) and semantic web technologies. It can also be done on the basis of purely statistical means with…
Template mining for the extraction of citation from digital documents
This study hypothesizes that the template mining technique can be used to extract citation information from printed and digital full-text articles so that universal or semi-universal citation databases can be automatically established in the near future.
Content management in the SYNDIKATE system - How technical documents are automatically transformed to text knowledge bases
This work presents a general system architecture which integrates requirements from the analysis of single sentences, as well as those of referentially linked sentences forming cohesive texts, for soundness and validity of the generated text representation structures.
Information Extraction from Free-Text Business Documents
The enormous indexing potential of lightweight linguistic text processing techniques is demonstrated, as applied in information extraction systems and in other closely related fields of information technology that process vast amounts of textual data.


A News Analysis System
This paper describes a prototype news analysis system which classifies and indexes news stories in real time, using a Government-Binding parser which processes each sentence of a news item.
SCISOR: extracting information from on-line news
The future of natural language text processing is examined in the SCISOR prototype, drawing on artificial intelligence techniques, and applying them to financial news items through a combination of bottom-up and top-down processing.
The PUNDIT natural-language processing system
The authors describe the PUNDIT (Prolog Understanding of Integrated Text) text-understanding system, which is designed to analyze and construct representations of paragraph-length text that is flexible, extensible and portable.
TCS: a shell for content-based text categorization
The kind of application that the text categorization shell, TCS, can produce is characterized, how it meets its design goals is described, and examples of applications built with TCS are given.
Interpretation as Abduction
An approach to abductive inference, called “weighted abduction”, that has resulted in a significant simplification of how the problem of interpreting texts is conceptualized, can be combined with the older view of “parsing as deduction” to produce an elegant and thorough integration of syntax, semantics, and pragmatics.
ATRANS: Automatic Processing of Money Transfer Messages
This paper illustrates the solutions to a number of problems that arise when an academic theory is applied to a real-world problem, and focuses on the problem of context localization in the absence of reliable syntactic clues, such as sentence boundaries.
An Equipment Model and Its Role in the Interpretation of Nominal Compounds
This work shows problems which can be solved only if the system has access to a detailed equipment model, and discusses features of such models, in particular, their ability to simulate the equipment's behavior.