• Publications
  • Influence
Anaphora for Everyone: Pronominal Anaphora Resolution without a Parser
TLDR
We present an algorithm for anaphora resolution which is a modified and extended version of that developed by (Lappin and Leass, 1994). Expand
  • 340
  • 34
  • PDF
TimeML-Compliant Text Analysis for Temporal Reasoning
TLDR
We address this problem with a hybrid TimeML annotator, which uses cascaded finite-state grammars (for temporal expression analysis, shallow syntactic parsing, and feature generation) together with a machine learning component capable of effectively using large amounts of unannotated data. Expand
  • 129
  • 14
  • PDF
Automatic Glossary Extraction: Beyond Terminology Identification
TLDR
This paper describes a method for automatically extracting domain-specific glossaries from large document collections, and presents an informal evaluation of its performance. Expand
  • 169
  • 12
  • PDF
Question analysis: How Watson reads a clue
TLDR
The first stage of processing in the IBM Watson™ system is to perform a detailed analysis of the question in order to determine what it is asking for and how best to approach answering it. Expand
  • 156
  • 11
  • PDF
Enjoy the paper: lexical semantics via lexicology
TLDR
In this paper, we motivate a particular approach to lexical semantics, briefly demonstrate its computational tractability, and explore the possibility of extracting the lexical information this approach requires from MRDs. Expand
  • 104
  • 10
  • PDF
Finding needles in the haystack: Search and candidate generation
TLDR
A key phase in the DeepQA architecture is Hypothesis Generation, in which candidate system responses are generated for downstream scoring and ranking. Expand
  • 71
  • 7
  • PDF
The Derivation of a Grammatically Indexed Lexicon from the Longman Dictionary of Contemporary English
TLDR
We describe a methodology and associated software system for the construction of a large lexicon from an existing machine-readable (published) dictionary which contains partial (and occasionally in-accurate) information. Expand
  • 87
  • 6
  • PDF
Natural Language Engineering
This paper describes an environment for the generation of non-deterministic taggers, currently used for the development of a Spanish lexicon. In relation to previous approaches, our system includesExpand
  • 65
  • 6
TimeBank-Driven TimeML Analysis
  • B. Boguraev, R. Ando
  • Computer Science
  • Annotating, Extracting and Reasoning about Time…
  • 2005
TLDR
The design of TimeML as an expressive language for temporal information brings promises, and challenges; in particular, its representa- tional properties raise the bar for traditional information extraction meth- ods applied to the task. Expand
  • 37
  • 6
  • PDF
Deep parsing in Watson
Two deep parsing components, an English Slot Grammar (ESG) parser and a predicate-argument structure (PAS) builder, provide core linguistic analyses of both the questions and the text content used byExpand
  • 105
  • 5
  • PDF
...
1
2
3
4
5
...