• Publications
  • Influence
A Study on Similarity and Relatedness Using Distributional and WordNet-based Approaches
This paper presents and compares WordNet-based and distributional similarity approaches. The strengths and weaknesses of each approach regarding similarity and relatedness tasks are discussed, and aExpand
  • 777
  • 132
  • Open Access
Using Encyclopedic Knowledge for Named entity Disambiguation
We present a new method for detecting and disambiguating named entities in open domain text. A disambiguation SVM kernel is trained to exploit the high coverage and rich structure of the knowledgeExpand
  • 858
  • 76
  • Open Access
Recovering Semantics of Tables on the Web
The Web offers a corpus of over 100 million tables [6], but the meaning of each table is rarely explicit from the table itself. Header rows exist in few cases and even when they do, the attributeExpand
  • 297
  • 33
  • Open Access
FALCON: Boosting Knowledge for Answer Engines
This paper discusses FALCON, an answer engine that integrates different forms of syntactic, semantic and pragmatic knowledge for the goal of achieving better performance.
  • 315
  • 19
LASSO: A Tool for Surfing the Answer Net
This paper presents the architecture, operation and results obtained with the Lasso system developed in the Natural Language Processing Laboratory at SMU. The system relies on a combination ofExpand
  • 208
  • 17
Weakly-supervised discovery of named entities using web search queries
A seed-based framework for textual information extraction allows for weakly supervised extraction of named entities from anonymized Web search queries. The extraction is guided by a small set of seedExpand
  • 135
  • 17
Performance issues and error analysis in an open-domain question answering system
This paper presents an in-depth analysis of a state-of-the-art Question Answering system. Several scenarios are examined: (1) the performance of each module in a serial baseline system, (2) theExpand
  • 201
  • 15
  • Open Access
Names and Similarities on the Web: Fact Extraction in the Fast Lane
In a new approach to large-scale extraction of facts from unstructured text, distributional similarities become an integral part of both the iterative acquisition of high-coverage contextualExpand
  • 105
  • 12
  • Open Access
The Structure and Performance of an Open-Domain Question Answering System
This paper presents the architecture, operation and results obtained with the LASSO Question Answering system developed in the Natural Language Processing Laboratory at SMU. To find answers, theExpand
  • 184
  • 10
  • Open Access
Acquisition of categorized named entities for web search
The recognition of names and their associated categories within unstructured text traditionally relies on semantic lexicons and gazetteers. The amount of effort required to assemble large lexiconsExpand
  • 143
  • 9
  • Open Access