• Publications
  • Influence
A survey of named entity recognition and classification
TLDR
Observations about languages, named entity types, domains and textual genres studied in the literature, along with other critical aspects of NERC such as features and evaluation methods, are reported. Expand
The SemEval-2007 WePS Evaluation: Establishing a benchmark for the Web People Search Task
TLDR
The task definition, resources, participation, and comparative results for the Web People Search task, which was organized as part of the SemEval-2007 evaluation exercise, are presented. Expand
Discovering Relations among Named Entities from Large Corpora
TLDR
Using one year of newspapers reveals not only that the relations among named entities could be detected with high recall and precision, but also that appropriate labels could be automatically provided for the relations. Expand
Extended Named Entity Hierarchy
TLDR
A Named Entity hierarchy which contains about 150 NE types is proposed and it is proposed that this resource for any application should be provided. Expand
WePS 2 Evaluation Campaign: Overview of the Web People Search Clustering Task
TLDR
The definition, resources, methodology and evaluation metrics, participation and comparative results for the clustering task are presented. Expand
An Improved Extraction Pattern Representation Model for Automatic IE Pattern Acquisition
TLDR
A new model, the Subtree model, based on arbitrary subtrees of dependency trees, is introduced, describing a discovery procedure for this model and demonstrating experimentally an improvement in recall using Subtree patterns. Expand
Automatic paraphrase acquisition from news articles
TLDR
This is the initial attempt at automatically extracting paraphrases from a corpus, and the results are promising. Expand
Japanese Dependency Structure Analysis Based on Maximum Entropy Models
TLDR
A dependency structure analysis of Japanese sentences based on the maximum entropy models is described, created by learning the weights of some features from a training corpus to predict the dependency between bunsetsus or phrasal units. Expand
Preemptive Information Extraction using Unrestricted Relation Discovery
TLDR
A technique called Unrestricted Relation Discovery is proposed that discovers all possible relations from texts and presents them as tables in order to extend the boundary of Information Extraction systems. Expand
Semi-supervised Relation Extraction with Large-scale Word Clustering
TLDR
A simple semi-supervised relation extraction system with large-scale word clustering that consistently outperformed a state-of-the-art supervised baseline system when training on different sizes of data. Expand
...
1
2
3
4
5
...