Corpus ID: 3179848

Learning to Extract Relations from the Web using Minimal Supervision

@inproceedings{Bunescu2007LearningTE,
  title={Learning to Extract Relations from the Web using Minimal Supervision},
  author={Razvan C. Bunescu and Raymond J. Mooney},
  booktitle={ACL},
  year={2007}
}
We present a new approach to relation extraction that requires only a handful of training examples. Given a few pairs of named entities known to exhibit or not exhibit a particular relation, bags of sentences containing the pairs are extracted from the web. We extend an existing relation extraction method to handle this weaker form of supervision, and present experimental results demonstrating that our approach can reliably extract relations from web documents. 

Figures, Tables, and Topics from this paper

Distant Supervision for Relation Extraction with an Incomplete Knowledge Base
TLDR
It is shown that a significant number of “negative“ examples generated by the labeling process are false negatives because the knowledge base is incomplete, therefore the heuristic for generating negative examples has a seriousflaw. Expand
Infusion of Labeled Data into Distant Supervision for Relation Extraction
TLDR
This paper demonstrates how a state-of-theart multi-instance multi-label model can be modified to make use of reliable sentence-level labels in addition to the relation-level distant supervision from a database. Expand
End-to-End Relation Extraction Using Distant Supervision from External Semantic Repositories
In this paper, we extend distant supervision (DS) based on Wikipedia for Relation Extraction (RE) by considering (i) relations defined in external repositories, e.g. YAGO, and (ii) any subset ofExpand
Non-Expert Correction of Automatically Generated Relation Annotations
TLDR
A new way to collect human annotated relations in text using Amazon Mechanical Turk is explored, which describes a design which encourages user efficiency and aids discovery of cheating. Expand
Distant Supervised Relation Extraction with Wikipedia and Freebase
In this paper we discuss a new approach to extract relational data from unstructured text without the need of hand labeled data. Socalled distant supervision has the advantage that it scales largeExpand
Application-Driven Relation Extraction with Limited Distant Supervision
TLDR
This work learns entity filters jointly with relation extraction using imitation learning to overcome the issue of few instances available to seed the relation extraction process, and appropriate named entity recognizers which are necessary for pre-processing do not exist. Expand
Relation Extraction with Weak Supervision and Distributional Semantics
TLDR
An effective yet efficient algorithm that combines the power of various semantic resources that are automatically mined from a corpus based on distributional semantics that is able to extract a very large set of relations from the web at high precision. Expand
Joint Distant and Direct Supervision for Relation Extraction
TLDR
This paper proposes a joint model between Web data derived with DS and manually annotated data from ACE, and shows an improvement on the previous state-of-the-art in ACE and a rather good accuracy on extracting 52 types of relations from Web data, which suggests the applicability of DS for general RE. Expand
Filling Knowledge Base Gaps for Distant Supervision of Relation Extraction
TLDR
This work proposes a simple yet novel framework that combines a passage retrieval model using coarse features into a state-of-the-art relation extractor using multi-instance learning with fine features, and adapts the information retrieval technique of pseudorelevance feedback to expand knowledge bases. Expand
Modeling Relations and Their Mentions without Labeled Text
TLDR
A novel approach to distant supervision that can alleviate the problem of noisy patterns that hurt precision by using a factor graph and applying constraint-driven semi-supervision to train this model without any knowledge about which sentences express the relations in the authors' training KB. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 17 REFERENCES
Dependency Tree Kernels for Relation Extraction
We extend previous work on tree kernels to estimate the similarity between the dependency trees of sentences. Using this kernel within a Support Vector Machine, we detect and classify relationsExpand
Unsupervised named-entity extraction from the Web: An experimental study
TLDR
An overview of KnowItAll's novel architecture and design principles is presented, emphasizing its distinctive ability to extract information without any hand-labeled training examples, and three distinct ways to address this challenge are presented and evaluated. Expand
Subsequence Kernels for Relation Extraction
We present a new kernel method for extracting semantic relations between entities in natural language text, based on a generalization of subsequence kernels. This kernel uses three types ofExpand
Kernel Methods for Relation Extraction
TLDR
This work introduces kernels defined over shallow parse representations of text, and design efficient algorithms for computing the kernels, and uses the devised kernels in conjunction with Support Vector Machine and Voted Perceptron learning algorithms for the task of extracting person-affiliation and organization-location relations from text. Expand
Constructing Biological Knowledge Bases by Extracting Information from Text Sources
TLDR
A research effort aimed at automatically mapping information from text sources into structured representations, such as knowledge bases, is begun, to use machine-learning methods to induce routines for extracting facts from text. Expand
Automatic Acquisition of Hyponyms from Large Text Corpora
TLDR
A set of lexico-syntactic patterns that are easily recognizable, that occur frequently and across text genre boundaries, and that indisputably indicate the lexical relation of interest are identified. Expand
Contrastive Estimation: Training Log-Linear Models on Unlabeled Data
TLDR
A novel approach, contrastive estimation, is described, which outperforms EM, is more robust to degradations of the dictionary, and can largely recover by modeling additional features. Expand
Content-Based Image Retrieval Using Multiple-Instance Learning
TLDR
The multiple-instance (MI) learning model is applied to use a small number of training images to learn what images from the database are of interest to the user. Expand
Supervised versus multiple instance learning: an empirical comparison
TLDR
This work empirically study the relationship between supervised and multiple instance (MI) learning by looking at a cross section of MI data sets from various domains coupled with a number of learning algorithms including Diverse Density, Logistic Regression, nonlinear Support Vector Machines and FOIL. Expand
Multi-Instance Kernels
TLDR
A kernel on multi-instance data that can be shown to separate positive and negative sets under natural assumptions is shown and compares favorably with state of the art multi- instance learning algorithms in an empirical study. Expand
...
1
2
...