Open Information Extraction from the Web
@article{Banko2008OpenIE, title={Open Information Extraction from the Web}, author={M. Banko and Michael J. Cafarella and S. Soderland and M. Broadhead and Oren Etzioni}, journal={Commun. ACM}, year={2008}, volume={51}, pages={68-74} }
Traditionally, Information Extraction (IE) has focused on satisfying precise, narrow, pre-specified requests from small homogeneous corpora (e.g., extract the location and time of seminars from a set of announcements. [...] Key Method The paper also introduces TEXTRUNNER, a fully implemented, highly scalable OIE system where the tuples are assigned a probability and indexed to support efficient extraction and exploration via user queries. We report on experiments over a 9,000,000 Web page corpus that compare…Expand Abstract
Supplemental Content
Presentation Slides
2,111 Citations
Prioritization of Domain-Specific Web Information Extraction
- Computer Science
- AAAI
- 2010
- 10
- Highly Influenced
- PDF
References
SHOWING 1-2 OF 2 REFERENCES
YAGO: A Large Ontology from Wikipedia and WordNet
- Computer Science
- J. Web Semant.
- 2008
- 806
- Highly Influential
- PDF
Automatically semantifying wikipedia
- Proceedings of 16th Conference on Information and Knowledge Management (CIKM)
- 2007