Learning Information Extraction Rules for Semi-Structured and Free Text

@article{Soderland2004LearningIE,
  title={Learning Information Extraction Rules for Semi-Structured and Free Text},
  author={S. Soderland},
  journal={Machine Learning},
  year={1999},
  volume={34},
  pages={233--272}
}
A wealth of on-line text information can be made available to automatic processing by information extraction (IE) systems. Each IE application needs a separate set of rules tuned to the domain and writing style. WHISK helps to overcome this knowledge-engineering bottleneck by learning text extraction rules automatically. WHISK is designed to handle text styles ranging from highly structured to free text, including text that is neither rigidly formatted nor composed of grammatical sentences. Such…
1,060 Citations
Global Rule Induction for Information Extraction
  • 2
A global rule induction approach to information extraction
  • J. Xiao, Tat-Seng Chua, J. Liu
  • Computer Science
  • Proceedings. 15th IEEE International Conference on Tools with Artificial Intelligence
  • 2003
  • 9
  • Highly Influenced
Looking for Information in Documents
Bottom-Up Relational Learning of Pattern Matching Rules for Information Extraction
  • 233
  • Highly Influenced
Hidden Markov Models and Text Classifiers for Information Extraction on Semi-Structured Texts
  • 2
Mining with Information Extraction
  • 2
EXTRACTING CHEMICAL INFORMATION FROM THAI UNSTRUCTURED TEXT WITH UNKNOWN PHRASE BOUNDARIES
  • Highly Influenced
Information extraction from unstructured web text
  • 28
A Multi-resolution Framework for Information Extraction from Free Text
  • 48
