
Wrapper (data mining)

Known as: Wrapper, Wrapper induction 
A wrapper in data mining is a program that extracts the content of a particular information source and translates it into a relational form. Many web pages…
Wikipedia
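
As a rough illustration of the definition above, the following minimal sketch turns a semi-structured page into relational rows by scanning for a left and a right delimiter per column. The function name `extract_rows`, the sample page, and the delimiter pairs are all hypothetical and chosen only for illustration; real wrappers and the induction methods in the papers below are considerably more involved.

```python
from typing import List, Tuple


def extract_rows(page: str, delimiters: List[Tuple[str, str]]) -> List[Tuple[str, ...]]:
    """Scan `page` left to right and return one tuple per record.

    `delimiters` holds one (left, right) string pair per column
    (an assumed, simplified wrapper representation).
    """
    rows: List[Tuple[str, ...]] = []
    if not delimiters:
        return rows  # nothing to extract without column delimiters
    pos = 0
    while True:
        row = []
        for left, right in delimiters:
            start = page.find(left, pos)
            if start == -1:
                return rows  # no further records; drop any partial row
            start += len(left)
            end = page.find(right, start)
            if end == -1:
                return rows  # unterminated field; stop scanning
            row.append(page[start:end].strip())
            pos = end + len(right)
        rows.append(tuple(row))


# Hypothetical sample page: each list item holds a name and a code.
PAGE = "<li><b>Congo</b> <i>242</i></li><li><b>Egypt</b> <i>20</i></li>"
DELIMS = [("<b>", "</b>"), ("<i>", "</i>")]

rows = extract_rows(PAGE, DELIMS)
```

Here the delimiter pairs are written by hand; wrapper induction, as the term is used in this topic, refers to learning such extraction rules from labeled example pages instead.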

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2016
In this paper, we present a meta-analysis of several Web content extraction algorithms, and make recommendations for the future…
2012
The NLP Interchange Format (NIF) is an RDF/OWL-based format that provides interoperability between Natural Language Processing…
2011
Information distributed through the Web keeps growing faster day by day, and for this reason, several techniques for extracting…
Highly Cited
2011
We present a generic framework to make wrapper induction algorithms tolerant to noise in the training data. This enables us to…
Highly Cited
2004
With the tremendous amount of information that becomes available on the Web on a daily basis, the ability to quickly develop…
Highly Cited
2004
This work applies boosted wrapper induction (BWI), a machine learning algorithm for information extraction from semi-structured…
Highly Cited
2003
Many tools have been developed to help users query, extract and integrate data from web pages generated dynamically from…
2003
Several commercial applications, such as online comparison shopping and process automation, require integrating information that…
Review
2000
The Internet presents numerous sources of useful information—telephone directories, product catalogs, stock quotes, event…
Review
1997
Many Internet information resources present relational data: telephone directories, product catalogs, etc. Because these sites are…