Skip to search formSkip to main content
You are currently offline. Some features of the site may not work correctly.

Wrapper (data mining)

Known as: Wrapper, Wrapper induction 
Wrapper in data mining is a program that extracts content of a particular information source and translates it into a relational form. Many web pages… Expand
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2016
2016
In this paper, we present a meta-analysis of several Web content extraction algorithms, and make recommendations for the future… Expand
  • table I
  • figure 1
  • table II
  • table IV
  • figure 2
Is this relevant?
Highly Cited
2011
Highly Cited
2011
We present a generic framework to make wrapper induction algorithms tolerant to noise in the training data. This enables us to… Expand
  • figure 1
  • figure 2
  • table 1
  • figure 4
  • figure 3
Is this relevant?
2011
2011
Information distributed through the Web keeps growing faster day by day, and for this reason, several techniques for extracting… Expand
  • figure 1
  • figure 2
  • table I
  • table II
  • figure 3
Is this relevant?
Highly Cited
2011
Highly Cited
2011
Structured data, in the form of entities and associated attributes, has been a rich web resource for search engines and knowledge… Expand
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
2004
Highly Cited
2004
With the tremendous amount of information that becomes available on the Web on a daily basis, the ability to quickly develop… Expand
  • figure 1
  • figure 2
  • figure 6
  • table I
  • table II
Is this relevant?
Highly Cited
2004
Highly Cited
2004
This work applies boosted wrapper induction (BWI), a machine learning algorithm for information extraction from semi-structured… Expand
  • table 1
  • table 2
  • table 4
  • table 3
  • table 5
Is this relevant?
Highly Cited
2003
Highly Cited
2003
Many tools have been developed to help users query, extract and integrate data from web pages generated dynamically from… Expand
  • figure 1
  • figure 3
  • figure 4
  • figure 5
  • figure 6
Is this relevant?
Highly Cited
2003
Highly Cited
2003
Data extraction from web pages is performed by software modules called wrappers. Recently, some systems for the automatic… Expand
  • figure 1
  • figure 2
  • figure 4
  • figure 3
Is this relevant?
Highly Cited
1999
Highly Cited
1999
With the tremendous amount of information that becomes available on the Web on a daily basis, the ability to quickly develop… Expand
  • figure 1
  • figure 2
  • figure 5
  • figure 8
  • table 1
Is this relevant?
Highly Cited
1998
Highly Cited
1998
Information mediators are systems capable of providing a unified view of several information sources. Central to any mediator… Expand
  • figure 1
  • figure 5
  • figure 7
  • figure 8
  • figure 9
Is this relevant?