Wrapper (data mining)

Known as: Wrapper, Wrapper induction

Wrapper in data mining is a program that extracts content of a particular information source and translates it into a relational form. Many web pages…

Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.

2016

Predicate enrichment of aligned XPaths for wrapper induction

2015

Early Steps Towards Web Scale Information Extraction with LODIE

Information extraction (IE) is the technique for transforming unstructured textual data into structured representation that can…

2011

A Semantic Scraping Model for Web Resources - Applying Linked Data to Web Page Screen Scraping

José Ignacio Fernández-VillamorJacobo Blasco-GarcíaC. Á. IglesiasM. Garijo
International Conference on Agents and Artificial…
2011
Corpus ID: 264210877

In spite of the increasing presence of Semantic Web Facilities, only a limited amount of the available resources in the Internet…

2007

Detecting Informative Web Page Blocks for Efficient Information Extraction Using Visual Block Segmentation

Jinbeom KangJoongmin Choi
International Symposium on Information Technology…
2007
Corpus ID: 17398663

As the structure of a Web page is getting more complicated, the construction of wrapper induction rules becomes more difficult…

2005

Parameterless Information Extraction Using (k,l)-Contextual Tree Languages

S. RaeymaekersM. Bruynooghe
2005
Corpus ID: 14324054

Recently, several wrapper induction algorithms for structured documents have been introduced. They are based on contextual tree…

2005

Automatically maintaining wrappers for Web sources

A substantial subset of the Web data follows some kind of underlying structure. Nevertheless, HTML does not contain any schema or…

2004

Schema-based Web wrapping

Bettina FazzingaSergio FlescaAndrea Tagarelli
Knowledge and Information Systems
2004
Corpus ID: 22451363

An effective solution to automate information extraction from Web pages is represented by wrappers. A wrapper associates a Web…

Highly Cited

2003

Highly Cited

2003

Schema-guided wrapper maintenance for web-data extraction

Xiaofeng MengDongdong HuChen Li
ACM International Workshop on Web Information and…
2003
Corpus ID: 8850461

Extracting data from Web pages using wrappers is a fundamental problem arising in a large variety of applications of vast…

2003

Mining Web Sites Using Wrapper Induction, Named Entities, and Post-processing

Georgios SigletosG. PaliourasC. SpyropoulosM. Hatzopoulos
European Web Mining Forum
2003
Corpus ID: 2656420

This paper presents a new framework for extracting information from collections of Web pages across different sites. In the…

2003

Automatic wrapper generation for semi-structures biological data based on table structure identification

Liangyou ChenH. JamilNan Wang
14th International Workshop on Database and…
2003
Corpus ID: 1594023

Biological data analyses usually require complex manipulations involving tool applications, multiple Web site navigation, result…

Wrapper (data mining)

Related topics

Broader (1)

Papers overview