Towards combining web classification and web information extraction: a case study

Abstract

Web content analysis often has two sequential and separate steps: Web Classification to identify the target Web pages, and Web Information Extraction to extract the metadata contained in the target Web pages. This decoupled strategy is highly ineffective since the errors in Web classification will be propagated to Web information extraction and eventually… (More)
DOI: 10.1145/1557019.1557152

Topics

6 Figures and Tables

Slides referencing similar topics