Towards Structure-sensitive Hypertext Categorization

Abstract

Hypertext categorization is the task of automatically assigning category labels to hypertext units. Like text categorization, it falls within the area of function learning based on the bag-of-features approach. This setting faces the problem of a many-to-many relation between websites and their hidden logical document structure. The paper argues that…
DOI: 10.1007/3-540-31314-1_49
