Automatic Webpage Classification Enhanced by Unlabeled Data

Abstract

This paper describes a novel method for webpage classification that uses unlabeled data. The proposed method is based on a sequential learning of the classifiers which are trained on a small number of labeled data and then augmented by a large number of unlabeled data. By taking advantage of unlabeled data, the effective number of labeled data needed is… (More)
DOI: 10.1007/978-3-540-45080-1_113

4 Figures and Tables

Topics

  • Presentations referencing similar topics