Incremental learning with partial-supervision based on hierarchical Dirichlet process and the application for document classification

Abstract

The hierarchical Dirichlet process (HDP) is an unsupervised method that has been widely used for topic extraction and document clustering. One advantage of HDP is that it has an inherent mechanism for determining the total number of clusters or topics. However, HDP has three weaknesses: (1) there is no mechanism to use known labels or incorporate expert…
DOI: 10.1016/j.asoc.2015.04.044

13 Figures and Tables
