A novel term weighting scheme with distributional coefficient for text categorization with support vector machine

Abstract

In text categorization, vectorizing a document by probability distribution is an effective dimension reduction way to save training time. However, the data sets that share many common keywords between categories affect the classification performance seriously. To address that problem, firstly, we conduct an effective term weighting scheme consisting of… (More)

Topics

2 Figures and Tables

Slides referencing similar topics