Improving Text Classification Accuracy by Training Label Cleaning

Andrea Esuli and Fabrizio Sebastiani
ACM Trans. Inf. Syst.

In text classification (TC) and other tasks involving supervised learning, labelled data may be scarce or expensive to obtain. Semisupervised learning and active learning are two strategies whose aim is maximizing the effectiveness of the resulting classifiers for a given amount of training effort. Both strategies have been actively investigated for TC in recent years. Much less research has been devoted to a third such strategy, training label cleaning (TLC), which consists in devising ranking…


