Mining Text Data Mining Text Data

  title={Mining Text Data Mining Text Data},
  author={Charu C. Aggarwal and ChengXiang Zhai},
Clustering is a widely studied data mining problem in the text domains. The problem finds numerous applications in customer segmentation, classification, collaborative filtering, visualization, document organization, and indexing. In this chapter, we will provide a detailed survey of the problem of text clustering. We will study the key challenges of the clustering problem, as it applies to the text domain. We will discuss the key methods used for text clustering, and their relative advantages… CONTINUE READING


Publications citing this paper.
Showing 1-10 of 10 extracted citations

A Named Entity Recognition approach for Albanian

2013 International Conference on Advances in Computing, Communications and Informatics (ICACCI) • 2013
View 4 Excerpts
Highly Influenced

Information extraction from unstructured data using RDF

2016 International Conference on ICT in Business Industry & Government (ICTBIG) • 2016
View 1 Excerpt


Publications referenced by this paper.
Showing 1-10 of 131 references

Document clustering with universum

View 7 Excerpts
Highly Influenced

Dynamicity vs

W. Ke, C. Sugimoto, J. Mostafa
effectiveness: studying online clustering for scatter/gather. ACM SIGIR Conference • 2009
View 6 Excerpts
Highly Influenced

Feature Selection for Clustering

Encyclopedia of Database Systems • 2009
View 10 Excerpts
Highly Influenced

Relational Topic Models for Document Networks

AISTATS • 2009
View 7 Excerpts
Highly Influenced

Clustering Text Data Streams

Journal of Computer Science and Technology • 2008
View 10 Excerpts
Highly Influenced

Similar Papers

Loading similar papers…