A segment-based approach to clustering multi-topic documents

@article{Tagarelli2012ASA,
  title={A segment-based approach to clustering multi-topic documents},
  author={Andrea Tagarelli and George Karypis},
  journal={Knowledge and Information Systems},
  year={2012},
  volume={34},
  pages={563-595}
}
Document clustering has been recognized as a central problem in text data management. Such a problem becomes particularly challenging when document contents are characterized by subtopical discussions that are not necessarily relevant to each other. Existing methods for document clustering have traditionally assumed that a document is an indivisible unit for text representation and similarity computation, which may not be appropriate to handle documents with multiple topics. In this paper, we… CONTINUE READING
BETA

Citations

Publications citing this paper.
SHOWING 1-10 OF 29 CITATIONS

Topic Modeling in Financial Documents

VIEW 3 EXCERPTS
CITES METHODS
HIGHLY INFLUENCED

A genetic algorithm approach for topic clustering: A centroid-based encoding scheme

  • 2016 7th International Conference on Information, Intelligence, Systems & Applications (IISA)
  • 2016
VIEW 2 EXCERPTS
CITES METHODS

Similar Papers

Loading similar papers…