How Text Segmentation Algorithms Gain from Topic Models

@inproceedings{Riedl2012HowTS,
  title={How Text Segmentation Algorithms Gain from Topic Models},
  author={Martin Riedl and Christian Biemann},
  booktitle={HLT-NAACL},
  year={2012}
}
This paper introduces a general method to incorporate the LDA Topic Model into text segmentation algorithms. We show that semantic information added by Topic Models significantly improves the performance of two wordbased algorithms, namely TextTiling and C99. Additionally, we introduce the new TopicTiling algorithm that is designed to take better advantage of topic information. We show consistent improvements over word-based methods and achieve state-of-the art performance on a standard dataset… CONTINUE READING
Highly Cited
This paper has 51 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.
31 Citations
16 References
Similar Papers

Citations

Publications citing this paper.
Showing 1-10 of 31 extracted citations

52 Citations

051015'13'15'17
Citations per Year
Semantic Scholar estimates that this publication has 52 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.

Similar Papers

Loading similar papers…