Beyond topical similarity: a structural similarity measure for retrieving highly similar documents

@article{Wan2006BeyondTS,
  title={Beyond topical similarity: a structural similarity measure for retrieving highly similar documents},
  author={Xiaojun Wan},
  journal={Knowledge and Information Systems},
  year={2006},
  volume={15},
  pages={55-73}
}
Accurately measuring document similarity is important for many text applications, e.g. document similarity search, document recommendation, etc. Most traditional similarity measures are based only on “bag of words” of documents and can well evaluate document topical similarity. In this paper, we propose the notion of document structural similarity, which is expected to further evaluate document similarity by comparing document subtopic structures. Three related factors (i.e. the optimal… CONTINUE READING
Highly Cited
This paper has 32 citations. REVIEW CITATIONS
23 Citations
35 References
Similar Papers

Citations

Publications citing this paper.
Showing 1-10 of 23 extracted citations

References

Publications referenced by this paper.

Similar Papers

Loading similar papers…