Xiangyi Ye

We don’t have enough information about this author to calculate their statistics. If you think this is an error let us know.
Learn More
Properties of corpora, such as the diversity of vocabulary and how tightly related texts cluster together, impact the best way to cluster short texts. We examine several such properties in a variety of corpora and track their effects on various combinations of similarity metrics and clustering algorithms. We show that semantic similarity metrics outperform(More)
  • 1