Evaluation and Classification of Syntax Usage in Determining Short - Text Semantic Similarity

  • Published 2014

Abstract

This paper outlines and categorizes ways of using syntactic information in a number of algorithms for determining the semantic similarity of short texts. We consider the use of word order information, part-of-speech tagging, parsing and semantic role labeling. We analyze and evaluate the effects of syntax usage on algorithm performance by utilizing the results of a paraphrase detection test on the Microsoft Research Paraphrase Corpus. We also propose a new classification of algorithms based on their applicability to languages with scarce natural language processing tools.

1 Figure or Table

Cite this paper

@inproceedings{2014EvaluationAC, title={Evaluation and Classification of Syntax Usage in Determining Short - Text Semantic Similarity}, author={}, year={2014} }