A Comparative Study of Language Models for Book and Author Recognition

@inproceedings{Uzuner2005ACS,
  title={A Comparative Study of Language Models for Book and Author Recognition},
  author={{\"O}zlem Uzuner and Boris Katz},
  booktitle={IJCNLP},
  year={2005}
}
Linguistic information can help improve evaluation of similarity between documents; however, the kind of linguistic information to be used depends on the task. In this paper, we show that distributions of syntactic structures capture the way works are written and accurately identify individual books more than 76% of the time. In comparison, baseline features, e.g., tfidf-weighted keywords, function words, etc., give an accuracy of at most 66%. However, testing the same features on authorship…

Semantic Scholar estimates that this publication has 67 citations based on the available data.
