Plagiarism detection based on structural information

@inproceedings{Stamatatos2011PlagiarismDB,
  title={Plagiarism detection based on structural information},
  author={Efstathios Stamatatos},
  booktitle={CIKM},
  year={2011}
}
In this paper a novel method for detecting plagiarized passages in document collections is presented. In contrast to previous work in this field that uses mainly content terms to represent documents, the proposed method is based on structural information provided by occurrences of a small list of stopwords (i.e., very frequent words). We show that stopword n-grams are able to capture local syntactic similarities between suspicious and original documents. Moreover, an algorithm for detecting the… CONTINUE READING
Highly Cited
This paper has 18 citations. REVIEW CITATIONS