Corpus ID: 54215055

CodEX: Source Code Plagiarism Detection Based on Abstract Syntax Tree

  • M. Zheng, X. Pan, D. Lillis
  • Published in AICS 2018
  • Computer Science
  • CodEX is a source code search engine that lets users search a repository of source code snippets using a source code snippet as the query. A potential use for such a search engine is to help educators identify cases of plagiarism in students’ programming assignments. This paper evaluates CodEX in this context. Abstract Syntax Trees (ASTs) are used to represent source code files on an abstract level. This, combined with node hashing and similarity calculations, allows users to search…
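
The abstract names three ingredients: an AST representation, node hashing, and a similarity calculation. The sketch below shows one way these could fit together. It is an illustration only and not CodEX's actual implementation: the use of Python's `ast` module, the SHA-1 subtree hash, the Jaccard measure, and the helper names `subtree_hashes` and `similarity` are all assumptions made for this example.

```python
import ast
import hashlib


def subtree_hashes(source):
    """Return a set with one structural hash per AST subtree (hypothetical helper)."""
    hashes = set()

    def visit(node):
        # Hash the node type together with the hashes of its children, in order.
        # Identifier names and literal values are not AST child nodes, so they
        # are ignored and renaming a variable leaves the hash unchanged.
        child_hashes = [visit(child) for child in ast.iter_child_nodes(node)]
        payload = type(node).__name__ + "(" + ",".join(child_hashes) + ")"
        digest = hashlib.sha1(payload.encode("utf-8")).hexdigest()
        hashes.add(digest)
        return digest

    visit(ast.parse(source))
    return hashes


def similarity(a, b):
    """Jaccard similarity of the two subtree-hash sets (an assumed measure)."""
    ha, hb = subtree_hashes(a), subtree_hashes(b)
    if not ha and not hb:
        return 1.0
    return len(ha & hb) / len(ha | hb)


original = "def add(a, b):\n    return a + b\n"
renamed = "def total(x, y):\n    return x + y\n"
different = "def show(xs):\n    for x in xs:\n        print(x)\n"

print(similarity(original, renamed))
print(similarity(original, different))
```

Because only node types and tree shape are hashed, the renamed copy scores as identical to the original, while the structurally different function scores lower. A real detector would likely also need to handle statement reordering and inserted dead code, which this sketch does not attempt.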
