Citation based plagiarism detection: a new approach to identify plagiarized work language independently

  • Bela Gipp, J. Beel
  • HT '10
This paper describes a new approach towards detecting plagiarism and scientific documents that have been read but not cited. In contrast to existing approaches, which analyze documents' words but ignore their citations, this approach is based on citation analysis and allows duplicate and plagiarism detection even if a document has been paraphrased or translated, since the relative position of citations remains similar. Although this approach allows in many cases the detection of plagiarized…
Citation-based plagiarism detection - idea, implementation and evaluation
  • Bela Gipp
  • Computer Science
  • Bull. IEEE Tech. Comm. Digit. Libr.
  • 2012
Citation-based Plagiarism Detection is by no means a replacement for the currently used text-based approaches, but should be considered as a complement for identifying currently hard to find well-disguised plagiarisms.
Comparative evaluation of text- and citation-based plagiarism detection approaches using guttenplag
It is shown that citation-based plagiarism detection performs significantly better than text-based procedures in identifying strong paraphrasing, translation and some idea plagiarism.
Identifying Related Work and Plagiarism by Citation Analysis
  • Bela Gipp
  • Computer Science
  • Bull. IEEE Tech. Comm. Digit. Libr.
  • 2011
This updated and revised paper gives an overview of my PhD research, which focuses on two newly developed approaches to plagiarism detection, called Citation Proximity Analysis and Citation based Plagiarism Detection.
Citation pattern matching algorithms for citation-based plagiarism detection: greedy citation tiling, citation chunking and longest common citation sequence
Three algorithms are introduced: Greedy Citation Tiling, Citation Chunking and Longest Common Citation Sequence. It is shown that if these algorithms are combined, common forms of plagiarism can be detected reliably.
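As a rough illustration of the idea behind a longest common citation sequence, the sketch below computes a longest common subsequence over two ordered lists of cited references. It is not the authors' implementation: the function name, the dynamic-programming formulation and the reference keys (`R1`, `R2`, ...) are assumptions made for this example.

```python
def longest_common_citation_sequence(a, b):
    """Return one longest common subsequence of two citation sequences.

    `a` and `b` are lists of reference identifiers in the order in which
    the references are cited in each document (hypothetical input format).
    """
    m, n = len(a), len(b)
    # dp[i][j] holds the length of the longest common subsequence
    # of the suffixes a[i:] and b[j:].
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m - 1, -1, -1):
        for j in range(n - 1, -1, -1):
            if a[i] == b[j]:
                dp[i][j] = dp[i + 1][j + 1] + 1
            else:
                dp[i][j] = max(dp[i + 1][j], dp[i][j + 1])

    # Walk the table from the start to recover one such subsequence.
    i = j = 0
    shared = []
    while i < m and j < n:
        if a[i] == b[j]:
            shared.append(a[i])
            i += 1
            j += 1
        elif dp[i + 1][j] >= dp[i][j + 1]:
            i += 1
        else:
            j += 1
    return shared


# Toy citation sequences, invented for illustration.
doc_a = ["R1", "R2", "R3", "R4", "R5"]
doc_b = ["R9", "R1", "R3", "R8", "R5"]
shared = longest_common_citation_sequence(doc_a, doc_b)
print(shared)
```

Because only the order of shared references matters here, the comparison is unaffected by paraphrasing or translation of the surrounding text; how long a shared sequence must be before it counts as suspicious is left open in this sketch.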
CitePlag : A Citation-based Plagiarism Detection System Prototype
An open-source prototype of a citation-based plagiarism detection system called CitePlag, to evaluate the citations of academic documents as language independent markers to detect plagiarism, is presented.
Hybrid technique for plagiarism detection based on text and citation comparison
Plagiarism is a “stealing of academic assets”. In earlier days, numerous text documents are accessible on the web and are effortless to access. Appropriate to this large…
Comparing and combining Content‐ and Citation‐based approaches for plagiarism detection
This work compares content and citation‐based approaches for plagiarism detection with the goal of evaluating whether they are complementary and if their combination can improve the quality of the detection and concluded that a combination of the methods can be beneficial.
A Knowledge Based Approach to Detection of Idea Plagiarism in Online Research Publications
Plagiarism is on the rise because of the easy access to information through the World Wide Web. Web pages are growing on the internet on a daily basis. Researchers want to be well connected globally to…
Plagiarism Detection Methods and Tools : An Overview
Plagiarism Detection Systems play an important role in revealing instances of a plagiarism act, especially in the educational sector with scientific documents and papers. The idea of plagiarism is…
State-of-the-art in detecting academic plagiarism
In the future, plagiarism detection systems may benefit from combining traditional character-based detection methods with these emerging detection approaches, including intrinsic, cross-lingual and citation-based plagiarism detection.


Citation Proximity Analysis (CPA) : A New Approach for Identifying Related Work Based on Co-Citation Analysis
The approach called Citation Proximity Analysis (CPA) is a further development of co-citation analysis, but in addition, considers the proximity of citations to each other within an article’s full-text.
Co-citation in the scientific literature: A new measure of the relationship between two documents
  • H. Small
  • Computer Science
  • J. Am. Soc. Inf. Sci.
  • 1973
A new form of document coupling called co-citation is defined as the frequency with which two documents are cited together, and clusters of co-cited papers provide a new way to study the specialty structure of science.
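To make the definition concrete, here is a minimal sketch that counts, for every pair of cited documents, how many citing papers cite both. The function name and the toy reference lists are invented for illustration and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations


def cocitation_counts(reference_lists):
    """Count how often each pair of documents is cited together.

    `reference_lists` maps a citing paper to the list of documents it
    cites (hypothetical input format chosen for this sketch).
    """
    counts = Counter()
    for refs in reference_lists.values():
        # Deduplicate and sort so each pair has one canonical key.
        for pair in combinations(sorted(set(refs)), 2):
            counts[pair] += 1
    return counts


# Toy data: three citing papers and the documents they cite.
citing = {
    "P1": ["A", "B", "C"],
    "P2": ["A", "B"],
    "P3": ["B", "C"],
}
counts = cocitation_counts(citing)
print(counts[("A", "B")])
```

In this toy data, documents A and B are co-cited by P1 and P2, so their co-citation frequency is 2. Citation Proximity Analysis, listed above, refines this kind of count by also taking into account how close the two citations are within the citing text.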
Computer-based plagiarism detection methods and tools: an overview
The paper is dedicated to the plagiarism problem. Ways to reduce plagiarism, both plagiarism prevention and plagiarism detection, are discussed. Widely used plagiarism detection methods are…
System of Document Connections Based on References
Analysis of bibliographic references is becoming a tool for studying information processes in science and for classifying documents. In order to classify documents in a field of knowledge it is…
Bibliographic coupling between scientific papers
The population of papers under study was ordered into groups that satisfy the stated criterion of interrelation and an examination of the papers that constitute the groups shows a high degree of logical correlation.
PAN-09 3rd Workshop on Uncovering Plagiarism, Authorship and Social Software Misuse and 1st International Competition on Plagiarism Detection
  • 2009