A survey of modern authorship attribution methods
TLDR
A survey of recent advances in automated approaches to attributing authorship is presented, examining their characteristics and settings.
  • 978
  • 134
Overview of the 2nd International Competition on Plagiarism Detection
TLDR
This paper overviews 18 plagiarism detectors that have been developed and evaluated within PAN'10.
  • 370
  • 37
Automatic Text Categorization in Terms of Genre and Author
TLDR
We propose a set of style markers including analysis-level measures that represent the way in which the input text has been analyzed and capture useful stylistic information without additional cost.
  • 438
  • 25
N-Gram Feature Selection for Authorship Identification
TLDR
We propose a variable-length n-gram approach inspired by previous work for selecting variable-length word sequences.
  • 185
  • 19
Improving the Reproducibility of PAN's Shared Tasks: Plagiarism Detection, Author Identification, and Author Profiling
TLDR
This paper reports on the PAN 2014 evaluation lab, which hosts three shared tasks on plagiarism detection, author identification, and author profiling.
  • 185
  • 18
Computer-Based Authorship Attribution Without Lexical Measures
TLDR
The most important approaches to computer-assisted authorship attribution are exclusively based on lexical measures that either represent the vocabulary richness of the author or simply comprise frequencies of occurrence of common words.
  • 266
  • 18
Intrinsic Plagiarism Detection Using Character n-gram Profiles
TLDR
A new method for intrinsic plagiarism detection has been presented that attempts to quantify the style variation within a document using character n-gram profiles and a style change function based on an appropriate dissimilarity measure.
  • 166
  • 17
Overview of the Author Identification Task at PAN 2013
TLDR
The author identification task at PAN-2013 focuses on author verification.
  • 168
  • 16
A Profile-Based Method for Authorship Verification
TLDR
We show that the profile-based paradigm (where all samples by one author are treated cumulatively) can be very effective, surpassing the performance of PAN-2013 winners without using any information from external sources.
  • 51
  • 15
A survey of modern authorship attribution methods
TLDR
A survey of recent advances in automated approaches to attributing authorship is presented, examining their characteristics for both text representation and text classification.
  • 324
  • 14