Taking advantage of Turkish characteristic features to achieve authorship attribution problems for Turkish

Abstract

The rapid growth in the number of electronic and online texts, such as e-mails, online newspapers and magazines, blog posts, and online forum messages, has accelerated research on authorship attribution. Although such studies are not as abundant as for English, a considerable number of studies on author identification in Turkish have appeared in the last fifteen years. This study has two parts. The first part is a brief review of Turkish authorship attribution studies, focused on the stylometric features that enable authors to be distinguished from one another. In the second part, we analyze the main characteristics of the Turkish language and describe our first experiments on Turkish corpora. These experiments take advantage of characteristic features of Turkish by using the frequencies of gerunds, with Support Vector Machines as the learning algorithm.
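As a rough illustration of what a gerund-frequency feature could look like, the sketch below counts tokens that end in a few suffixes commonly described as Turkish gerund (converb) endings and converts the counts to relative frequencies. The abstract does not say which gerunds were used or how they were detected, so the suffix list, the family labels, and the function names here are assumptions made for illustration only. A plain suffix match is also a crude stand-in for real morphological analysis and will produce false positives (for example, the noun "tip" ends in "ip").

```python
import re
from collections import Counter

# Hypothetical, non-exhaustive suffix families (vowel-harmony variants grouped).
# These are illustrative assumptions, not the feature set used in the paper.
GERUND_FAMILIES = {
    "-ArAk": ("arak", "erek"),
    "-IncA": ("ınca", "ince", "unca", "ünce"),
    "-mAdAn": ("madan", "meden"),
    "-Ip": ("ıp", "ip", "up", "üp"),
}


def tokenize(text):
    """Lowercase with Turkish dotted/dotless i handling, then split into words."""
    text = text.replace("I", "ı").replace("İ", "i").lower()
    return re.findall(r"[^\W\d_]+", text)


def gerund_frequencies(text):
    """Return the relative frequency of each suffix family over all tokens."""
    tokens = tokenize(text)
    counts = Counter()
    for token in tokens:
        for family, suffixes in GERUND_FAMILIES.items():
            # Require a stem in front of the suffix; count each token once.
            if any(token.endswith(s) and len(token) > len(s) for s in suffixes):
                counts[family] += 1
                break
    total = len(tokens) or 1  # avoid division by zero on empty input
    return {family: counts[family] / total for family in GERUND_FAMILIES}


sample = "Koşarak geldi ve kapıyı açıp içeri girdi. Bizi görünce gülerek selam verdi."
features = gerund_frequencies(sample)
```

One such frequency vector per document could then serve as input to a Support Vector Machine classifier, with the author as the class label.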

DOI: 10.1109/SIU.2017.7960438

Cite this paper

@article{Saygili2017TakingAO, title={Taking advantage of Turkish characteristic features to achieve authorship attribution problems for Turkish}, author={Neslihan Sirin Saygili and Tassadit Amghar and Bernard Levrat and Tankut Acarman}, journal={2017 25th Signal Processing and Communications Applications Conference (SIU)}, year={2017}, pages={1-4} }