Authorship verification using deep belief network systems

  title={Authorship verification using deep belief network systems},
  author={Marcelo Luiz Brocardo and Issa Traor{\'e} and Isaac Woungang and Mohammad S. Obaidat},
  journal={International Journal of Communication Systems},
This paper explores the use of deep belief networks for authorship verification model applicable for continuous authentication (CA). The proposed approach uses Gaussian units in the visible layer to model real‐valued data on the basis of a Gaussian‐Bernoulli deep belief network. The lexical, syntactic, and application‐specific features are explored, leading to the proposal of a method to merge a pair of features into a single one. The CA is simulated by decomposing an online document into a… 

Deep Dive into Authorship Verification of Email Messages with Convolutional Neural Network

The proposed method implements the binary classification with a sequence-to-sequence (seq2seq) model and trains a convolutional neural network (CNN) on positive and negative examples, and verifies message authorship very accurately.

Design and Analysis of a Novel Authorship Verification Framework for Hijacked Social Media Accounts Compromised by a Human

A novel authorship verification framework for hijacked social media accounts, compromised by a human, is proposed and the ELECTRE approach is utilized for feature selection, and the rank exponent weight method is applied for feature weighting.

An intrinsic authorship verification technique for compromised account detection in social networks

An intrinsic profiling-based technique is presented for the assessment of authorship verification and its application toward detection of compromised accounts and performance of various one-class classifiers is analyzed on the basis of different evaluation metrics.

Improving author verification based on topic modeling

The comparison to state‐of‐the‐art methods demonstrates the great potential of the approaches presented in this study and demonstrates that even when genre‐agnostic external documents are used, the proposed extrinsic models are very competitive.

Authorship Verification of Yorùbá Blog Posts using Character N-grams

N-grams features were extracted from the corpus and inductive learning techniques was applied to build feature-based models in order to perform the automatic author identification and the result obtained signifies that the posts were from the same author.

Authentication of Short Messages from Social Networks Recurrent Artificial Neural Networks

This work aims the study of developing a system which is able to operate for finding the author of anonymous messages by providing to the system po of suspected users on social media and choosing matched author by combining stylometry with current computational capacity for short text messages.

Multi-Platform Authorship Verification

An analysis of authorship verification across four common messaging systems enables a direct comparison of recognition performance and provides a basis for analyzing the feature vectors across platforms to better understand what aspects each capitalize upon in order to achieve good classification.

Region Based Instance Document (RID) Approach Using Compression Features for Authorship Attribution

A new region based document model for authorship identification is proposed, to address the dimensionality problem of instance based approaches and scalability problem of profile based approaches.



Continuous authentication using micro-messages

This paper investigates two different classification schemes: on one hand Logistic Regression (LR) and on the other hand an hybrid classifier combining Support Vector Machine (SVM) and LR, and explored lexical, syntactic, and application specific features.

Short Text Authorship Attribution via Sequence Kernels, Markov Chains and Author Unmasking: An Investigation

An investigation of recently proposed character and word sequence kernels for the task of authorship attribution based on relatively short texts suggests that when using a realistic setup that takes into account the case of texts which are not written by any hypothesised authors, the amount of training material has more influence on discrimination performance than the amounts of test material.

Authorship verification for short messages using stylometry

A supervised learning technique combined with n-gram analysis for authorship verification in short texts with very promising results based on the Enron email dataset involving 87 authors.

Toward a Framework for Continuous Authentication Using Stylometry

This work adapts existing stylometric features and develops a new authorship verification model applicable for continuous authentication, which uses existing lexical, syntactic, and application specific features, and proposes new features based on n-gram analysis.

Applying authorship analysis to extremist-group Web forum messages

A special multilingual model is developed - the set of algorithms and related features - to identify Arabic messages, gearing this model toward the language's unique characteristics and incorporated a complex message extraction component to allow the use of a more comprehensive set of features tailored specifically toward online messages.

Authorship verification as a one-class classification problem

A new learning-based method for adducing the "depth of difference" between two example sets is presented and evidence that this method solves the authorship verification problem with very high accuracy is offered.

Plagiarism and authorship analysis: introduction to the special issue

The Internet has facilitated both the dissemination of anonymous texts as well aasy ‘‘borrowing’’ of ideas and words of others, which has raised a number of important questions regarding authorship.

Mining writeprints from anonymous e-mails for forensic investigation

Authorship Similarity Detection from Email Messages

This paper investigates techniques for authorship similarity detection from the text content of a short length, topic-free email and proposes a frequent pattern and machine learning based method.