Not So Cute but Fuzzy: Estimating Risk of Sexual Predation in Online Conversations

  title={Not So Cute but Fuzzy: Estimating Risk of Sexual Predation in Online Conversations},
  author={Tatiana R. Ringenberg and Kanishka Misra and Julia Taylor Rayz},
  journal={2019 IEEE International Conference on Systems, Man and Cybernetics (SMC)},
The sexual exploitation of minors is a known and persistent problem for law enforcement. Assistance in prioritizing cases of sexual exploitation of potentially risky conversations is crucial. While attempts to automatically triage conversations for the risk of sexual exploitation of minors have been attempted in the past, most computational models use features which are not representative of the grooming process that is used by investigators. Accurately annotating an offender corpus for use… 

Figures and Tables from this paper

Ensemble Method for Sexual Predators Identification in Online Chats
Several machine learning classifiers including Naive Bayes, Support Vector Machine, Neural Network, Logistic Regression, Random Forest, K-Nearest Neighbors, and Decision Tree with Bag of Words features are investigated to improve the classification task.
A Human-Centered Systematic Literature Review of the Computational Approaches for Online Sexual Risk Detection
A comprehensive literature review to analyze 73 peer-reviewed articles on computational approaches utilizing text or meta-data/multimedia for online sexual risk detection found that the majority of work has focused on identifying sexual predators after-the-fact, rather than taking more nuanced approaches to identify potential victims and problematic patterns that could be used to prevent victimization before it occurs.
Implications of Using Internet Sting Corpora to Approximate Underage Victims
The corpus is annotated for stages and tactics of the victimization process described within psychology research and found significant differences in the three classes of chats that are usually not taken into account in chat classification.


A Learning-Based Approach for the Identification of Sexual Predators in Chat Logs
This team designed several innovative sets of features that guide the construction of classifiers for identifying sexual predation and their methods are driven by psycholinguistic, chat-based, and tf/idf features and yield to very effective classifiers.
Learning to Identify Internet Sexual Predation
This work integrates communication theories and computer science algorithms to create a program that can detect the occurrence of sexual predation in an online social setting and uses machine learning algorithms to classify posts.
Detecting Child Grooming Behaviour Patterns on Social Media
This paper proposes to characterise such stages of child grooming by a series of features covering sentiment polarity, content, and psycho-linguistic and discourse patterns, vital for building robust systems for detecting grooming conversations and potential predators on social media.
Detecting predatory conversations in social media by deep Convolutional Neural Networks
Characterizing Pedophile Conversations on the Internet using Online Grooming
This paper studies pedophile conversations from the perspective of online grooming theory and performs a series of linguistic-based empirical analysis on several pedophile chat conversations to gain useful insights and patterns, and presents empirical results that throw light on various aspects of a pedophile conversation.
Deconstructing the Online Grooming of Youth: Toward Improved Information Systems for Detection of Online Sexual Predators
Through a grounded theory analysis of ninety Perverted Justice (PVJ) transcripts, of conversations between convicted online sexual predators and PVJ volunteers who posed as youth, five categories of online predator behavior inside of text during victimization of children are identified.
Linguistic analysis of chat transcripts from child predator undercover sex stings
Abstract We analyzed chat transcripts from 590 undercover Internet sex stings across the US, using the Linguistic Inquiry and Word Count software program to examine trends in sexual word usage, total
Exploring high-level features for detecting cyberpedophilia
Sexual offenders contacting children online: an examination of transcripts of sexual grooming
ABSTRACT The present study investigated transcripts of adults sexually grooming decoy victims on the Internet. One hundred transcripts were coded for offender characteristics, victim characteristics,