Corpus ID: 102351638

Hybrid Approaches to Detect Comments Violating Macro Norms on Reddit

@article{Chandrasekharan2019HybridAT,
  title={Hybrid Approaches to Detect Comments Violating Macro Norms on Reddit},
  author={Eshwar Chandrasekharan and Eric Gilbert},
  journal={ArXiv},
  year={2019},
  volume={abs/1904.03596}
}
In this dataset paper, we present a three-stage process to collect Reddit comments that were removed by moderators of several subreddits for violating subreddit rules and guidelines. Other than the fact that these comments were flagged by moderators for violating community norms, we do not have any other information regarding the nature of the violations. Through this procedure, we collect over 2M comments removed by moderators of 100 different Reddit communities, and publicly release…
Crossmod: A Cross-Community Learning-based System to Assist Reddit Moderators
TLDR
Crossmod is the first open source, AI-backed sociotechnical moderation system to be designed using participatory methods and an approach that leverages a large corpus of previous moderator decisions via an ensemble of classifiers.
Norm violation in online communities - A study of Stack Overflow comments
TLDR
This work quantifies and analyzes the patterns of violations of normative behaviour among the users of Stack Overflow, and proposes a recommendation system that can be used to warn users that they are about to violate a norm, to help achieve norm adherence in online communities.
A Just and Comprehensive Strategy for Using NLP to Address Online Abuse
TLDR
This position paper argues that the NLP community needs to make three substantive changes: expanding the scope of problems to tackle both more subtle and more serious forms of abuse, developing proactive technologies that counter or inhibit abuse before it harms, and reframing the effort within a framework of justice to promote healthy communities.
Quarantined! Examining the Effects of a Community-Wide Moderation Intervention on Reddit
TLDR
It is found that the quarantine made it more difficult to recruit new members: new user influx to TRP and TD decreased by 79.5% and 58%, respectively. Despite quarantining, existing users' misogyny and racism levels remained unaffected.

References

You Can't Stay Here
In 2015, Reddit closed several subreddits, foremost among them r/fatpeoplehate and r/CoonTown, due to violations of Reddit's anti-harassment policy. However, the effectiveness of banning as a…
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for Identifying and Classifying Hate in Online News Media
TLDR
A granular taxonomy of different types and targets of online hate and experiments with machine learning to generate a multiclass, multilabel classification model that automatically detects and categorizes hateful comments in the context of online news media are presented.
The Internet's Hidden Rules
TLDR
Given the size of Reddit's user base, it is argued this represents the first large-scale census of the norms in broader internet culture, and sheds light on what Reddit values, and how widely-held those values are.
The Bag of Communities: Identifying Abusive Behavior Online with Preexisting Internet Data
TLDR
It is argued that the BoC approach may allow communities to deal with a range of common problems, like abusive behavior, faster and with fewer engineering resources.
Antisocial Behavior in Online Discussion Communities
TLDR
This paper characterizes antisocial behavior in three large online discussion communities by analyzing users who were banned from these communities, finding that such users tend to concentrate their efforts in a small number of threads, are more likely to post irrelevantly, and are more successful at garnering responses from other users.
Automated Hate Speech Detection and the Problem of Offensive Language
TLDR
This work uses a crowd-sourced hate speech lexicon to collect tweets containing hate speech keywords and labels a sample of these tweets into three categories: those containing hate speech, those with only offensive language, and those with neither.
Prevalence and Psychological Effects of Hateful Speech in Online College Communities
TLDR
This work lays the foundation for studying the psychological impacts of hateful speech in online communities in general, and situated communities in particular (the ones that have both an offline and an online analog).
Reddit Rules! Characterizing an Ecosystem of Governance
TLDR
It is found that rules appear to be context-dependent for individual subreddits but also share common characteristics across the site, motivating further inquiry into underlying mechanisms for rule formation and enforcement in online communities.
"This Post Will Just Get Taken Down": Characterizing Removed Pro-Eating Disorder Social Media Content
TLDR
Characteristics of over 30,000 pro-eating disorder (pro-ED) posts that were at one point public on Instagram but have since been removed are examined to show that straightforward signals can be found in deleted content that distinguish them from other posts.
Latent Dirichlet Allocation
We propose a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams [6], and…