Corpus ID: 14964721

Locate the Hate: Detecting Tweets against Blacks

  title={Locate the Hate: Detecting Tweets against Blacks},
  author={Irene Kwok and Yuzhou Wang},
Although the social medium Twitter grants users freedom of speech, its instantaneous nature and retweeting features also amplify hate speech. [...] Key Result We apply a supervised machine learning approach, employing inexpensively acquired labeled data from diverse Twitter accounts to learn a binary classifier for the labels "racist" and "nonracist" The classifier has a 76% average accuracy on individual tweets, suggesting that with further improvements, our work can contribute data on the sources of anti…Expand
Demographic Word Embeddings for Racism Detection on Twitter
This study presents a supervised learning strategy to detect racist language on Twitter based on word embedding that incorporate demographic information that incorporate age, gender, and location information. Expand
Detecting hate speech on Twitter A comparative study on the naive Bayes classifier
Hate speech and cyberbullying on social media platform Twitter is a growing issue, and to combat this they turn to machine learning and computer science. This study will investigate and compareExpand
Requiem for online harassers: Identifying racism from political tweets
During the last five years, the amount of users of online social networks has increased exponentially. With the growing of users, social problems also arise. Due to the nature of these platforms,Expand
Monitoring Users' Behavior: Anti-Immigration Speech Detection on Twitter
It is demonstrated that using word n-grams results in higher precision, recall, and f-score as compared to character n- grams, and the implications for future work on hate-speech detection and social media data analysis in general are discussed. Expand
Learning to Classify Hate and Extremism Promoting Tweets
The problem of hate and extremism promoting tweet identification as a one-class classification problem is formulated and several linguistic features are proposed that are effective and demonstrate that the proposed approach is effective. Expand
Supervised Classifiers to Identify Hate Speech on English and Spanish Tweets
An empirical study that analyzes the influence of two types of linguistic features when they are used to feed different supervised machine learning classifiers in the process of identifying hate speech on Twitter shows that CNB, SVM, and RF are better than the rest classifier in English and Spanish languages by taking into account all features. Expand
Modeling Hate Speech Detection in Social Media Interactions Using Bert
Hate speech propagation in social media sites has been happening over time and there is need to accurately identify and counter it so that those offended can seek redress and offenders can beExpand
Racial Bias in Hate Speech and Abusive Language Detection Datasets
Evidence of systematic racial bias in five different sets of Twitter data annotated for hate speech and abusive language is examined, as classifiers trained on them tend to predict that tweets written in African-American English are abusive at substantially higher rates. Expand
Abusive language Detection using Machine Learning
Social network sites involve billions of users around the world wide. User interactions with these social sites, like twitter have a tremendous and occasionally undesirable impact implications forExpand
"Like Sheep Among Wolves": Characterizing Hateful Users on Twitter
The results show that hateful users have more recent account creation dates, and more statuses, and followees per day, and are more central in the retweet network, contradicting the "lone wolf" stereotype often associated with such behavior. Expand


Detecting Hate Speech on the World Wide Web
We present an approach to detecting hate speech in online text, where hate speech is defined as abusive speech targeting specific group characteristics, such as ethnic origin, religion, gender, orExpand
Offensive Language Detection Using Multi-level Classification
An automatic flame detection method is described which extracts features at different conceptual levels and applies multi-level classification for flame detection and there is an auxiliary weighted pattern repository which improves accuracy by matching the text to its graded entries. Expand
Thumbs up? Sentiment Classification using Machine Learning Techniques
This work considers the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative, and concludes by examining factors that make the sentiment classification problem more challenging. Expand
Comparing naive Bayes, decision trees, and SVM with AUC and accuracy
It is proved that AUC is, in general, a better measure (defined precisely) than accuracy for evaluating performance of learning algorithms. Expand
In Proc
  • of the 2012 Workshop on LSM, 19-26.
  • 2012