Fake News Detection on Social Media: A Data Mining Perspective

@article{Shu2017FakeND,
  title={Fake News Detection on Social Media: A Data Mining Perspective},
  author={Kai Shu and Amy Lynn Sliva and Suhang Wang and Jiliang Tang and Huan Liu},
  journal={ArXiv},
  year={2017},
  volume={abs/1708.01967}
}
Social media for news consumption is a double-edged sword. On the one hand, its low cost, easy access, and rapid dissemination of information lead people to seek out and consume news from social media. On the other hand, it enables the wide spread of \fake news", i.e., low quality news with intentionally false information. The extensive spread of fake news has the potential for extremely negative impacts on individuals and society. Therefore, fake news detection on social media has recently… 

Figures and Tables from this paper

Understanding User Profiles on Social Media for Fake News Detection

A comparative analysis over explicit and implicit profile features between “experienced” users who are able to recognize fake news items as false and “naïve” Users who are more likely to believe fake news reveals their potential to differentiate fake news.

Beyond News Contents: The Role of Social Context for Fake News Detection

A tri-relationship embedding framework TriFN is proposed, which models publisher-news relations and user-news interactions simultaneously forfake news classification and significantly outperforms other baseline methods for fake news detection.

Beyond News Contents : The Role of Social Context for Fake News Detection

A tri-relationship embedding framework TriFN is proposed, which models publisher-news relations and user-news interactions simultaneously forfake news classification and significantly outperforms other baseline methods for fake news detection.

Beyond News Contents : The Role of Social Context for Fake News Detection

A tri-relationship embedding framework TriFN is proposed, which models publisher-news relations and user-news interactions simultaneously forfake news classification and significantly outperforms other baseline methods for fake news detection.

An Empirical Comparison of Fake News Detection using different Machine Learning Algorithms

This research aims at examining four well-known machine learning algorithms, namely the random forest, the Naive Bayes, the neural network, and the decision trees, distinctively to validate the efficiency of the classification performance on detecting fake news.

Exploiting Tri-Relationship for Fake News Detection

This paper explores the correlations of publisher bias, news stance, and relevant user engagements simultaneously, and proposes a Tri-Relationship Fake News detection framework (TriFN), and provides two comprehensive real-world fake news datasets to facilitate fake news research.

FakeNewsNet: A Data Repository with News Content, Social Context and Dynamic Information for Studying Fake News on Social Media

To facilitate fake news related researches, a fake news data repository FakeNewsNet is proposed, which contains two comprehensive datasets that includes news content, social context, and spatiotemporal information.

Automatic Fake News Detection by Exploiting User's Assessments on Social Networks: A Case Study of Twitter

The news content, the wisdom of crowds in the social interaction and the user’s credibility characteristics are exploited to automatically detect fake news on Twitter and a Support Vector Machine model with the Radial Basis Function kernel is applied to determine the authenticity of the news.

The Role of User Profiles for Fake News Detection

This work measures users' sharing behaviors and group representative users who are more likely to share fake and real news; then, a comparative analysis of explicit and implicit profile features between these user groups reveals their potential to help differentiate fake news from real news.

Unsupervised Fake News Detection on Social Media: A Generative Approach

This paper proposes an efficient collapsed Gibbs sampling approach to infer the truths of news and the users’ credibility without any labelled data, and shows that the proposed method significantly outperforms the compared unsupervised methods.
...

News Verification by Exploiting Conflicting Social Viewpoints in Microblogs

This paper discovers conflicting viewpoints in news tweets with a topic model method, and builds a credibility propagation network of tweets linked with supporting or opposing relations that generates the final evaluation result for news.

Novel Visual and Statistical Image Features for Microblogs News Verification

This paper explores the key role of image content in the task of automatic news verification on microblogs and proposes several visual and statistical features to characterize these patterns visually and statistically for detecting fake news.

CSI: A Hybrid Deep Model for Fake News Detection

This work proposes a model called CSI which is composed of three modules: Capture, Score, and Integrate, and incorporates the behavior of both parties, users and articles, and the group behavior of users who propagate fake news.

News Credibility Evaluation on Microblog with a Hierarchical Propagation Model

This work proposes a hierarchical propagation model for evaluating news credibility on Micro blog by formulating this propagation process as a graph optimization problem, and provides a globally optimal solution with an iterative algorithm.

Some Like it Hoax: Automated Fake News Detection in Social Networks

It is shown that Facebook posts can be classified with high accuracy as hoaxes or non-hoaxes on the basis of the users who "liked" them, and it is suggested that mapping the diffusion pattern of information can be a useful component of automatic hoax detection systems.

Social Spammer Detection with Sentiment Information

Experimental results on real-world social media datasets show the superior performance of the proposed framework by harnessing sentiment analysis for social spammer detection.

Automatic detection of rumor on Sina Weibo

This is the first study on rumor analysis and detection on Sina Weibo, China's leading micro-blogging service provider, and examines an extensive set of features that can be extracted from the microblogs, and trains a classifier to automatically detect the rumors from a mixed set of true information and false information.

Gleaning Wisdom from the Past: Early Detection of Emerging Rumors in Social Media

This paper identifies and utilizes patterns from prior labeled data to help reveal emergent rumors, and shows how much earlier it can detect an emerging rumor than traditional approaches.

Online Social Spammer Detection

A general optimization framework to collectively use content and network information for social spammer detection, and provide the solution for efficient online processing is presented.

Faking Sandy: characterizing and identifying fake images on Twitter during Hurricane Sandy

The role of Twitter, during Hurricane Sandy (2012) to spread fake images about the disaster was highlighted, and automated techniques can be used in identifying real images from fake images posted on Twitter.
...