• Corpus ID: 231648407

VoterFraud2020: a Multi-modal Dataset of Election Fraud Claims on Twitter

  title={VoterFraud2020: a Multi-modal Dataset of Election Fraud Claims on Twitter},
  author={Anton Abilov and Yiqing Hua and Hana Matatov and Ofra Amir and Mor Naaman},
The wide spread of unfounded election fraud claims surrounding the U.S. 2020 election had resulted in undermining of trust in the election, culminating in violence inside the U.S. capitol. Under these circumstances, it is critical to understand discussions surrounding these claims on Twitter, a major platform where the claims disseminate. To this end, we collected and release the VoterFraud2020 dataset, a multi-modal dataset with 7.6M tweets and 25.6M retweets from 2.6M users related to voter… 

Figures and Tables from this paper

Misinfo Belief Frames: A Case Study on Covid & Climate News
The results using large-scale language modeling to predict misinformation frames show that machine-generated inferences can influence readers' trust in news headlines, demonstrating the potential effectiveness of using generated frames to counter misinformation.
Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At Scale
It is shown that classical computer vision methods are not necessarily the best approach, whereas a pair of relatively little used methods Dhash perceptual hash and SimCLR v2 ResNets achieve excellent performance, scale well and are computationally efficient.
Characterizing YouTube and BitChute Content and Mobilizers During U.S. Election Fraud Discussions on Twitter
While BitChute videos promoting election fraud claims were linked to and engaged with in the Twitter discussion, they played a relatively small role compared to YouTube videos promoting fraud claims, this core finding points to the continued need for proactive, consistent, and collaborative content moderation solutions rather than the reactive and inconsistent solutions currently being used.
Election Fraud and Misinformation on Twitter: Author, Cluster, and Message Antecedents
The results highlight the earlier instances offake news and the greater diffusion speed of fake news in smaller user clusters and by users with fewer followers, so they suggest that monitors can detect fake news earlier by focusing on earlier tweets, smalleruser clusters, and users with less followers.
(Re)shaping online narratives: when bots promote the message of President Trump during his first impeachment
Influencing and framing debates on Twitter provides power to shape public opinion. Bots have become essential tools of ‘computational propaganda’ on social media such as Twitter, often contributing
Characterizing Reddit Participation of Users Who Engage in the QAnon Conspiracy Theories
The results point to the likelihood that the activities of early QAnon users on Reddit were dedicated and committed to the conspiracy, providing implications on both platform moderation design and future research.
“This is Fake! Shared it by Mistake”:Assessing the Intent of Fake News Spreaders
This work proposes an influence graph, using which to model individuals’ intent in fake news spreading, and shows that the assessed intent can help significantly differentiate between intentional and unintentional fake news spreaders.
A large-scale sentiment analysis of tweets pertaining to the 2020 US presidential election
The aim of this study is to highlight the importance of conducting sentiment analysis on all posts captured in real time, including those that are now inaccessible, in determining the true sentiments of the opinions around the time of an event.
Misinfo Reaction Frames: Reasoning about Readers’ Reactions to News Headlines
This work demonstrates the feasibility and importance of pragmatic inferences on news headlines to help enhance AI-guided misinformation detection and mitigation and introduces a Misinfo Reaction Frames corpus, a crowdsourced dataset of reactions to over 25k news headlines focusing on global crises.
The Polarized Web of the Voter Fraud Claims in the 2020 US Presidential Election
It is shown that the polarization of the Twitter discussions around the fraud claims in the US presidential election is evident and YouTube is connecting the two communities identified solely by structural community detection, thus being in the rare position of bringing together both right and left-leaning users.


#Election2020: the first public Twitter dataset on the 2020 US Presidential election
A massive-scale, longitudinal dataset of U.S. politics- and election-related tweets that predates and spans the entire period of the Republican and Democratic primaries, with real-time tracking of all presidential contenders on both sides of the aisle.
Analyzing the Digital Traces of Political Manipulation: The 2016 Russian Interference Twitter Campaign
Although an ideologically broad swath of Twitter users were exposed to Russian trolls in the period leading up to the 2016 U.S. Presidential election, it was mainly conservatives who helped amplify their message, revealing that they had a mostly conservative, pro-Trump agenda.
Characterizing the Use of Images in State-Sponsored Information Warfare Operations by Russian Trolls on Twitter
The first study of images shared by state-sponsored accounts by analyzing a ground truth dataset of 1.8M images posted to Twitter by accounts controlled by the Russian Internet Research Agency shows that the trolls were more effective in disseminating politics-related imagery than other images.
Towards Measuring Adversarial Twitter Interactions against Candidates in the US Midterm Elections
This study measures the adversarial interactions against candidates for the US House of Representatives during the run-up to the 2018 US general election, and develops a new technique for detecting tweets with toxic content that are directed at any specific candidate.
Social Media and Fake News in the 2016 Election
Following the 2016 U.S. presidential election, many have expressed concern about the effects of false stories (“fake news”), circulated largely through social media. We discuss the economics of fake
Retweet networks of the European Parliament: evaluation of the community structure
Empirical evidence is provided that the formation of retweet networks and community detection are appropriate tools to reveal real-world relationships, and can be used to uncover hidden properties when the ‘ground truth’ is not known.
#DebateNight: The Role and Influence of Socialbots on Twitter During the 1st U.S. Presidential Debate
It is found that many highly influential Twitter users are in fact pro-Democrat and that most pro-Republican users are mid-influential and likely to be human (low botness); a two-dimensional "polarization map" allows for a nuanced analysis of the interplay between botness, partisanship and influence.
Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose
Data collected using Twitter's sampled API service is compared with data collected using the full, albeit costly, Firehose stream that includes every single published tweet to help researchers and practitioners understand the implications of using the Streaming API.
Fake news on Twitter during the 2016 U.S. presidential election
Exposure to and sharing of fake news by registered voters on Twitter was examined and it was found that engagement with fake news sources was extremely concentrated and individuals most likely to engage withfake news sources were conservative leaning, older, and highly engaged with political news.
News Sharing User Behaviour on Twitter: A Comprehensive Data Collection of News Articles and Social Interactions
A robust pipeline for collecting datasets describing news sharing is built and a large-scale dataset is provided that can be used to study the social behavior of Twitter users and their involvement in the dissemination of news items.