• Publications
  • Influence
Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose
TLDR
Twitter is a social media giant famous for the exchange of short, 140-character messages called "tweets". Expand
  • 776
  • 61
  • PDF
Feature Selection
TLDR
Feature selection, as a data preprocessing strategy, has been proven to be effective and efficient in preparing data (especially high-dimensional data) for various data-mining and machine-learning problems. Expand
  • 584
  • 37
  • PDF
Advancing Feature Selection Research − ASU Feature Selection Repository
TLDR
The rapid advance of computer based high-throughput technique have provided unparalleled opportunities for humans to expand capabilities in production, services, communications, and research. Expand
  • 115
  • 11
  • PDF
A new approach to bot detection: Striking the balance between precision and recall
TLDR
We propose a bot detection method that optimizes the F1 score of the model, which considers recall in addition to precision. Expand
  • 112
  • 10
  • PDF
Twitter Data Analytics
TLDR
This brief provides methods for harnessing Twitter data to discover solutions to complex inquiries. Expand
  • 214
  • 7
  • PDF
Advancing feature selection research
TLDR
Feature selection is an essential step in successful data mining applications, which can effectively reduce data dimensionality by removing the irrelevant (and the redundant) features. Expand
  • 140
  • 6
A Survey on Bias and Fairness in Machine Learning
TLDR
We review research investigating how biases in data skew what is learned by machine learning algorithms, and we listed different sources of biases that can affect AI applications. Expand
  • 152
  • 6
  • PDF
Finding Eyewitness Tweets During Crises
Disaster response agencies incorporate social media as a source of fast-breaking information to understand the needs of people affected by the many crises that occur around the world. These agenciesExpand
  • 45
  • 5
  • PDF
Whom should I follow?: identifying relevant users during crises
TLDR
Social media is gaining popularity as a medium of communication before, during, and after crises. Expand
  • 58
  • 4
  • PDF
Leveraging the Implicit Structure within Social Media for Emergent Rumor Detection
TLDR
We propose a method for classifying conversations within their formative stages as well as improving accuracy within mature conversations through the discovery of implicit linkages between conversation fragments. Expand
  • 52
  • 3
  • PDF