Nowcasting Events from the Social Web with Statistical Learning

  title={Nowcasting Events from the Social Web with Statistical Learning},
  author={Vasileios Lampos and Nello Cristianini},
  journal={ACM Trans. Intell. Syst. Technol.},
We present a general methodology for inferring the occurrence and magnitude of an event or phenomenon by exploring the rich amount of unstructured textual information on the social part of the Web. Having geo-tagged user posts on the microblogging service of Twitter as our input data, we investigate two case studies. The first consists of a benchmark problem, where actual levels of rainfall in a given location and time are inferred from the content of tweets. The second one is a real-life task… 

Detecting Events and Patterns in Large-Scale User Generated Textual Streams with Statistical Learning Methods

It is shown that it is possible to extract valuable information about social phenomena by automatic analysis of the content published in Social Media, and in particular Twitter, using Statistical Machine Learning methods.

Personalized emerging topic detection based on a term aging model

This article proposes a novel, user-aware topic detection technique that permits to retrieve, in real time, the most emerging topics of discussion expressed by the community within the interests of specific users.

Epidemic Intelligence for the Crowd, by the Crowd

This work reports how tracking Twitter helped to detect key user messages that triggered signal detection alarms before MedISys and other well established early warning systems and introduces a personalized learning to rank approach that exploits the relationships discovered by latent semantic topics computed using Latent Dirichlet Allocation.

Forecasting Stock Returns Based on Event Detection from Twitter

The results showed that the tweets collected by the financial community allow us to detect important events in the life of companies and show an above average value when compared to the application of the Buy&Hold strategy to DJIA.

Detecting and Tracking Significant Events for Individuals on Twitter by Monitoring the Evolution of Twitter Followership Networks

This article presents an event discovery method based on the change of the user’s followers, which can detect the occurrences of significant events relevant to the particular user and divides these events into categories according to the positive or negative effect on the specific user.

A Location Inferring Model Based on Tweets and Bilateral Follow Friends

This paper proposes a location inferring model for microblog users who have not geo-tagged based on their tweets content and bilateral follow friends, and can achieve better accuracy than the TEDAS method.

Twitter as a Personalizable Information Service

This chapter proposes an innovative term aging model, based on a biological metaphor, to retrieve the freshest arguments of discussion, represented through a minimal set of terms, expressed by the community within the foci of interest of a specific user.

Segmenting, Summarizing and Predicting Data Sequences

This dissertation shows how to carefully exploit within-sequence and across-sequence correlations to develop more effective and scalable algorithms to solve sequence mining problems in other domains as well.

Syndromic surveillance of Flu on Twitter using weakly supervised temporal topic models

Two temporal topic models are proposed to capture hidden states of a user from his tweets and aggregate states in a geographical region for better estimation of trends and it is shown that these approaches help fill the gap between phenomenological methods for disease surveillance and epidemiological models.

Event detection from Twitter - a survey

This study aims to present and review the varied methods carried out for event detection from microblogs, a popular Web service that enables users to post small pieces of digital content, such as text, picture, video and link to external resource.



Towards detecting influenza epidemics by analyzing Twitter messages

This paper analyzes messages posted on the micro-blogging site to propose several methods to identify influenza-related messages and compare a number of regression models to correlate these messages with CDC statistics.

Earthquake shakes Twitter users: real-time event detection by social sensors

This paper investigates the real-time interaction of events such as earthquakes in Twitter and proposes an algorithm to monitor tweets and to detect a target event and produces a probabilistic spatiotemporal model for the target event that can find the center and the trajectory of the event location.

Tracking the flu pandemic by monitoring the social web

A monitoring tool to measure the prevalence of disease in a population by analysing the contents of social networking tools, such as Twitter, and turning statistical information into a flu-score, which can be used at close time intervals to provide inexpensive and timely information about the state of an epidemic.

Flu Detector - Tracking Epidemics on Twitter

We present an automated tool with a web interface for tracking the prevalence of Influenza-like Illness (ILI) in several regions of the United Kingdom using the contents of Twitter's microblogging

Predicting the Future with Social Media

  • S. AsurB. Huberman
  • Computer Science
    2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology
  • 2010

Monitoring Influenza Trends through Mining Social Media

The results of the analysis show Influenza-related blogging trends have a significant correlation with the beginning of US Fall 2008 flu season, and identifies WSM Influenza -related communities that share flu-postings which could broker or disseminate information in the case of a severe outbreak or Influenza epidemic.

Detecting influenza epidemics using search engine query data

A method of analysing large numbers of Google search queries to track influenza-like illness in a population and accurately estimate the current level of weekly influenza activity in each region of the United States with a reporting lag of about one day is presented.

Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment

It is found that the mere number of messages mentioning a party reflects the election result, and joint mentions of two parties are in line with real world political ties and coalitions.

Opinion Mining and Sentiment Analysis

This survey covers techniques and approaches that promise to directly enable opinion-oriented information-seeking systems and focuses on methods that seek to address the new challenges raised by sentiment-aware applications, as compared to those that are already present in more traditional fact-based analysis.