Assessing the impact of a health intervention via user-generated Internet content

  title={Assessing the impact of a health intervention via user-generated Internet content},
  author={Ingemar Johansson and Vasileios Lampos and Elad Yom-Tov and Richard G. Pebody and Ingemar J. Cox and Jo{\~a}o Gama and Indrė Žliobaitė},
  journal={Data Mining and Knowledge Discovery},
Assessing the effect of a health-oriented intervention by traditional epidemiological methods is commonly based only on population segments that use healthcare services. Here we introduce a complementary framework for evaluating the impact of a targeted intervention, such as a vaccination campaign against an infectious disease, through a statistical analysis of user-generated content submitted on web platforms. Using supervised learning, we derive a nonlinear regression model for estimating the… 

Assessing public health interventions using Web content

Disease models from online data combined with historical patterns of disease prevalence across different areas allowed us to quantify the impact of the intervention and demonstrated that they were in line with independent assessments from public health authorities.

Assessing public health interventions using Web content Extended Abstract

Disease models from online data combined with historical patterns of disease prevalence across different areas allowed us to quantify the impact of the intervention and were in line with independent assessments from public health authorities.

The added value of online user-generated content in traditional methods for influenza surveillance

It is shown that the Twitter-based models perform poorly and hypothesise that this is mostly due to the sparsity of the data available and a limited training period, while the Google-based model provides accurate estimates with timeliness of approximately one week and has the potential to complement current surveillance systems.

Estimating the Population Impact of a New Pediatric Influenza Vaccination Program in England Using Social Media Content

These findings corroborate independent assessments from traditional surveillance data, thereby supporting the ongoing rollout of the program to primary school-age children and providing evidence of the value of social media content as an additional syndromic surveillance tool.

Systematic Review on Information Technology Approaches to Evaluate the Impact of Public Health Campaigns: Real Cases and Possible Directions

Evaluating the success of a public health campaign is critical. It helps policy makers to improve prevention strategies and close existing gaps. For instance, Brazil's “Syphilis No!” campaign reached

Tracking COVID-19 using online search

This work develops unsupervised modelling techniques based on associated symptom categories identified by the United Kingdom’s National Health Service and Public Health England, and shows that web searches improve the short-term forecasting accuracy of autoregressive models for COVID-19 deaths.

Flu Detector: Estimating influenza-like illness rates from online user-generated content

Notably, the models based on Google data achieve a high level of accuracy with respect to the most recent four flu seasons in England and were highlighted as having a great potential of becoming a complementary source to the domestic traditional flu surveillance schemes.

On Infectious Intestinal Disease Surveillance using Social Media Content

The experimental results indicate that Twitter data contain a signal that could be strong enough to complement conventional methods for IID surveillance, and like previous text regression tasks, the nonlinear approach performs better.

Screening for Cancer Using a Learning Internet Advertising System

The utility of using modern advertising systems to identify people who are likely suffering from serious medical conditions, as well as questionnaire responses provided to the conversion optimization mechanism of the Google advertisement system, are demonstrated.

Enhancing Feature Selection Using Word Embeddings: The Case of Flu Surveillance

This paper uses neural word embeddings, trained on social media content from Twitter, to determine, in an unsupervised manner, how strongly textual features are semantically linked to an underlying health concept, and proposes a hybrid feature selection method that creates a more reliable model.



Tracking the flu pandemic by monitoring the social web

A monitoring tool to measure the prevalence of disease in a population by analysing the contents of social networking tools, such as Twitter, and turning statistical information into a flu-score, which can be used at close time intervals to provide inexpensive and timely information about the state of an epidemic.

Towards detecting influenza epidemics by analyzing Twitter messages

This paper analyzes messages posted on the micro-blogging site to propose several methods to identify influenza-related messages and compare a number of regression models to correlate these messages with CDC statistics.

Discovering Health Topics in Social Media Using Topic Models

A statistical topic model created for this purpose, the Ailment Topic Aspect Model (ATAM), as well as a system for filtering general Twitter data based on health keywords and supervised classification are described, demonstrating that a single general-purpose model can identify many different health topics in social media.

Using internet searches for influenza surveillance.

This work counted daily unique queries originating in the United States that contained influenza-related search terms from the Yahoo! search engine from March 2004 through May 2008, and estimated linear models, using searches with 1-10-week lead times as explanatory variables to predict the percentage of cultures positive for influenza and deaths attributable to pneumonia and influenza in the US.

Infodemiology: Tracking Flu-Related Searches on the Web for Syndromic Surveillance

Systematically collecting and analyzing health information demand data from the Internet has considerable potential to be used for syndromic surveillance.

Flu Detector - Tracking Epidemics on Twitter

We present an automated tool with a web interface for tracking the prevalence of Influenza-like Illness (ILI) in several regions of the United Kingdom using the contents of Twitter's microblogging

Detecting influenza epidemics using search engine query data

A method of analysing large numbers of Google search queries to track influenza-like illness in a population and accurately estimate the current level of weekly influenza activity in each region of the United States with a reporting lag of about one day is presented.

Nowcasting Events from the Social Web with Statistical Learning

A general methodology for inferring the occurrence and magnitude of an event or phenomenon by exploring the rich amount of unstructured textual information on the social part of the Web by investigating two case studies of geo-tagged user posts on the microblogging service of Twitter.

Pandemics in the Age of Twitter: Content Analysis of Tweets during the 2009 H1N1 Outbreak

Twitter can be used for real-time content analysis and knowledge translation research, allowing health authorities to respond to public concerns, and illustrates the potential of using social media to conduct “infodemiology” studies for public health.

Reassessing Google Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative Epidemiological Study at Three Geographic Scales

GFT data may not provide reliable surveillance for seasonal or pandemic influenza and should be interpreted with caution until the algorithm can be improved and evaluated, and current internet search query data are no substitute for timely local clinical and laboratory surveillance, or national surveillance based on local data collection.