Corpus ID: 236447922

A Biomedically oriented automatically annotated Twitter COVID-19 Dataset

  title={A Biomedically oriented automatically annotated Twitter COVID-19 Dataset},
  author={Luis Alberto Robles Hernandez and Tiffany J. Callahan and J. Banda},
The use of social media data, like Twitter, for biomedical research has been gradually increasing over the years. With the COVID-19 pandemic, researchers have turned to more non-traditional sources of clinical data to characterize the disease in near-real time, study the societal implications of interventions, as well as the sequelae that recovered COVID-19 cases present (Long-). However, manually curated social media datasets are difficult to come by due to the expensive costs of manual… Expand

Tables from this paper


Top Concerns of Tweeters During the COVID-19 Pandemic: Infoveillance Study
The main topics posted by Twitter users related to the COVID-19 pandemic were identified and grouped into four main themes: origin of the virus; its sources; its impact on people, countries, and the economy; and ways of mitigating the risk of infection. Expand
Long-term patient-reported symptoms of COVID-19: an analysis of social media data
This work uses a combination of natural language processing and clinician reviews to identify long term self-reported symptoms on a set of Twitter users, and identifies latent symptoms that might be underreported in other places. Expand
Mining twitter to explore the emergence of COVID-19 symptoms.
Findings revealed that many COVID-19-related symptoms mentioned in Twitter tweets earlier than the announcement by the CDC were mentioned in tweets posted during the early stages of the pandemic. Expand
KG-COVID-19: a framework to produce customized knowledge graphs for COVID-19 response
KG-COVID-19 is a flexible framework that ingests and integrates biomedical data to produce knowledge graphs (KGs) that can be customized for downstream applications including machine learning tasks, hypothesis-based querying, and browsable user interface to enable researchers to explore CO VID-19 data and discover relationships. Expand
A scoping review of the use of Twitter for public health research
A clear picture of the use of Twitter for public health is obtained and insights are gained into how the popularity of different domains changed with time, the diseases and conditions studied and the different approaches to understanding each disease, which algorithms and techniques were popular with each domain, and more. Expand
Machine Learning to Detect Self-Reporting of Symptoms, Testing Access, and Recovery Associated With COVID-19 on Twitter: Retrospective Big Data Infoveillance Study
This study used unsupervised machine learning for the purposes of characterizing self-reporting of symptoms, experiences with testing, and mentions of recovery related to COVID-19. Expand
Representing the Twittersphere: Archiving a representative sample of Twitter data under resource constraints
This work proposes and test a methodology for inexpensively creating an archive of Twitter data through population sampling, yielding a database that is highly representative of the targeted user population (in this test case, the entire population of Japanese-language Twitter users), and concludes that this approach yields a data set that is suitable for a wide range of post-hoc analyses. Expand
Pandemics in the Age of Twitter: Content Analysis of Tweets during the 2009 H1N1 Outbreak
Twitter can be used for real-time content analysis and knowledge translation research, allowing health authorities to respond to public concerns, and illustrates the potential of using social media to conduct “infodemiology” studies for public health. Expand
The Story of Goldilocks and Three Twitter’s APIs: A Pilot Study on Twitter Data Sources and Disclosure
This study examines whether tweets collected using the same search filters over the same time period, but calling different APIs, would retrieve comparable datasets, and retrieved tweets about anti-smoking, e-cigarettes, and tobacco using the aforementioned APIs. Expand
Twitter as a Tool for Health Research: A Systematic Review
A new taxonomy to describe Twitter use in health research with 6 categories is identified and many data elements discernible from a user's Twitter profile are underreported in the literature and can provide new opportunities to characterize the users whose data are analyzed in these studies. Expand