• Corpus ID: 240288799

Using Text Analytics for Health to Get Meaningful Insights from a Corpus of COVID Scientific Papers

  title={Using Text Analytics for Health to Get Meaningful Insights from a Corpus of COVID Scientific Papers},
  author={Dmitry Soshnikov and Vickie Soshnikova},
Since the beginning of COVID pandemic, there have been around 700000 scientific papers published on the subject. A human researcher cannot possibly get acquainted with such a huge text corpus — and therefore developing AI-based tools to help navigating this corpus and deriving some useful insights from it is highly needed. In this paper, we will use Text Analytics for Health pre-trained service together with some cloud tools to extract some knowledge from scientific papers, gain insights, and… 

Figures from this paper



CORD-19: The Covid-19 Open Research Dataset

The mechanics of dataset construction are described, highlighting challenges and key design decisions, an overview of how CORD-19 has been used, and several shared tasks built around the dataset are described.

Extracting medications and associated adverse drug events using a natural language processing system combining knowledge base and deep learning

The results indicate that a well-designed hybrid NLP system is capable of ADE and medication-related information extraction, which can be used in real-world applications to support ADE-related researches and medical decisions.

The Unified Medical Language System (UMLS): integrating biomedical terminology

The Unified Medical Language System is a repository of biomedical vocabularies developed by the US National Library of Medicine and includes tools for customizing the Metathesaurus (MetamorphoSys), for generating lexical variants of concept names (lvg) and for extracting UMLS concepts from text (MetaMap).

Efficacy of chloroquine versus lopinavir/ritonavir in mild/general COVID-19 infection: a prospective, open-label, multicenter, randomized controlled clinical study

This experiment should reveal the efficacy and safety of using chloroquine versus lopinavir/ritonavir for patients with mild/general COVID-19 infection and if the new treatment including chlorine shows a higher rate of throat-swab SARS-CoV-2 real-time fluorescent reverse transcription polymerase chain reaction (RT-PCR) negativity and is safe, it could be tested as a future CO VID-19 treatment.

Performing Large Science Experiments on Azure: Pitfalls and Solutions

This paper examines the top challenges identified in their own large-scale science experiments running on the Windows Azure platform and describes a Cloud-based parameter sweep prototype (dubbed Cirrus) which provides a framework of solutions for each challenge.

Azure machine learning." Microsoft Azure Essentials

  • 2015

Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing

It is shown that for domains with abundant unlabeled text, such as biomedicine, pretraining language models from scratch results in substantial gains over continual pretraining of general-domain language models.