DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing

  title={DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing},
  author={Yanjun Gao and Dmitriy Dligach and Timothy Miller and John R. Caskey and Brihat Sharma and Matthew M. Churpek and Majid Afshar},
  journal={Journal of biomedical informatics},



Tasks as needs: reframing the paradigm of clinical natural language processing research for real-world decision support

Abstract Electronic medical records are increasingly used to store patient information in hospitals and other clinical settings. There has been a corresponding proliferation of clinical natural

Hierarchical Annotation for Building A Suite of Clinical Natural Language Processing Tasks: Progress Note Understanding

An annotated corpus based on a large collection of publicly available daily progress notes, a type of EHR that is time-sensitive, problem-oriented, and well-documented by the format of Subjective, Objective, Assessment and Plan (SOAP).

Summarizing Patients’ Problems from Hospital Progress Notes Using Pre-trained Sequence-to-Sequence Models

This work proposes a new NLP task that aims to generate a list of problems in a patient’s daily care plan using input from the provider's progress notes during hospitalization, and investigates the performance of T5 and BART, two state-of-the-art seq2seq transformer architectures.

A Scoping Review of Publicly Available Language Tasks in Clinical Natural Language Processing

A scoping review of papers on clinical natural language processing shared tasks that use publicly available electronic health record data from a cohort of patients to identify gaps with divergent interests between the general domain NLP community and the clinical informatics community for task motivation and design and in generalizability of the data sources.

Clinical Note Section Detection Using a Hidden Markov Model of Unified Medical Language System Semantic Types.

A system that used a heterogeneous hidden Markov model to identify seven note sections that show promise for an approach to segment clinical notes into sections for subsequent NLP tasks.

Lessons from Natural Language Inference in the Clinical Domain

This work introduces MedNLI - a dataset annotated by doctors, performing a natural language inference task (NLI), grounded in the medical history of patients, and presents strategies to leverage transfer learning using datasets from the open domain and incorporate domain knowledge from external data and lexical sources.

What Disease does this Patient Have? A Large-scale Open Domain Question Answering Dataset from Medical Exams

This work presents the first free-form multiple-choice OpenQA dataset for solving medical problems, MedQA, collected from the professional medical board exams, and implements both rule-based and popular neural methods by sequentially combining a document retriever and a machine comprehension model.

Evaluation of Clinical Text Segmentation to Facilitate Cohort Retrieval

It is suggested that searching specific sections may improve precision under certain conditions and often with loss of recall, although chart notes incorporate structure that may facilitate accurate retrieval.

emrQA: A Large Corpus for Question Answering on Electronic Medical Records

A novel methodology to generate domain-specific large-scale question answering (QA) datasets by re-purposing existing annotations for other NLP tasks is proposed and an instance of this methodology is demonstrated in generating a large- scale QA dataset for electronic medical records.

Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset

An in-depth analysis of the emrQA dataset and the clinical reading comprehension (CliniRC) task is provided and the ability to utilize clinical domain knowledge and to generalize to unseen questions and contexts are explored.