DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing
@article{Gao2022DRBENCHDR, title={DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing}, author={Yanjun Gao and Dmitriy Dligach and Timothy Miller and John R. Caskey and Brihat Sharma and Matthew M. Churpek and Majid Afshar}, journal={Journal of biomedical informatics}, year={2022}, pages={ 104286 } }
Figures and Tables from this paper
References
SHOWING 1-10 OF 59 REFERENCES
Tasks as needs: reframing the paradigm of clinical natural language processing research for real-world decision support
- MedicineJ. Am. Medical Informatics Assoc.
- 2022
Abstract Electronic medical records are increasingly used to store patient information in hospitals and other clinical settings. There has been a corresponding proliferation of clinical natural…
Hierarchical Annotation for Building A Suite of Clinical Natural Language Processing Tasks: Progress Note Understanding
- Computer ScienceLREC
- 2022
An annotated corpus based on a large collection of publicly available daily progress notes, a type of EHR that is time-sensitive, problem-oriented, and well-documented by the format of Subjective, Objective, Assessment and Plan (SOAP).
Summarizing Patients’ Problems from Hospital Progress Notes Using Pre-trained Sequence-to-Sequence Models
- Computer ScienceCOLING
- 2022
This work proposes a new NLP task that aims to generate a list of problems in a patient’s daily care plan using input from the provider's progress notes during hospitalization, and investigates the performance of T5 and BART, two state-of-the-art seq2seq transformer architectures.
A Scoping Review of Publicly Available Language Tasks in Clinical Natural Language Processing
- Computer ScienceJ. Am. Medical Informatics Assoc.
- 2022
A scoping review of papers on clinical natural language processing shared tasks that use publicly available electronic health record data from a cohort of patients to identify gaps with divergent interests between the general domain NLP community and the clinical informatics community for task motivation and design and in generalizability of the data sources.
Clinical Note Section Detection Using a Hidden Markov Model of Unified Medical Language System Semantic Types.
- MedicineAMIA ... Annual Symposium proceedings. AMIA Symposium
- 2021
A system that used a heterogeneous hidden Markov model to identify seven note sections that show promise for an approach to segment clinical notes into sections for subsequent NLP tasks.
Lessons from Natural Language Inference in the Clinical Domain
- Computer ScienceEMNLP
- 2018
This work introduces MedNLI - a dataset annotated by doctors, performing a natural language inference task (NLI), grounded in the medical history of patients, and presents strategies to leverage transfer learning using datasets from the open domain and incorporate domain knowledge from external data and lexical sources.
What Disease does this Patient Have? A Large-scale Open Domain Question Answering Dataset from Medical Exams
- Computer ScienceApplied Sciences
- 2021
This work presents the first free-form multiple-choice OpenQA dataset for solving medical problems, MedQA, collected from the professional medical board exams, and implements both rule-based and popular neural methods by sequentially combining a document retriever and a machine comprehension model.
Evaluation of Clinical Text Segmentation to Facilitate Cohort Retrieval
- Computer ScienceAMIA
- 2017
It is suggested that searching specific sections may improve precision under certain conditions and often with loss of recall, although chart notes incorporate structure that may facilitate accurate retrieval.
emrQA: A Large Corpus for Question Answering on Electronic Medical Records
- Computer ScienceEMNLP
- 2018
A novel methodology to generate domain-specific large-scale question answering (QA) datasets by re-purposing existing annotations for other NLP tasks is proposed and an instance of this methodology is demonstrated in generating a large- scale QA dataset for electronic medical records.
Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset
- Computer ScienceACL
- 2020
An in-depth analysis of the emrQA dataset and the clinical reading comprehension (CliniRC) task is provided and the ability to utilize clinical domain knowledge and to generalize to unseen questions and contexts are explored.