Annotation of Entities and Relations in Spanish Radiology Reports

@inproceedings{Cotik2017AnnotationOE,
  title={Annotation of Entities and Relations in Spanish Radiology Reports},
  author={Viviana Cotik and Dar{\'i}o Filippo and Roland Roller and Hans Uszkoreit and Feiyu Xu},
  booktitle={RANLP},
  year={2017}
}
Radiology reports express the results of a radiology study and contain information about anatomical entities, findings, measures and impressions of the medical doctor. The use of information extraction techniques can help physicians to access this information in order to understand data and to infer further knowledge. Supervised machine learning methods are very popular to address information extraction, but are usually domain and language dependent. To train new classification models… 

Figures and Tables from this paper

Pre-trained language models to extract information from radiological reports
TLDR
This paper describes the participation of the SINAI team in the SpRadIE challenge: Information Extraction from Spanish radiology reports which consists of identifying biomedical entities related to the radiological domain and proposes two approaches based on pretrained models using the BERT architecture.
Cataloguing Spanish Medical Reports with UMLS Terms
TLDR
An approach to mitigate shortcomings, which pipelines state-of-the-art language translation services with automatic mapping tools, is described and demonstrated by combining Google Translate with MetaMap and by using the resulting pipeline to catalog, with UMLS, a representative set of Spanish-written X-Ray Thorax reports corresponding to images taken from the Indiana Chest X-ray radiology corpus.
Automatic Extraction of Nested Entities in Clinical Referrals in Spanish
TLDR
A new clinical corpus rich in nested entities and a series of neural models to identify them, which can leverage statistics of diseases and pending procedures and build models for Named Entity Recognition (NER) is described.
Extracting information from radiology reports by Natural Language Processing and Deep Learning
TLDR
The best results have been obtained by the CRF model, which has obtained a Lenient F1 score of 77% for a dataset that contained most of the words in the training dataset and a 67% of Lenient score for the dataset with words that are not present in the dataset used to train the model.
A clinical trials corpus annotated with UMLS entities to enhance the access to evidence-based medicine
TLDR
This resource is adequate for experiments with state-of-the-art approaches to biomedical named entity recognition and is generalizable to other languages with similar available sources.
Window Classifiers and Conditional Random Fields for Medical Report De-Identification
TLDR
Two methods, a window classifier and an implementation of conditional random fields (CRF) are presented in order to de-identify personal information of Spanish medical records provided by the MEDDOCAN challenge.
A multi-BERT hybrid system for Named Entity Recognition in Spanish radiology reports
TLDR
The overall results demonstrate the potential of pre-trained language models and cross-lingual word alignment for limited corpus and low-resource NER in the clinical domain.
Finding Mentions of Abbreviations and Their Definitions in Spanish Clinical Cases: The BARR2 Shared Task Evaluation Results
TLDR
The overall aim of this effort was to evaluate strategies for detecting automatically mentions of abbreviations in running text, as well as returning their corresponding definition given the corresponding context from Spanish clinical case studies.
The Chilean Waiting List Corpus: a new resource for clinical Named Entity Recognition in Spanish
TLDR
This work constitutes the first annotated corpus using clinical narratives from Chile, and one of the few for the Spanish language, to obtain preliminary results for Named Entity Recognition (NER).
...
...

References

SHOWING 1-10 OF 23 REFERENCES
A fine-grained corpus annotation schema of German nephrology records
TLDR
A fine-grained annotation schema to detect named entities in German clinical data of chronically ill patients with kidney diseases and a semi-automatic annotation which uses additional sources of knowledge such as UMLS, to pre-annotate concepts in advance are presented.
Syntactic methods for negation detection in radiology reports in Spanish
TLDR
Different algorithms developed to determine if a term of interest is under the scope of negation in radiology reports written in Spanish are introduced and outperform a simple dictionary lookup algorithm developed as baseline.
Rule-based information extraction from patients' clinical data
Automatically Correlating Clinical Findings and Body Locations in Radiology Reports Using MedLEE
TLDR
A system that extracts clinical findings and body locations from radiology reports and correlates them using Medical Language Extraction and Encoding System’s semantic representation and a lightweight reasoning engine is described.
Annotating Negation in Spanish Clinical Texts
TLDR
A corpus of anamnesis and radiology reports has been annotated by two domain expert annotators with negation markers and negated events, constituting the first corpus annotated for Spanish clinical reports available for the NLP community.
Automatic Retrieval of Bone Fracture Knowledge Using Natural Language Processing
TLDR
This work developed and validated an NLP system which extracts fracture and anatomy concepts from unstructured text and retrieves relevant bone fracture knowledge in real time and implements the system in an HTML5 web application to demonstrate a proof-of-concept feedback NLPsystem.
Automated Detection of Radiology Reports that Document Non-routine Communication of Critical or Significant Results
TLDR
A rule-based query algorithm was developed after analyzing hundreds of radiology reports that indicated communication of critical or significant results to a healthcare provider, consisting of words and phrases used by radiologists to indicate such communications combined with specific handcrafted rules.
2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text
TLDR
The 2010 i2b2/VA Workshop on Natural Language Processing Challenges for Clinical Records presented three tasks, which showed that machine learning approaches could be augmented with rule-based systems to determine concepts, assertions, and relations.
...
...