A shared task involving multi-label classification of clinical free text

@inproceedings{Pestian2007AST,
  title={A shared task involving multi-label classification of clinical free text},
  author={John P. Pestian and Chris Brew and Pawel Matykiewicz and D. J. Hovermale and Neil Johnson and Kevin Bretonnel Cohen and Wlodzislaw Duch},
  booktitle={BioNLP@ACL},
  year={2007}
}
This paper reports on a shared task involving the assignment of ICD-9-CM codes to radiology reports. [] Key Result Many systems performed at levels approaching the inter-coder agreement, suggesting that human-like performance on this task is within the reach of currently available technologies.

Figures and Tables from this paper

Sentiment Analysis of Suicide Notes: A Shared Task
TLDR
A shared task involving the assignment of emotions to suicide notes resulted in the corpus of fully anonymized clinical text and annotated suicide notes, suggesting that human-like performance on this task is within the reach of currently available technologies.
Machine Learning to Automate the Assignment of Diagnosis Codes to Free-text Radiology Reports : a Method Description
TLDR
A multi-label classification system for the automated assignment of diagnostic codes to radiology reports is introduced, which provides insight into the development of applications for reallife usage, which are currently rare.
Enhancing Automatic ICD-9-CM Code Assignment for Medical Texts with PubMed
TLDR
This work proposes to strategically draw data from PubMed to enrich the training data when there is such need, and results indicate that the method can significantly improve the code assignment classifiers' performance at the macro-averaging level.
ICD Code Retrieval: Novel Approach for Assisted Disease Classification
TLDR
This paper presents a novel incremental approach to clinical Text Classification, which overcomes the low accuracy problem through the top-K retrieval, exploits Transfer Learning techniques in order to expand a skewed dataset and improves the overall accuracy over time, learning from user selection.
Semantic Annotation of Clinical Text : The CLEF Corpus
TLDR
The paper describes the make-up of the annotated corpus, the semantic annotation schemes used to annotate it, details of the annotation process and of inter-annotator agreement studies, and how the annotations are being used for developing supervised machine learning models for IE tasks.
Annotating and Recognising Named Entities in Clinical Notes
TLDR
A new genre of text which are not well-written, noise prone, ungrammatical and with much cryptic content is introduced, which is a mix of clinical progress notes drawn form an Intensive Care Service and clinical named entities.
Multi-Label Classification of Patient Notes a Case Study on ICD Code Assignment
TLDR
HA-GRU, a hierarchical approach to tag a document by identifying the sentences relevant for each label achieves state-of-the art results and highlights the model decision process, allows easier error analysis, and suggests future directions for improvement.
Simple approaches to disease classification based on clinical patient records
TLDR
This study describes the system submitted by the team of University of Szeged to the Second I2B2 Challenge in Natural Language Processing for Clinical Data, which expected a system with dictionaries gathered semi-automatically to show a good performance with moderate development costs.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 44 REFERENCES
Preparing Clinical Text for Use in Biomedical Research
TLDR
C Cincinnati Children’s Hospital Medical Center (CCHMC), a large pediatric academic medical center with more than 761,000 annual patient encounters, developed open source software for making pediatric clinical text harmless without losing its rich meaning.
Second i2b2 workshop on natural language processing challenges for clinical records.
  • Ozlem Uzuner
  • Medicine, Computer Science
    AMIA ... Annual Symposium proceedings. AMIA Symposium
  • 2008
TLDR
The obesity challenge is discussed, some approaches to automatically identifying obese patients and obesity co-morbidities from medical records are reviewed, and the challenge results are presented.
Development of a Pediatric Text-Corpus for Part-of-Speech Tagging
TLDR
The status of an ongoing project to create a large corpus and lexicon for use by part-of-speech tagger and other NLP research tools, aimed at developing new methods in sciences related to medical domains is presented.
Role of Local Context in Automatic Deidentification of Ungrammatical, Fragmented Text
TLDR
It is shown that one can deidentify medical discharge summaries using support vector machines that rely on a statistical representation of local context, which contributes more to deidentification than dictionaries and hand-tailed heuristics.
TREC 2004 Genomics Track Overview
TLDR
The TREC 2004 Genomics Track consisted of two tasks that focused on categorization of full-text documents, simulating the task of curators of the Mouse Genome Informatics (MGI) system and consisting of three subtasks.
Text Boundary Detection of Medical Reports
TLDR
A critical step in information gathering involves segmentation of reports into topically cohesive sections and sentences to help realize timely, intelligent analysis and communication toward improved clinical outcome.
The sublanguage of cross-coverage
TLDR
Free-text "Signout" notes appear to constitute a unique sublanguage of medicine and are compared to other common medical notes on a series of quantitative metrics to better understand the requirements for parsing.
Multi-label Semantic Scene Classfication
TLDR
A framework to handle semantic scene classification, where a natural scene may contain multiple objects such that the scene can be described by multiple class labels, is presented and appears to generalize to other classification problems of the same nature.
Natural language processing for online applications : text retrieval, extraction and categorization
This text covers the emerging technologies of document retrieval, information extraction, and text categorization in a way which highlights commonalities in terms of both general principles and
...
1
2
3
4
5
...