A shared task involving multi-label classification of clinical free text
@inproceedings{Pestian2007AST, title={A shared task involving multi-label classification of clinical free text}, author={John P. Pestian and Chris Brew and Pawel Matykiewicz and D. J. Hovermale and Neil Johnson and Kevin Bretonnel Cohen and Wlodzislaw Duch}, booktitle={BioNLP@ACL}, year={2007} }
This paper reports on a shared task involving the assignment of ICD-9-CM codes to radiology reports. [] Key Result Many systems performed at levels approaching the inter-coder agreement, suggesting that human-like performance on this task is within the reach of currently available technologies.
382 Citations
Sentiment Analysis of Suicide Notes: A Shared Task
- Psychology, Computer ScienceBiomedical informatics insights
- 2012
A shared task involving the assignment of emotions to suicide notes resulted in the corpus of fully anonymized clinical text and annotated suicide notes, suggesting that human-like performance on this task is within the reach of currently available technologies.
Machine Learning to Automate the Assignment of Diagnosis Codes to Free-text Radiology Reports : a Method Description
- Computer Science
- 2008
A multi-label classification system for the automated assignment of diagnostic codes to radiology reports is introduced, which provides insight into the development of applications for reallife usage, which are currently rare.
Enhancing Automatic ICD-9-CM Code Assignment for Medical Texts with PubMed
- Computer ScienceBioNLP
- 2017
This work proposes to strategically draw data from PubMed to enrich the training data when there is such need, and results indicate that the method can significantly improve the code assignment classifiers' performance at the macro-averaging level.
ICD Code Retrieval: Novel Approach for Assisted Disease Classification
- Computer ScienceDILS
- 2015
This paper presents a novel incremental approach to clinical Text Classification, which overcomes the low accuracy problem through the top-K retrieval, exploits Transfer Learning techniques in order to expand a skewed dataset and improves the overall accuracy over time, learning from user selection.
Semantic Annotation of Clinical Text : The CLEF Corpus
- Computer Science
- 2008
The paper describes the make-up of the annotated corpus, the semantic annotation schemes used to annotate it, details of the annotation process and of inter-annotator agreement studies, and how the annotations are being used for developing supervised machine learning models for IE tasks.
Annotating and Recognising Named Entities in Clinical Notes
- Computer ScienceACL
- 2009
A new genre of text which are not well-written, noise prone, ungrammatical and with much cryptic content is introduced, which is a mix of clinical progress notes drawn form an Intensive Care Service and clinical named entities.
Multi-Label Classification of Patient Notes a Case Study on ICD Code Assignment
- Computer ScienceAAAI Workshops
- 2018
HA-GRU, a hierarchical approach to tag a document by identifying the sentences relevant for each label achieves state-of-the art results and highlights the model decision process, allows easier error analysis, and suggests future directions for improvement.
Assigning clinical codes with data-driven concept representation on Dutch clinical free text
- Computer ScienceJ. Biomed. Informatics
- 2017
Simple approaches to disease classification based on clinical patient records
- Computer Science
- 2008
This study describes the system submitted by the team of University of Szeged to the Second I2B2 Challenge in Natural Language Processing for Clinical Data, which expected a system with dictionaries gathered semi-automatically to show a good performance with moderate development costs.
Multi-label clinical document classification: Impact of label-density
- Computer ScienceExpert Syst. Appl.
- 2019
References
SHOWING 1-10 OF 44 REFERENCES
Preparing Clinical Text for Use in Biomedical Research
- MedicineJ. Database Manag.
- 2006
C Cincinnati Children’s Hospital Medical Center (CCHMC), a large pediatric academic medical center with more than 761,000 annual patient encounters, developed open source software for making pediatric clinical text harmless without losing its rich meaning.
Second i2b2 workshop on natural language processing challenges for clinical records.
- Medicine, Computer ScienceAMIA ... Annual Symposium proceedings. AMIA Symposium
- 2008
The obesity challenge is discussed, some approaches to automatically identifying obese patients and obesity co-morbidities from medical records are reviewed, and the challenge results are presented.
Development of a Pediatric Text-Corpus for Part-of-Speech Tagging
- Computer ScienceIntelligent Information Systems
- 2004
The status of an ongoing project to create a large corpus and lexicon for use by part-of-speech tagger and other NLP research tools, aimed at developing new methods in sciences related to medical domains is presented.
Role of Local Context in Automatic Deidentification of Ungrammatical, Fragmented Text
- Computer ScienceNAACL
- 2006
It is shown that one can deidentify medical discharge summaries using support vector machines that rely on a statistical representation of local context, which contributes more to deidentification than dictionaries and hand-tailed heuristics.
TREC 2004 Genomics Track Overview
- Computer ScienceTREC
- 2004
The TREC 2004 Genomics Track consisted of two tasks that focused on categorization of full-text documents, simulating the task of curators of the Mouse Genome Informatics (MGI) system and consisting of three subtasks.
Two biomedical sublanguages: a description based on the theories of Zellig Harris
- Computer ScienceJ. Biomed. Informatics
- 2002
Text Boundary Detection of Medical Reports
- MedicineAMIA
- 2002
A critical step in information gathering involves segmentation of reports into topically cohesive sections and sentences to help realize timely, intelligent analysis and communication toward improved clinical outcome.
The sublanguage of cross-coverage
- MedicineAMIA
- 2002
Free-text "Signout" notes appear to constitute a unique sublanguage of medicine and are compared to other common medical notes on a series of quantitative metrics to better understand the requirements for parsing.
Multi-label Semantic Scene Classfication
- Computer Science
- 2003
A framework to handle semantic scene classification, where a natural scene may contain multiple objects such that the scene can be described by multiple class labels, is presented and appears to generalize to other classification problems of the same nature.
Natural language processing for online applications : text retrieval, extraction and categorization
- Computer Science
- 2002
This text covers the emerging technologies of document retrieval, information extraction, and text categorization in a way which highlights commonalities in terms of both general principles and…