Enhancing instance-based classification with local density: a new algorithm for classifying unbalanced biomedical data

Abstract

MOTIVATION Classification is an important data mining task in biomedicine. In particular, classification of biomedical data often requires separating pathological from healthy samples with the highest possible discriminatory performance for diagnostic purposes. Even more important than overall accuracy is the balance of a classifier, particularly when the datasets examined have unbalanced class sizes.

RESULTS We present a novel instance-based classification technique that takes into account both the differing local densities of data objects and local cluster structures. Our method, which adopts the basic ideas of density-based outlier detection, determines the local point density in the neighborhood of an object to be classified and of all clusters in the corresponding region. A data object is assigned to the class in which it fits best into the local cluster structure. The experimental evaluation on biomedical data demonstrates that our approach outperforms most popular classification methods.

AVAILABILITY The algorithm LCF is available for testing at http://biomed.umit.at/upload/lcfx.zip.
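The idea in the RESULTS paragraph can be illustrated with a small sketch. This is not the authors' LCF implementation; it is a simplified, LOF-style stand-in written under several assumptions: density is taken as the inverse mean distance to the k nearest neighbors, each class is treated as a single cluster, and the names `knn_mean_dist`, `outlier_score`, `classify` and the parameter `k` are hypothetical. For each class, the sketch compares the local density around the query with the density around its nearest neighbors in that class, and assigns the query to the class where this ratio is smallest.

```python
import numpy as np

def knn_mean_dist(point, data, k, exclude_self=False):
    """Mean Euclidean distance from `point` to its k nearest rows of `data`.

    Returns (mean distance, indices of those rows).
    """
    d = np.linalg.norm(data - point, axis=1)
    order = np.argsort(d)
    if exclude_self:
        order = order[1:]  # drop the closest row, assumed to be the point itself
    idx = order[:k]
    return d[idx].mean(), idx

def outlier_score(query, class_points, k):
    """LOF-style ratio: mean neighbor density / query density within one class.

    Values near or below 1 mean the query is about as densely embedded as its
    neighbors; large values mean it lies outside the class's local structure.
    Assumes the class has at least k + 1 points.
    """
    eps = 1e-12  # guards against division by zero for duplicate points
    q_dist, idx = knn_mean_dist(query, class_points, k)
    nb_dists = [
        knn_mean_dist(class_points[i], class_points, k, exclude_self=True)[0]
        for i in idx
    ]
    q_density = 1.0 / (q_dist + eps)
    nb_density = np.mean([1.0 / (d + eps) for d in nb_dists])
    return nb_density / q_density

def classify(query, X, y, k=3):
    """Assign `query` to the class with the smallest outlier score."""
    scores = {label: outlier_score(query, X[y == label], k)
              for label in np.unique(y)}
    return min(scores, key=scores.get), scores

# Toy unbalanced data (made up for illustration): a dense class 0 with
# 9 points and a sparse class 1 with 4 points.
grid = [-0.1, 0.0, 0.1]
X = np.array([[a, b] for a in grid for b in grid]
             + [[4.0, 4.0], [6.0, 4.0], [4.0, 6.0], [6.0, 6.0]])
y = np.array([0] * 9 + [1] * 4)

label, scores = classify(np.array([5.0, 5.0]), X, y, k=3)
print(label, scores)
```

Because each score is a ratio of densities rather than an absolute distance or a vote count, a sparse minority class is not penalized merely for being sparse or small, which is the motivation the abstract gives for using local density on unbalanced data.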

DOI: 10.1093/bioinformatics/btl027

Cite this paper

@article{Plant2006EnhancingIC,
  title={Enhancing instance-based classification with local density: a new algorithm for classifying unbalanced biomedical data},
  author={Claudia Plant and Christian B{\"{o}}hm and Bernhard Tilg and Christian Baumgartner},
  journal={Bioinformatics},
  year={2006},
  volume={22},
  number={8},
  pages={981--988}
}