Feature selection in pulmonary function test data with machine learning methods

Abstract

Pulmonary function test has vital importance in diagnosis and treatment of lung diseases. With this test, several parameters are measured such as forced vital capacity (FVC) and forced expiratory volume in the first second (FEV<sub>1</sub>) of patients. These parameters indicate different types of lung disorders. Main constraint in diagnosis is to selection of important parameters among test results. In this study, five results of pulmonary function test (PFT) are evaluated with machine learning methods and feature selections with test results are achieved. Feature selections are performed with using Naive bayes, support vector machine (SVM), linear discriminant analysis (LDA) and k-nearest neighbor classifier (k-NN) methods. The test results of 436 patients are obtained from Atatu&#x0308;rk Chest Diseases and Thoracic Surgery Training and Research Hospital in Ankara/Turkey. SVM method has a highest performance values with 89,6% accuracy, 87,4 % specificity, 71,6% sensitivity respectively. Thus, it is found with feature selection that importance order of test results are FVC, FEV<sub>1</sub>, FEV<sub>1</sub>/FVC, PEF ve FEF<sub>25/75</sub> respectively. In this study, obtained performance values are higher than most of studies in the literature.

DOI: 10.1109/SIU.2013.6531578

Cite this paper

@article{Karakis2013FeatureSI, title={Feature selection in pulmonary function test data with machine learning methods}, author={Rukiye Karakis and Inan G{\"{u}ler and Ali Hakan Isik}, journal={2013 21st Signal Processing and Communications Applications Conference (SIU)}, year={2013}, pages={1-4} }