The Treatment of Missing Values and its Effect on Classifier Accuracy

  title={The Treatment of Missing Values and its Effect on Classifier Accuracy},
  author={E. Acu{\~n}a and C. Rodr{\'i}guez},
  • E. Acuña, C. Rodríguez
  • Published 2004
  • Computer Science
  • The presence of missing values in a dataset can affect the performance of a classifier constructed using that dataset as a training sample. Several methods have been proposed to treat missing data and the one used most frequently deletes instances containing at least one missing value of a feature. In this paper we carry out experiments with twelve datasets to evaluate the effect on the misclassification error rate of four methods for dealing with missing values: the case deletion method, mean… CONTINUE READING
    411 Citations
    Impact of imputation of missing values on classification error for discrete data
    • 251
    • PDF
    On the choice of the best imputation methods for missing values considering three groups of classification methods
    • 161
    • PDF
    Improving accuracy rate of imputation of missing data using classifier methods
    • 7
    Predicting Missing Values in a Dataset : Challenges and Approaches
    • 2
    • PDF
    Dynamic discriminant functions with missing feature values
    • 2


    The Treatment of Missing Values in Discriminant Analysis—I. The Sampling Experiment
    • 74
    A Comparison of Several Approaches to Missing Attribute Values in Data Mining
    • 412
    • PDF
    Imputing Missing Values: The Effect on the Accuracy of Classification
    • 50
    • PDF
    Missing value estimation methods for DNA microarrays
    • 2,893
    • PDF
    Imputation techniques in regression analysis: looking closely at their implementation
    • 29
    Efficient Methods for Dealing with Missing Data in Supervised Learning
    • 89
    • PDF
    Statistical Analysis with Missing Data.
    • 3,721
    Pattern Recognition with Partly Missing Data
    • J. K. Dixon
    • Computer Science
    • IEEE Transactions on Systems, Man, and Cybernetics
    • 1979
    • 198
    • PDF
    Imputing Missing Data for Gene Expression Arrays
    • 216
    • PDF
    Analysis of Incomplete Multivariate Data
    • 4,638