Corpus ID: 7104840

Multiple imputation by chained equations in praxis: Guidelines and review

@article{Wulff2017MultipleIB,
  title={Multiple imputation by chained equations in praxis: Guidelines and review},
  author={Jesper Wulff and Linda Ejlskov Jeppesen},
  journal={The Electronic Journal of Business Research Methods},
  year={2017},
  volume={15},
  pages={41-56}
}
  • J. Wulff, L. Jeppesen
  • Published 2017
  • Computer Science
  • The Electronic Journal of Business Research Methods
Multiple imputation by chained equations (MICE) is an effective tool to handle missing data an almost unavoidable problem in quantitative data analysis. However, despite the empirical and theoretical evidence supporting the use of MICE, researchers in the social sciences often resort to inferior approaches unnecessarily risking erroneous results. The complexity of the decision process when encountering missing data may be what is discouraging potential users from adopting the appropriate… Expand
Construction and assessment of prediction rules for binary outcome in the presence of missing predictor data using multiple imputation: theoretical perspective and data-based evaluation
We investigate the problem of calibration and assessment of predictive rules in prognostic designs when missing values are present in the predictors. Our paper has two key objectives which areExpand
Addressing missing values in routine health information system data: an evaluation of imputation methods using data from the Democratic Republic of the Congo during the COVID-19 pandemic
TLDR
Seven commonly used imputation methods are implemented and recommended for the use of multiple imputation in addressing missing values in RHIS datasets and appropriate handling of data structure to minimize imputation standard errors. Expand
Fairness without Imputation: A Decision Tree Approach for Fair Prediction with Missing Values
TLDR
This paper proposes an integrated approach based on decision trees that does not require a separate process of imputation and learning, and trains a tree with missing incorporated as attribute (MIA), whichdoes not require explicit imputation, and optimize a fairness-regularized objective function. Expand
biokNN: A bi-objective imputation method for multilevel data in R
The biokNN package focus on the imputation of missing values for multilevel datasets using a biobjective k-Nearest Neighbors (biokNN) method. The package provides functions to produce single andExpand
Filling the gaps: imputation of missing metrics' values in a software quality model
TLDR
Empirically validate a few imputation methods in context of a custom Géant-QM framework, used for evaluation of several open source systems, and results indicate imputing a missing value based on its close neighbors as data donors introduces less noise that using a wider set of donors. Expand
Tuberculosis treatment outcomes of notified cases: trends and determinants of potential unfavourable outcome, France, 2008 to 2014
TLDR
Monitoring of treatment outcome is improving over time, but treatment outcome monitoring needs to be strengthened in cases belonging to population groups where the percentage of unfavourable outcome is the highest and in cases where surveillance data shows poorer documented follow-up. Expand
Risk factors for increased COVID-19 case-fatality in the United States: A county-level analysis during the first wave
TLDR
County-level variables associated with the COVID-19 case-fatality rate (CFR) using publicly available datasets and a negative binomial generalized linear model are identified to help officials target public health interventions and healthcare resources to locations that are at increased risk of CO VID-19 fatalities. Expand
Risk factors for increased COVID-19 case-fatality in the United States: A county-level analysis during the first wave
TLDR
County-level variables associated with the COVID-19 case-fatality rate (CFR) using publicly available datasets and a negative binomial generalized linear model are identified to help officials target public health interventions and healthcare resources to locations that are at increased risk of CO VID-19 fatalities. Expand
Investigating the association between socio-economic position and stillbirth in Brazil and the UK
Stillbirths are under-researched in comparison to other child health outcomes. They rarely appear in national health targets or commitments, despite being an immense global burden with 2.6 millionExpand
Vaccination coverage estimation in Mexico in children under five years old: Trends and associated factors
TLDR
The estimated vaccination coverage and CBS do not reach sufficient levels to guarantee herd immunity, hence innovative strategies to improve vaccination must be established in Mexico. Expand
...
1
2
3
4
...

References

SHOWING 1-10 OF 70 REFERENCES
Analyzing Incomplete Political Science Data: An Alternative Algorithm for Multiple Imputation
We propose a remedy for the discrepancy between the way political scientists analyze data with missing values and the recommendations of the statistics community. Methodologists and statisticiansExpand
Multiple imputation by chained equations: what is it and how does it work?
TLDR
This paper provides an introduction to the MICE method with a focus on practical aspects and challenges in using this method. Expand
How should variable selection be performed with multiply imputed data?
TLDR
Most methods improve on the naïve complete-case analysis for variable selection, but importantly the type 1 error is only preserved if selection is based on RR, which is the recommended approach. Expand
Multiple imputation for missing data via sequential regression trees.
TLDR
The authors present a nonparametric approach for implementing multiple imputation via chained equations by using sequential regression trees as the conditional models and demonstrate that the method can result in more plausible imputations, and hence more reliable inferences, in complex settings than the naive application of standard sequential regression imputation techniques. Expand
MICE: Multivariate Imputation by Chained Equations in R
TLDR
Mice adds new functionality for imputing multilevel data, automatic predictor selection, data handling, post-processing imputed values, specialized pooling routines, model selection tools, and diagnostic graphs. Expand
A critical look at methods for handling missing covariates in epidemiologic regression analyses.
TLDR
The authors recommend that epidemiologists avoid using the missing-indicator method and use more sophisticated methods whenever a large proportion of data are missing, and contrast the results of multiple imputation to simple methods in the analysis of a case-control study of endometrial cancer. Expand
Multiple Imputation for Missing Data: Making the most of What you Know
TLDR
This article presents a simulation and data analysis case study using a method for dealing with missing data, multiple imputation, that allows for valid statistical inference with complete case statistical analysis. Expand
Multiple imputation: current perspectives
TLDR
An overview of multiple imputation and current perspectives on its use in medical research, showing how the use of so-called uncongenial imputation models are particularly valuable for sensitivity analyses and also for certain analyses in clinical trial settings. Expand
Multiple imputation of missing covariate values in multilevel models with random slopes: a cautionary note
TLDR
It is suggested that MI is able to recover most parameters, but is currently not well suited to capture slope variation entirely when covariate values are missing, and listwise deletion can be an alternative worth considering when preserving the slope variance is particularly important. Expand
Performance of Sequential Imputation Method in Multilevel Applications
TLDR
In most realistic applications, the simulations suggest that the sequential method lead to well-calibrated estimates and in some settings the performance is even better than the more conventional methods with well-defined joint model in some scenarios. Expand
...
1
2
3
4
5
...