Multiple Imputation for Categorical Time Series

  title={Multiple Imputation for Categorical Time Series},
  author={Brendan Halpin},
  journal={The Stata Journal},
  pages={590 - 612}
  • B. Halpin
  • Published 1 September 2016
  • Computer Science
  • The Stata Journal
The mict package provides a method for multiple imputation of categorical time-series data (such as life course or employment status histories) that preserves longitudinal consistency, using a monotonic series of imputations. It allows flexible imputation specifications with a model appropriate to the target variable (mlogit, ologit, etc.). Where transitions in individual units’ data are substantially less frequent than one per period and where missingness tends to be consecutive (as is typical… Expand

Figures and Tables from this paper

Time-Series Imputation with Wasserstein Interpolation for Optimal Look-Ahead-Bias and Variance Tradeoff
By connecting layers of information revealed in time, this work proposes a Bayesian posterior consensus distribution which optimally controls the variance and look-ahead-bias trade-off in the imputation. Expand
Duration, timing and order: How housing histories relate to later life wellbeing
Accumulation, critical period and social mobility are three powerful, interrelated life course mechanisms often tested using relatively crude empirical measures. This contribution wants to highlightExpand
Career trajectories and cumulative wages: The case of temporary employment
Abstract Using data from the German Socio-Economic Panel (SOEP, 1994–2017), we investigate how 10-year career sequences, which can be grouped into distinct career patterns, relate to cumulative laborExpand
Paid work, household work, or leisure? Time allocation pathways among women following a cancer diagnosis.
Being diagnosed with breast cancer or skin cancer has no significant long-term effect on the pathways of time allocation, and future work should explore the relationship between time allocation and wellbeing and the preference for time allocation. Expand
Trajectory Modelling Techniques Useful to Epidemiological Research: A Comparative Narrative Review of Approaches
Improved consistency in the terminology and reporting guidelines have the potential to increase researchers’ efficiency when it comes to choosing the most appropriate technique that best suits their research questions. Expand
Pathways to death: The co-occurrence of physical and mental health in the last years of life
[Background:] Physical and mental health are important markers of quality of life. Little is known about how they unfold in tandem in the last years of life. [Objective:] We take a life-courseExpand
Presenting a data imputation concept to support the continuous assessment of human vital data and activities
A data imputation concept is introduced which complements and thus fixes incomplete datasets by using a new merging approach that is particularly suitable for assessing activities and vital data. Expand
SADI: Sequence Analysis Tools for Stata
The SADI package provides tools for sequence analysis, which focuses on the similarity and dissimilarity between categorical time series such as life-course trajectories. SADI‘s main components areExpand
Trajectories of Vulnerability: A Sequence-Analytical Approach
A growing proportion of the European population faces situations of vulnerability. Stable employees feel more and more at risk of losing their job or of experiencing a deterioration of theirExpand
Riding the Waves of Work and Life: Explaining Long-Term Experiences with Work Hour Mismatches
Paid work has become more precarious in the recent decades, prompting many conflicts between employers and employees, including struggles over work hours. To better understand these struggles, weExpand


Multiple Imputation of Missing Values: Further Update of Ice, with an Emphasis on Categorical Variables
The ice package for multiple imputation by chained equations (also known as fully conditional specification) is further updated, with special attention paid to categorical variables. Expand
Multiple imputation of missing blood pressure covariates in survival analysis.
A non-response problem in survival analysis where the occurrence of missing data in the risk factor is related to mortality is studied, and multiple imputation is used to impute missing blood pressure and then analyse the data under a variety of non- response models. Expand
Multiple imputation of discrete and continuous data by fully conditional specification
FCS is a semi-parametric and flexible alternative that specifies the multivariate model by a series of conditional models, one for each incomplete variable, but its statistical properties are difficult to establish. Expand
Application of Multiple Imputation using the Two-Fold Fully Conditional Specification Algorithm in Longitudinal Clinical Data
A new command is described, twofold, that implements the two-fold fully conditional specification algorithm to impute missing data in longitudinal data, extended to accommodate MI of longitudinal clinical records in large databases. Expand
Missing values in longitudinal dietary data: a multiple imputation approach based on a fully conditional specification.
This goal has been to develop a valid and efficient MI procedure for the Diabetes Prediction and Prevention Nutrition Study, in which the diet of a cohort of newborn children with HLA-DQB1-conferred susceptibility to type 1 diabetes is repeatedly measured by 3-day food records over early childhood. Expand
What to Do about Missing Values in Time‐Series Cross‐Section Data
Applications of modern methods for analyzing data with missing values, based primarily on multiple imputation, have in the last half‐decade become common in American politics and political behavior.Expand
MICE: Multivariate Imputation by Chained Equations in R
Mice adds new functionality for imputing multilevel data, automatic predictor selection, data handling, post-processing imputed values, specialized pooling routines, model selection tools, and diagnostic graphs. Expand
Unified BHPS Work-Life Histories: Combining Multiple Sources into a User-Friendly Format
Longitudinal data is often difficult to use, and continuous histories collected in a panel are a particularly unfriendly case. This paper reports an exercise to re-organise the British HouseholdExpand
Career pathways for temporary workers: exploring heterogeneous mobility dynamics with sequence analysis.
A typology of trajectories is derived and age, gender, and type of temporary work stand out as important factors shaping subsequent mobility patterns and their relative precariousness in relation to employment stability and wage and earnings levels and growth. Expand
Multiple imputation for nonresponse in surveys
This work focuses on the development of Imputation Models for Social Security Benefit Reconciliation in the context of a Finite Population and examines the role of Bayesian and Randomization--Based Inferences in these models. Expand