• Corpus ID: 39755243

Variance inflation factors in the analysis of complex survey data

  title={Variance inflation factors in the analysis of complex survey data},
  author={Dan Liao and Richard Valliant},
Survey data are often used to fit linear regression models. The values of covariates used in modeling are not controlled as they might be in an experiment. Thus, collinearity among the covariates is an inevitable problem in the analysis of survey data. Although many books and articles have described the collinearity problem and proposed strategies to understand, assess and handle its presence, the survey literature has not provided appropriate diagnostic tools to evaluate its impact on… 

Figures and Tables from this paper

Condition indexes and variance decompositions for diagnosing collinearity in linear model analysis of survey data

Collinearities among explanatory variables in linear regression models affect estimates from survey data just as t hey do in non-survey data. Unde sirable effects are unnecessarily inflated standard

Statistical inference of the multiple regression analysis of complex survey data

The decision was made to research the linear modeling of a continuous response under CS with the objective of illustrating how the results could be illustrated if the statistician ignores the complex design of the data or naïvely applies WLS in comparison to the correct SWLS regression.

Linear Regression Diagnostics in Cluster Samples

Abstract An extensive set of diagnostics for linear regression models has been developed to handle nonsurvey data. The models and the sampling plans used for finite populations often entail

Selected Model Systematic Sequence via Variance Inflationary Factor

Literature reviews revealed that multicollinearity always exists when model a deals with several independent variables. This phenomenon can cause the t statistic and the related probability-value to

Some dimension reduction strategies for the analysis of survey data

An overview of some classic and modern dimension reduction methods is provided, followed by a discussion of how to use the transformed variables in the context of analyzing survey data.

Statistical Methods for Item Reduction in a Representative Lifestyle Questionnaire: Pilot Questionnaire Study

It is argued that before finalizing any lifestyle questionnaire, a posteriori validation should always be conducted using multiple approaches to ensure the robustness of the results and that questionnaire designers should consider using multiple methods for item reduction.

Calibration and Other Uses of Auxiliary Data in Weighting

The previous chapter described the first few steps used in weight calculation: base weights, adjustments of unknown eligibility, and nonresponse adjustments. The last step, which is extremely

A Bayesian Approach for the Variance of Fine Stratification

Fine stratification is a popular design as it permits the stratification to be carried out to the fullest possible extent. Some examples include the Current Population Survey and National Crime

Zero‐inflated modelling for characterizing coverage errors of extracts from the US Census Bureau's Master Address File

A set of statistical models for the Master Address File that will produce estimates of coverage error at levels of geography down to the block level is discussed, and zero‐inflated regression modelling is used to determine the predicted distribution of additions and deletions.

Collinearity Diagnostics for Complex Survey Data

Title of dissertation: Collinearity Diagnostics for Complex Survey Data Dan Liao Doctor of Philosophy, 2010 Dissertation directed by: Professor Richard Valliant Joint Program in Survey Methodology

Linear regression influence diagnostics for unclustered survey data

Diagnostics for linear regression models have largely been developed to handle nonsurvey data. The models and the sampling plans used for finite populations often entail stratification, clustering,

Regression Diagnostics for Complex Survey Data: Identification of Influential Observations

Title of Dissertation: REGRESSION DIAGNOSTICS FOR COMPLEX SURVEY DATA: IDENTIFICATION OF INFLUENTIAL OBSERVATIONS. Jianzhu Li, Doctor of Philosophy, 2007 Dissertation Directed By: Professor Richard

Survey weighted hat matrix and leverages

Regression diagnostics are geared toward identifying individual points or groups of points that have an important influence on a fitted model. When fitting a model with survey data, the sources of

The impact of collinearity involving the intercept term on the numerical accuracy of regression

It is well known that multiple linear regression models with ill-conditioning can produce coefficient estimates with degraded numerical accuracy. This study examines the numerical accuracy of

Regression Estimation for Survey Samples

Regression and regression related procedures have become common in survey estimation. We review the basic properties of regression estimators, discuss implementation of regression estimation, and

Generalized Collinearity Diagnostics

Abstract Working in the context of the linear model y = Xβ + e, we generalize the concept of variance inflation as a measure of collinearity to a subset of parameters in β (denoted by β 1, with the

Linear statistical models and related methods : with applications to social research

Linear statistical models and related methods with applications to social research. Fox J New York, New York, John Wiley and Sons, 1984. xx, 449 p. (Wiley Series in Probability and Mathematical

Conditioning Diagnostics: Collinearity and Weak Data in Regression

Regression Diagnostics: Identifying Influential Data and Sources of Collinearity

This chapter discusses Detecting Influential Observations and Outliers, a method for assessing Collinearity, and its applications in medicine and science.