Akaike's Information Criterion in Generalized Estimating Equations

  title={Akaike's Information Criterion in Generalized Estimating Equations},
  author={Wei Pan},
  • W. Pan
  • Published 1 March 2001
  • Mathematics
  • Biometrics
Summary. Correlated response data are common in biomedical studies. Regression analysis based on the generalized estimating equations (GEE) is an increasingly important method for such data. However, there seem to be few model‐selection criteria available in GEE. The well‐known Akaike Information Criterion (AIC) cannot be directly applied since AIC is based on maximum likelihood estimation while GEE is nonlikelihood based. We propose a modification to AIC, where the likelihood is replaced by… 

Tables from this paper

A Modified Model-Selection Criteria in a Generalised Estimating Equation for Latent Class Regression Models

In recent years, generalised estimating equations (GEEs) have played an important role in many fields of research, such as biomedicine. In this paper, we use GEEs for latent class regression (LCR)

Model Selection Using Modified Akaike’s Information Criterion: An Application to Maternal Morbidity Data

The most commonly used model selection criterion, Akaike’s Information Criterion (AIC), cannot be used when the Generalized Estimating Equations (GEE) approach is considered for analyzing

Variable Selection and Model Averaging for Longitudinal Data Incorporating GEE Approach

The Akaike Criterion, which is based on maximum likelihood estimation and cannot be applied directly to the situations when likelihood functions are not available, has been modified for variable

QIC Program and Model Selection in GEE Analyses

The generalized estimating equation (GEE) approach is a widely used statistical method in the analysis of longitudinal data in clinical and epidemiological studies. It is an extension of the

Penalized Empirical Likelihood-Based Variable Selection for Longitudinal Data Analysis

A large number of covariates have become common in many applications such as epidemiology, clinical research, and therapeutic evaluation and the identification of a sub-model responsible for these covariates is a natural step in the development of a model.


The generalized estimating equation (GEE) approach is becoming more and more popular in handling correlated response data, for example in longitudi- nal studies. An attractive property of the GEE is

Correlation structure and variable selection in generalized estimating equations via composite likelihood information criteria

The weighted scores method for generalized linear model margins is introduced, which weight the univariate score equations using a working discretized multivariate normal model that is a proper multivariate model.

Selection of Working Correlation Structure in Generalized Estimating Equations via Empirical Likelihood

Generalized estimating equations (GEE) are a popular class of models for analyzing discrete longitudinal data, and do not require the specification of a full likelihood. The GEE estimator for the

Influence measures based on the volume of confidence ellipsoids for GEE

This article presents an extension of generalized estimating equations measures that is valid for correlated-measures regression analysis using GEEs and illustrated by an analysis of epileptic seizure count data arising from a study of prograbide as an adjuvant therapy for partial seizures and some simulated data sets.

Generalized Estimating Equations in Longitudinal Data Analysis: A Review and Recent Developments

A systematic review on GEE including basic concepts as well as several recent developments due to practical challenges in real applications, including the selection of “working” correlation structure, sample size and power calculation, and the issue of informative cluster size are covered.



A caveat concerning independence estimating equations with multivariate binary data.

This paper considers logistic regression models for multivariate binary responses, where the association between the responses is largely regarded as a nuisance characteristic of the data, and considers the estimator based on independence estimating equations (IEE), which assumes that the responses are independent.

Goodness-of-fit tests for GEE modeling with binary responses.

This work proposes model-based and robust (empirically corrected) goodness-of-fit tests for GEE modeling with binary responses based on partitioning the space of covariates into distinct regions and forming score statistics that are asymptotically distributed as chi-square random variables with the appropriate degrees of freedom.

Models for longitudinal data: a generalized estimating equation approach.

This article discusses extensions of generalized linear models for the analysis of longitudinal data in which heterogeneity in regression parameters is explicitly modelled and uses a generalized estimating equation approach to fit both classes of models for discrete and continuous outcomes.

Longitudinal data analysis using generalized linear models

SUMMARY This paper proposes an extension of generalized linear models to the analysis of longitudinal data. We introduce a class of estimating equations that give consistent estimates of the

A cautionary note on inference for marginal regression models with longitudinal data and general correlated response data

Inference for cross-sectional models using longitudinal data, can be accomplished with generalized estimating equations (Zeger and Liang, 1992). We show that either a diagonal working covariance

Longitudinal data analysis for discrete and continuous outcomes.

A class of generalized estimating equations (GEEs) for the regression parameters is proposed, extensions of those used in quasi-likelihood methods which have solutions which are consistent and asymptotically Gaussian even when the time dependence is misspecified as the authors often expect.

Estimating Logistic Regression Parameters for Bivariate Binary Data

SUMMARY Consider bivariate binary data, with possibly different covariates for each marginal binary observation. Suppose that the correlation between paired observations is a nuisance, and the

Approximate likelihood ratios for general estimating functions

SUMMARY The method of estimating functions (Godambe, 1991) is commonly used when one desires to conduct inference about some parameters of interest but the full distribution of the observations is

Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method

SUMMARY To define a likelihood we have to specify the form of distribution of the observations, but to define a quasi-likelihood function we need only specify a relation between the mean and variance

Subset Selection in Regression

OBJECTIVES Prediction, Explanation, Elimination or What? How Many Variables in the Prediction Formula? Alternatives to Using Subsets 'Black Box' Use of Best-Subsets Techniques LEAST-SQUARES