Coefficients of Determination in Logistic Regression Models—A New Proposal: The Coefficient of Discrimination

  title={Coefficients of Determination in Logistic Regression Models—A New Proposal: The Coefficient of Discrimination},
  author={Tue Tjur},
  journal={The American Statistician},
  pages={366 - 372}
  • T. Tjur
  • Published 2009
  • Mathematics
  • The American Statistician
Many analogues to the coefficient of determination R2 in ordinary regression models have been proposed in the context of logistic regression. Our starting point is a study of three definitions related to quadratic measures of variation. We discuss the properties of these statistics, and show that the family can be extended in a natural way by a fourth statistic with an even simpler interpretation, namely the difference between the averages of fitted values for successes and failures… Expand
Coefficient of determination for multiple measurement error models
The goodness of fit statistics based on the variants of R^2 for multiple measurement error models have been proposed in this paperBased on the utilization of the two forms of additional information from outside the sample, the known covariance matrix of measurement errors associated with the explanatory variables and the known reliability matrix associated withThe explanatory variables. Expand
R2 measures for zero-inflated regression models for count data with excess zeros
ABSTRACT Generalized linear models are often used to analyse discrete data. There are many proposed measures for this class of models. For loglinear models for count data, Cameron and Windmeijer [AnExpand
Measures of Fit for Logistic Regression
One of the most common questions about logistic regression is “How do I know if my model fits the data?” There are many approaches to answering this question, but they generally fall into twoExpand
On resampling methods for model assessment in penalized and unpenalized logistic regression
Penalized logistic regression methods are frequently used to investigate the relationship between a binary outcome and a set of explanatory variables. The model performance can be assessed byExpand
Standard Binary Logistic Regression Model
The logistic regression model is a type of predictive model that can be used when the response variable is binary—for example: live/die; disease/no disease; purchase/no purchase; win/lose. In short,Expand
Discrimination slope and integrated discrimination improvement - properties, relationships and impact of calibration.
It is demonstrated that simple recalibration ascertaining calibration in-the-large and calibration slope equal to 1 are not sufficient to correct for some forms of mis-calibration, and it is concluded that R-squared metrics, including the discrimination slope, offer an attractive choice for quantifying model performance as long as one accounts for their sensitivity to model calibration. Expand
Classification Using A Functional Partial Least Squares Logistic Regression Method
Statistical analysis of functional data has been explored extensively over the last decade and functional partial least squares regression has emerged as a popular choice for classification problems.Expand
A new Gini correlation between quantitative and qualitative variables
We propose a new Gini correlation to measure dependence between a categorical and numerical variables. Analogous to Pearson $R^2$ in ANOVA model, the Gini correlation is interpreted as the ratio ofExpand
A general and simple method for obtaining R2 from generalized linear mixed-effects models
Summary The use of both linear and generalized linear mixed-effects models (LMMs and GLMMs) has become popular not only in social and medical sciences, but also in biological sciences, especiallyExpand
Incorporating the Multinomial Logistic Regression in Vehicle Crash Severity Modeling: A Detailed Overview
Multinomial logistic regression (MNL) is an attractive statistical approach in modeling the vehicle crash severity as it does not require the assumption of normality, linearity, or homoscedasticityExpand


Coefficients of Determination for Multiple Logistic Regression Analysis
Abstract Coefficients of determination for continuous predicted values (R 2 analogs) in logistic regression are examined for their conceptual and mathematical similarity to the familiar R 2 statisticExpand
An R-squared measure of goodness of fit for some common nonlinear regression models
Abstract For regression models other than the linear model, R -squared type goodness-of-fit summary statistics have been constructed for particular models using a variety of methods. We propose an RExpand
Summarizing the predictive power of a generalized linear model.
This paper studies summary measures of the predictive power of a generalized linear model, paying special attention to a generalization of the multiple correlation coefficient from ordinary linearExpand
A Comment on the Coefficient of Determination for Binary Responses
Abstract Linear logistic or probit regression can be closely approximated by an unweighted least squares analysis of the regression linear in the conditional probabilities provided that theseExpand
A comparison of goodness-of-fit tests for the logistic regression model.
An examination of the performance of the tests when the correct model has a quadratic term but a model containing only the linear term has been fit shows that the Pearson chi-square, the unweighted sum-of-squares, the Hosmer-Lemeshow decile of risk, the smoothed residual sum- of-Squares and Stukel's score test, have power exceeding 50 per cent to detect moderate departures from linearity. Expand
Applied Logistic Regression
Applied Logistic Regression, Third Edition provides an easily accessible introduction to the logistic regression model and highlights the power of this model by examining the relationship between a dichotomous outcome and a set of covariables. Expand
On Measures of Monotone Association
Abstract A family of coefficients for measuring monotone association is presented. These include measures of association of ordinal or interval variables such as gamma of Goodman and Kruskal,Expand
A note on a general definition of the coefficient of determination
SUMMARY A generalization of the coefficient of determination R2 to general regression models is discussed. A modification of an earlier definition to allow for discrete models is proposed.
The analysis of binary data
  • D. Cox
  • Computer Science, Mathematics
  • 1970
Binary response variables special logistical analyses some complications some related approaches more complex responses. Appendices: Theoretical background Choice of explanatory variables in multipleExpand
Applied Logistic Regression (2nd ed.)
  • Wiley Series in Probability and Statistics,
  • 2000