Improved Shrinkage Estimation of Squared Multiple Correlation Coefficient and Squared Cross-Validity Coefficient

  title={Improved Shrinkage Estimation of Squared Multiple Correlation Coefficient and Squared Cross-Validity Coefficient},
  author={Gwowen Shieh},
  journal={Organizational Research Methods},
  pages={387 - 407}
  • G. Shieh
  • Published 1 April 2008
  • Mathematics
  • Organizational Research Methods
The sample squared multiple correlation coefficient is widely used for describing the usefulness of a multiple linear regression model in many areas of science. In this article, the author considers the problem of estimating the squared multiple correlation coefficient and the squared cross-validity coefficient under the assumption that the response and predictor variables have a joint multinormal distribution. Detailed numerical investigations are conducted to assess the exact bias and mean… 

Figures and Tables from this paper

Exact Analysis of Squared Cross-Validity Coefficient in Predictive Regression Models
  • G. Shieh
  • Business
    Multivariate behavioral research
  • 2009
All the currently available exact methods for interval estimation, power calculation, and sample size determination of the squared multiple correlation coefficient are naturally modified and extended to the analysis of the squares cross-validity coefficient.
Estimation of the simple correlation coefficient
  • G. Shieh
  • Mathematics
    Behavior research methods
  • 2010
The results reveal specific situations in which the sample correlation coefficient performs better than the unbiased and nearly unbiased estimators, facilitating recommendation of r as an effect size index for the strength of linear association between two variables.
Investigating bias in squared regression structure coefficients
Using data from a Monte Carlo simulation, this study found that squared regression structure coefficients corrected with Pratt's formula produced less biased estimates and might be more accurate and stable estimates of population squared regressionructure coefficients than estimates with no such corrections.
Sample size requirements for interval estimation of the strength of association effect sizes in multiple regression analysis.
The simulation results showed that the sample size procedures proposed by Bonett and Wright for precise interval estimation of the squared multiple correlation coefficient showed that their simple method for attaining the desired precision of expected width provides satisfactory results only when sample sizes are large.
Improving on Adjusted R-Squared
  • J. Karch
  • Mathematics
    Collabra: Psychology
  • 2020
The amount of variance explained is widely reported for quantifying the model fit of a multiple linear regression model. The default adjusted R-squared estimator has the disadvantage of not being
A Cautionary Note on the Finite Sample Behavior of Maximal Reliability
It is demonstrated that the composite reliability estimator for equally weighted composite exhibits substantially less bias, which makes it a more appropriate choice for the small-sample case.
Further Consideration of the Use of Narrow Trait Scales
This study examined whether inclusion of narrow personality trait scales improved prediction of job performance enough to overcome increased capitalization on chance characteristics of the sample.
Standardized Regression Coefficients and Newly Proposed Estimators for [Formula: see text] in Multiply Imputed Data.
Simulations show that the proposed pooled standardized coefficient estimates are less biased than two earlier proposed pooled estimates, and that their 95% confidence intervals produce coverage close to the theoretical 95%.
Choosing the Best Correction Formula for the Pearson r 2Effect Size
In the present Monte Carlo simulation study, the authors compared bias and precision of 7 sampling error corrections to the Pearson r 2 under 6 × 3 × 6 conditions (i.e., population ρ values of 0.0,
Standardized Regression Coefficients and Newly Proposed Estimators for $${R}^{{2}}$$R2 in Multiply Imputed Data
Whenever statistical analyses are applied to multiply imputed datasets, specific formulas are needed to combine the results into one overall analysis, also called combination rules. In the context of


Estimating R 2 Shrinkage in Multiple Regression: A Comparison of Different Analytical Methods
Abstract The effectiveness of various analytical formulas for estimating R 2 shrinkage in multiple regression analysis was investigated. Two categories of formulas were identified: estimators of the
Cross-Validation Sample Sizes
The squared cross-validity coefficient is a measure of the predictive validity of a sample linear prediction equation. It provides a more realistic assessment of the usefulness of the equation than
The squared correlation coefficient, w2, between an empirically chosen linear function of predictors, B0 + B′x, and a criterion, y, is employed as a measure of predictive precision. This coefficient
Confidence Intervals, Power Calculation, and Sample Size Estimation for the Squared Multiple Correlation Coefficient under the Fixed and Random Regression Models: A Computer Program and Useful Standard Tables
In this article, the authors introduce a computer package written for Mathematica, the purpose of which is to perform a number of difficult iterative functions with respect to the squared multiple
Multiple Regression and Validity Estimation in One Sample
This study empirically investigated equations for estimating the value of the multiple correlation co efficient in the population underlying a sample and the value of the population validity
Methodology Review: Estimation of Population Validity and Cross-Validity, and the Use of Equal Weights in Prediction
In multiple regression, optimal linear weights are obtained using an ordinary least squares (OLS) procedure. However, these linear weighted combinations of predictors may not optimally predict the
Exact Interval Estimation, Power Calculation, and Sample Size Determination in Normal Correlation Analysis
This paper considers the problem of analysis of correlation coefficients from a multivariate normal population. A unified theorem is derived for the regression model with normally distributed
A New Maximum Likelihood Estimator for the Population Squared Multiple Correlation
Using maximum likelihood estimation as described by R. A. Fisher (1912), a new estimator for the population squared multiple correlation was developed. This estimator ( ρcirc;2 (ML) ) was derived by
Sample Size Tables for Correlation Analysis with Applications in Partial Correlation and Multiple Regression Analysis
SAS and SPSS computer programs are made available to permit researchers to select sample size for levels of accuracy, probabilities, and parameter values and for Type I error rates other than those used in constructing the tables.
Accuracy of Population Validity and Cross-Validity Estimation: An Empirical Comparison of Formula-Based, Traditional Empirical, and Equal Weights Procedures
An empirical monte carlo study was performed using predictor and criterion data from 84,808 U.S. Air Force enlistees. 501 samples were drawn for each of seven sample size conditions: 25, 40, 60, 80,