# Methodology Review: Estimation of Population Validity and Cross-Validity, and the Use of Equal Weights in Prediction

@article{Raju1997MethodologyRE, title={Methodology Review: Estimation of Population Validity and Cross-Validity, and the Use of Equal Weights in Prediction}, author={Nambury S. Raju and Reyhan Bilgiç and Jack E. Edwards and Paul F. Fleer}, journal={Applied Psychological Measurement}, year={1997}, volume={21}, pages={291 - 305} }

In multiple regression, optimal linear weights are obtained using an ordinary least squares (OLS) procedure. However, these linear weighted combinations of predictors may not optimally predict the same criterion in the population from which the sample was drawn (population validity) or other samples drawn from the same population (population cross-validity). To achieve more accurate estimates of population validity and population cross-validity, some researchers and practitioners use formulas…

## 72 Citations

Accuracy of Population Validity and Cross-Validity Estimation: An Empirical Comparison of Formula-Based, Traditional Empirical, and Equal Weights Procedures

- Mathematics
- 1999

An empirical monte carlo study was performed using predictor and criterion data from 84,808 U.S. Air Force enlistees. 501 samples were drawn for each of seven sample size conditions: 25, 40, 60, 80,…

Exact Analysis of Squared Cross-Validity Coefficient in Predictive Regression Models

- Mathematics, MedicineMultivariate behavioral research
- 2009

All the currently available exact methods for interval estimation, power calculation, and sample size determination of the squared multiple correlation coefficient are naturally modified and extended to the analysis of the squares cross-validity coefficient.

Estimates of cross-validity for stepwise regression and with predictor selection

- Mathematics
- 1999

The effects of preselection of predictors (e.g., stepwise regression) on formula estimates of cross-validity were examined. Three actual data sets were used to generate populations of varying sample…

Confidence Intervals for Comparison of the Squared Multiple Correlation Coefficients of Non-nested Models

- Mathematics
- 2012

Multiple linear regression analysis is used widely to evaluate how an outcome or responsevariable is related to a set of predictors. Once a final model is specified, the interpretation of predictors…

DEVELOPMENTS IN THE CRITERION‐RELATED VALIDATION OF SELECTION PROCEDURES: A CRITICAL REVIEW AND RECOMMENDATIONS FOR PRACTICE

- Psychology
- 2008

The use of validated employee selection and promotion procedures is critical to workforce productivity and to the legal defensibility of the personnel decisions made on the basis of those procedures.…

Determining Sample Size for Accurate Estimation of the Squared Multiple Correlation Coefficient

- Medicine, MathematicsMultivariate behavioral research
- 2000

A comparison of the sample sizes reported here with those needed to test the hypothesis of no relationship between the predictor and criterion variables demonstrates the need for researchers to consider the purpose of their research and what is to be reported when determining the sample size for the study.

Sample size requirements for interval estimation of the strength of association effect sizes in multiple regression analysis.

- Physics, MedicinePsicothema
- 2013

The simulation results showed that the sample size procedures proposed by Bonett and Wright for precise interval estimation of the squared multiple correlation coefficient showed that their simple method for attaining the desired precision of expected width provides satisfactory results only when sample sizes are large.

Cross-Validation Sample Sizes

- Mathematics
- 2000

The squared cross-validity coefficient is a measure of the predictive validity of a sample linear prediction equation. It provides a more realistic assessment of the usefulness of the equation than…

Improved Shrinkage Estimation of Squared Multiple Correlation Coefficient and Squared Cross-Validity Coefficient

- Mathematics
- 2008

The sample squared multiple correlation coefficient is widely used for describing the usefulness of a multiple linear regression model in many areas of science. In this article, the author considers…

Correlation Weights in Multiple Regression

- Mathematics
- 2010

A general theory on the use of correlation weights in linear prediction has yet to be proposed. In this paper we take initial steps in developing such a theory by describing the conditions under…

## References

SHOWING 1-10 OF 55 REFERENCES

Use of Empirical Estimates of Shrinkage in Multiple Regression: A Caution

- Mathematics
- 1995

Empirical techniques to estimate the shrinkage of the sample R2 have been advocated as alternatives to analytical formulae. Although such techniques may be appropriate for estimating the coefficient…

Cost-Benefit Considerations in Choosing among Cross-Validation Methods.

- Mathematics
- 1984

There are two general methods of cross-validation: (a) empirical estimation, and (b) formula estimation. In choosing a specific cross-validation procedure, one should consider both costs (eg.…

Estimators of the Squared Cross-Validity Coefficient: A Monte Carlo Investigation

- Mathematics
- 1979

A monte carlo experiment was used to evaluate four procedures for estimating the population squared cross-validity of a sample least squares re gression equation. Four levels of population squared…

Estimation of the Squared Cross-Validity Coefficient in the Context of Best Subset Regression

- Mathematics
- 1988

A monte carlo study was conducted to examine the performance of several strategies for estimating the squared cross-validity coefficient of a sample regres sion equation in the context of best subset…

Multiple Regression and Validity Estimation in One Sample

- Mathematics
- 1978

This study empirically investigated equations for estimating the value of the multiple correlation co efficient in the population underlying a sample and the value of the population validity…

A COMPARISON OF SINGLE SAMPLE AND CROSS‐VALIDATION METHODS FOR ESTIMATING THE MEAN SQUARED ERROR OF PREDICTION IN MULTIPLE LINEAR REGRESSION

- Mathematics
- 1975

Two procedures for estimating the mean squared error of prediction of an empirically determined linear prediction equation are examined. The method usually employed makes use of a second validation…

A Comparison of Five Variable Weighting Procedures

- Computer Science
- 1972

an estimate of the multiple correlation coefficient in the population from which the sample was drawn, and the sample beta weights are taken as estimates of the population beta weights. It is these…

The Relative Efficiency of Regression and Simple Unit Predictor Weights in Applied Differential Psychology

- Mathematics
- 1971

A very common problem in the behavioral and social sciences is the prediction of the standing of a person or thing on one variable, usually designated the criterion, from his or its standing on a…

PREDICTIVE VALIDITY OF A LINEAR REGRESSION EQUATION

- Mathematics
- 1975

The squared correlation coefficient, w2, between an empirically chosen linear function of predictors, B0 + B′x, and a criterion, y, is employed as a measure of predictive precision. This coefficient…

A predictive-validity-based procedure for choosing between regression and equal weights.

- Psychology
- 1978

Abstract The equal weighting procedure has often been found to have more predictive validity than regression. This paper makes use of estimators of the population cross-validated correlation to…