# Variance estimation using refitted cross-validation in ultrahigh dimensional regression.

@article{Fan2012VarianceEU, title={Variance estimation using refitted cross-validation in ultrahigh dimensional regression.}, author={Jianqing Fan and Shaojun Guo and Ning Hao}, journal={Journal of the Royal Statistical Society. Series B, Statistical methodology}, year={2012}, volume={74 1}, pages={ 37-65 } }

Variance estimation is a fundamental problem in statistical modelling. In ultrahigh dimensional linear regression where the dimensionality is much larger than the sample size, traditional variance estimation techniques are not applicable. Recent advances in variable selection in ultrahigh dimensional linear regression make this problem accessible. One of the major problems in ultrahigh dimensional regression is the high spurious correlation between the unobserved realized noise and some of the…

## Figures, Tables, and Topics from this paper

## 231 Citations

Variance estimation based on blocked 3×2 cross-validation in high-dimensional linear regression

- MathematicsJournal of Applied Statistics
- 2020

In high-dimensional linear regression, the dimension of variables is always greater than the sample size. In this situation, the traditional variance estimation technique based on ordinary least…

Error Variance Estimation in Ultrahigh-Dimensional Additive Models

- Medicine, MathematicsJournal of the American Statistical Association
- 2018

An accurate estimate for error variance in ultrahigh-dimensional sparse additive model is proposed by effectively integrating sure independence screening and refitted cross-validation techniques and the root n consistency and the asymptotic normality of the resulting estimate are established.

Ultrahigh Dimensional Precision Matrix Estimation via Refitted Cross Validation.

- Medicine, Computer ScienceJournal of econometrics
- 2020

A refitted cross validation (RCV) method for sparse precision matrix estimation based on its Cholesky decomposition, which does not require the Gaussian assumption, can be easily implemented with existing software for ultrahigh dimensional linear regression.

Test for high-dimensional regression coefficients using refitted cross-validation variance estimation

- MathematicsThe Annals of Statistics
- 2018

Testing a hypothesis for high-dimensional regression coefficients is of fundamental importance in the statistical theory and applications. In this paper, we develop a new test for the overall…

Variance estimation in high-dimensional linear models

- Mathematics
- 2014

The residual variance and the proportion of explained variation are important quantities in many statistical models and model fitting procedures. They play an important role in regression diagnostics…

Generalized Fiducial Inference for Ultrahigh-Dimensional Regression

- Mathematics, Computer Science
- 2013

It is shown that statistical inference based on the proposed methods will have correct asymptotic frequentist property and also for constructing confidence intervals for the corresponding parameters of the parameter estimates and model choices.

Estimation of high dimensional mean regression in the absence of symmetry and light tail assumptions.

- Mathematics, MedicineJournal of the Royal Statistical Society. Series B, Statistical methodology
- 2017

The results reveal that the ultra-high dimensional setting, where the dimensionality can grow exponentially with the sample size, the RA-lasso estimator produces a consistent estimator at the same rate as the optimal rate under the light-tail situation.

A Study of Error Variance Estimation in Lasso Regression

- Mathematics
- 2013

Variance estimation in the linear model when $p > n$ is a difficult problem. Standard least squares estimation techniques do not apply. Several variance estimators have been proposed in the…

Estimating the error variance in a high-dimensional linear model

- MathematicsBiometrika
- 2019

The lasso has been studied extensively as a tool for estimating the coefficient vector in the high-dimensional linear model; however, considerably less is known about estimating the error variance…

Estimating the error variance in a high-dimensional linear model

- 2019

The lasso has been studied extensively as a tool for estimating the coefficient vector in the high-dimensional linear model; however, considerably less is known about estimating the error variance in…

## References

SHOWING 1-10 OF 42 REFERENCES

Sure independence screening for ultrahigh dimensional feature space

- Mathematics
- 2006

Summary. Variable selection plays an important role in high dimensional statistical modelling which nowadays appears in many areas and is key to various scientific discoveries. For problems of large…

Bootstrapping Lasso Estimators

- Mathematics
- 2011

In this article, we consider bootstrapping the Lasso estimator of the regression parameter in a multiple linear regression model. It is known that the standard bootstrap method fails to be…

The sparsity and bias of the Lasso selection in high-dimensional linear regression

- Mathematics
- 2008

Meinshausen and Buhlmann [Ann. Statist. 34 (2006) 1436-1462] showed that, for neighborhood selection in Gaussian graphical models, under a neighborhood stability condition, the LASSO is consistent,…

Ultrahigh Dimensional Feature Selection: Beyond The Linear Model

- Mathematics, MedicineJ. Mach. Learn. Res.
- 2009

This paper extends ISIS, without explicit definition of residuals, to a general pseudo-likelihood framework, which includes generalized linear models as a special case and improves ISIS by allowing feature deletion in the iterative process.

Sure independence screening in generalized linear models with NP-dimensionality

- Mathematics
- 2010

Ultrahigh dimensional variable selection plays an increasingly important role in contemporary scientific discoveries and statistical research. Among others, Fan and Lv (2008) propose an independent…

Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties

- Mathematics
- 2001

Variable selection is fundamental to high-dimensional statistical modeling, including nonparametric regression. Many approaches in use are stepwise selection procedures, which can be computationally…

p-Values for High-Dimensional Regression

- Mathematics
- 2008

Assigning significance in high-dimensional regression is challenging. Most computationally efficient selection algorithms cannot guard against inclusion of noise variables. Asymptotically valid…

Penalized regression, standard errors, and Bayesian lassos

- Mathematics
- 2010

Penalized regression methods for simultaneous variable selection and coe-cient estimation, especially those based on the lasso of Tibshirani (1996), have received a great deal of attention in recent…

The group lasso for logistic regression

- Mathematics
- 2008

The group lasso is an extension of the lasso to do variable selection on (predefined) groups of variables in linear regression models. The estimates have the attractive property of being invariant…

Nonconcave Penalized Likelihood With NP-Dimensionality

- Mathematics, Computer ScienceIEEE Transactions on Information Theory
- 2011

It is shown that in the context of generalized linear models, such methods possess model selection consistency with oracle properties even for dimensionality of nonpolynomial order of sample size, for a class of penalized likelihood approaches using folded-concave penalty functions, which were introduced to ameliorate the bias problems of convex penalty functions.