Corpus ID: 88520746

Optimal Subsampling Approaches for Large Sample Linear Regression

  title={Optimal Subsampling Approaches for Large Sample Linear Regression},
  author={Rong Zhu and Ping Ma and Michael W. Mahoney and Bin Yu},
  journal={arXiv: Methodology},
A significant hurdle for analyzing large sample data is the lack of effective statistical computing and inference methods. An emerging powerful approach for analyzing large sample data is subsampling, by which one takes a random subsample from the original full sample and uses it as a surrogate for subsequent computation and estimation. In this paper, we study subsampling methods under two scenarios: approximating the full sample ordinary least-square (OLS) estimator and estimating the… Expand
A statistical perspective of sampling scores for linear regression
Minimax Linear Regression under Measurement Constraints
On Computationally Tractable Selection of Experiments in Measurement-Constrained Regression Models
Bayesian experimental design using regularized determinantal point processes
Polynomial time algorithms for dual volume sampling
Error Analysis of Generalized Nyström Kernel Regression
One Backward from Ten Forward, Subsampling for Large-Scale Deep Learning
Optimal Sampling Algorithms for Block Matrix Multiplication


Faster least squares approximation