Stochastic Recursive Gradient Algorithm for Nonconvex Optimization
@article{Nguyen2017StochasticRG, title={Stochastic Recursive Gradient Algorithm for Nonconvex Optimization}, author={Lam M. Nguyen and J. Liu and K. Scheinberg and Martin Tak{\'a}{\v{c}}}, journal={ArXiv}, year={2017}, volume={abs/1705.07261} }
In this paper, we study and analyze the mini-batch version of StochAstic Recursive grAdient algoritHm (SARAH), a method employing the stochastic recursive gradient, for solving empirical loss minimization for the case of nonconvex losses. We provide a sublinear convergence rate (to stationary points) for general nonconvex functions and a linear convergence rate for gradient dominated functions, both of which have some advantages compared to other modern stochastic gradient algorithms for…
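The abstract names the stochastic recursive gradient without spelling it out. As a rough illustration, the sketch below uses the SARAH-style update, in which the previous estimate `v` is corrected by a mini-batch average of gradient differences between consecutive iterates. The function name `sarah_minibatch`, the scalar parameter `w`, the step size, batch size, loop lengths, the toy loss, and the choice of restarting each outer loop from the last inner iterate are all assumptions made for this example; none of them are taken from the paper.

```python
import random


def sarah_minibatch(grad_i, w0, n, step=0.05, batch=2, inner=20, outer=5, seed=0):
    """Illustrative mini-batch SARAH loop for a scalar parameter w.

    grad_i(i, w) must return the gradient of the i-th loss at w.
    All default values are placeholders chosen for this sketch.
    """
    rng = random.Random(seed)
    w = w0
    for _ in range(outer):
        # Outer step: compute the full gradient once, then take a plain step.
        v = sum(grad_i(i, w) for i in range(n)) / n
        w_prev, w = w, w - step * v
        for _ in range(inner):
            idx = [rng.randrange(n) for _ in range(batch)]
            # Recursive estimate: previous v plus the mini-batch average of
            # gradient differences between the current and previous iterate.
            v = sum(grad_i(i, w) - grad_i(i, w_prev) for i in idx) / batch + v
            w_prev, w = w, w - step * v
    return w


# Hypothetical nonconvex toy losses: f_i(w) = u^2 / (1 + u^2), u = w - targets[i].
targets = [1.0, 2.0, 3.0]


def toy_grad(i, w):
    u = w - targets[i]
    return 2.0 * u / (1.0 + u * u) ** 2


w_final = sarah_minibatch(toy_grad, w0=2.5, n=len(targets))
print(w_final)
```

The point of the recursion is that the full gradient is only computed once per outer loop, while the inner steps reuse and update `v` from small batches.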
58 Citations
Complexities in Projection-Free Stochastic Non-convex Minimization
- Mathematics, Computer Science
- AISTATS
- 2019
- 16
Stochastic Proximal Gradient Methods for Non-smooth Non-Convex Regularized Problems.
- Mathematics
- 2019
- 5
- Highly Influenced
Momentum with Variance Reduction for Nonconvex Composition Optimization
- Computer Science, Mathematics
- ArXiv
- 2020
- 2
- Highly Influenced
On the step size selection in variance-reduced algorithm for nonconvex optimization
- Computer Science
- Expert Syst. Appl.
- 2021
A linearly convergent stochastic recursive gradient method for convex optimization
- Mathematics, Computer Science
- Optim. Lett.
- 2020
- 1
Characterization of Convex Objective Functions and Optimal Expected Convergence Rates for SGD
- Computer Science, Mathematics
- ICML
- 2019
- 5
Stochastic Nested Variance Reduction for Nonconvex Optimization
- Mathematics, Computer Science
- J. Mach. Learn. Res.
- 2020
- 52
References
SHOWING 1-10 OF 28 REFERENCES
Stochastic First- and Zeroth-Order Methods for Nonconvex Stochastic Programming
- Mathematics, Computer Science
- SIAM J. Optim.
- 2013
- 646
- Highly Influential
Stochastic Variance Reduction for Nonconvex Optimization
- Mathematics, Computer Science
- ICML
- 2016
- 374
- Highly Influential
Cubic regularization of Newton method and its global performance
- Mathematics, Computer Science
- Math. Program.
- 2006
- 589
SARAH: A Novel Method for Machine Learning Problems Using Stochastic Recursive Gradient
- Computer Science, Mathematics
- ICML
- 2017
- 232
Accelerating Stochastic Gradient Descent using Predictive Variance Reduction
- Computer Science, Mathematics
- NIPS
- 2013
- 1,686
Mini-Batch Semi-Stochastic Gradient Descent in the Proximal Setting
- Mathematics, Computer Science
- IEEE Journal of Selected Topics in Signal Processing
- 2016
- 198
Minimizing finite sums with the stochastic average gradient
- Mathematics, Computer Science
- Math. Program.
- 2017
- 799
- Highly Influential
Adaptive Subgradient Methods for Online Learning and Stochastic Optimization
- Computer Science, Mathematics
- J. Mach. Learn. Res.
- 2011
- 6,536
- Highly Influential