Strong Convexity Assumption Research Articles

This paper concerns a high-dimensional stochastic programming (SP) problem of minimizing a function of expected cost with a matrix argument. For this problem, one of the most widely applied solution paradigms is the sample average approximation (SAA), which uses the average cost over sampled scenarios as a surrogate for the expected cost. Traditional SAA theories require the sample size to grow rapidly as the problem dimensionality increases. Indeed, for a problem of optimizing over a p-by-p matrix, the sample complexity of the SAA is \(\widetilde{O}(1)\cdot \frac{p^2}{\epsilon^2}\cdot \mathrm{polylog}(\frac{1}{\epsilon})\) to achieve an \(\epsilon\)-suboptimality gap, for some poly-logarithmic function \(\mathrm{polylog}(\,\cdot\,)\) and some quantity \(\widetilde{O}(1)\) independent of dimensionality p and sample size n. In contrast, this paper considers a regularized SAA (RSAA) with a low-rankness-inducing penalty. We demonstrate that, when the optimal solution to the SP is of low rank, the sample complexity of RSAA is \(\widetilde{O}(1)\cdot \frac{p}{\epsilon^3}\cdot \mathrm{polylog}(p,\,\frac{1}{\epsilon})\), which is almost linear in p and thus indicates a substantially lower dependence on dimensionality. Therefore, RSAA can be more advantageous than SAA, especially for larger-scale and higher-dimensional problems. Due to the close correspondence between stochastic programming and statistical learning, our results also indicate that high-dimensional low-rank matrix recovery is possible generally beyond a linear model, even if the common assumption of restricted strong convexity is completely absent.
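
As a rough illustration of the two rates quoted above, the following sketch compares only the leading terms \(p^2/\epsilon^2\) and \(p/\epsilon^3\). It drops the \(\widetilde{O}(1)\) constants and the poly-logarithmic factors, which is an assumption made here because the abstract does not specify them, and the function names are hypothetical. Under that simplification the ratio of the two terms is \(p\,\epsilon\), so the RSAA term is the smaller one whenever \(p > 1/\epsilon\).

```python
def saa_leading_term(p: int, eps: float) -> float:
    """Leading term p^2 / eps^2 of the SAA sample complexity.

    Assumption: constants and polylog factors are dropped.
    """
    return p**2 / eps**2


def rsaa_leading_term(p: int, eps: float) -> float:
    """Leading term p / eps^3 of the RSAA sample complexity.

    Assumption: constants and polylog factors are dropped.
    """
    return p / eps**3


def leading_term_ratio(p: int, eps: float) -> float:
    """SAA term divided by RSAA term; algebraically this equals p * eps."""
    return saa_leading_term(p, eps) / rsaa_leading_term(p, eps)


if __name__ == "__main__":
    # Hypothetical problem sizes, chosen only for illustration.
    for p, eps in [(5, 0.1), (100, 0.1), (1000, 0.1)]:
        print(p, eps, saa_leading_term(p, eps), rsaa_leading_term(p, eps),
              leading_term_ratio(p, eps))
```

Because the omitted factors can be large, this comparison only indicates how the two bounds scale with p and \(\epsilon\); it does not predict actual sample sizes.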

Statistical preconditioning enables fast methods for distributed large-scale empirical risk minimization problems. In this approach, multiple worker nodes compute gradients in parallel, which are then used by the central node to update the parameter by solving an auxiliary (preconditioned) smaller-scale optimization problem. The recently proposed Statistically Preconditioned Accelerated Gradient (SPAG) method [1] has complexity bounds superior to other such algorithms but requires an exact solution of a computationally intensive auxiliary optimization problem at every iteration. In this paper, we propose an Inexact SPAG (InSPAG) and explicitly characterize the accuracy to which the corresponding auxiliary subproblem needs to be solved to guarantee the same convergence rate as the exact method. We build our results by first developing an inexact adaptive accelerated Bregman proximal gradient method for general optimization problems under relative smoothness and strong convexity assumptions, which may be of independent interest. Moreover, we explore the properties of the auxiliary problem in the InSPAG algorithm assuming Lipschitz third-order derivatives and strong convexity. For this problem class, we develop a linearly convergent Hyperfast second-order method and estimate the total complexity of the InSPAG method with the hyperfast auxiliary problem solver. Finally, we illustrate the proposed method's practical efficiency by performing large-scale numerical experiments on logistic regression models. To the best of our knowledge, these are the first empirical results on implementing high-order methods on large-scale problems: we work with data where the dimension is of the order of 3 million and the number of samples is 700 million.
• Inexact Statistically Preconditioned Accelerated Gradient Method for large-scale distributed convex empirical risk minimization (ERM) problems.
• Hyperfast second-order algorithm for minimizing strongly convex functions with Lipschitz third-order derivatives.
• Inexact adaptive accelerated Bregman proximal gradient method for general optimization problems under relative smoothness and strong convexity assumptions.
• Empirical evidence for the efficiency of tensor optimization methods for large-scale ERM problems.
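
The worker/server pattern described in this abstract can be sketched on a toy problem. The code below is a minimal, non-accelerated illustration with an exactly solved auxiliary problem; it is not SPAG or InSPAG. It assumes scalar quadratic losses \(f_j(x) = \tfrac{1}{2} a_j x^2 - b_j x\), a server reference function \(\phi(x) = f_1(x) + \tfrac{\mu}{2} x^2\), and a relative smoothness constant of 1. All names and data values are hypothetical.

```python
from typing import List, Tuple

# Each worker j holds a scalar quadratic loss f_j(x) = 0.5 * a_j * x**2 - b_j * x.
Worker = Tuple[float, float]  # (a_j, b_j), hypothetical toy data


def worker_gradient(worker: Worker, x: float) -> float:
    """Gradient of one worker's local loss at x."""
    a, b = worker
    return a * x - b


def preconditioned_step(workers: List[Worker], x: float, mu: float) -> float:
    """One Bregman (preconditioned) gradient step on the average loss."""
    # Workers compute local gradients (in parallel in a real system);
    # the server averages them.
    grad = sum(worker_gradient(w, x) for w in workers) / len(workers)
    # The server's reference function phi has curvature a_1 + mu.
    a_server, _ = workers[0]
    # Auxiliary problem: argmin_y grad * y + 0.5 * (a_server + mu) * (y - x)**2.
    # For this toy quadratic it has the closed-form solution below.
    return x - grad / (a_server + mu)


def solve(workers: List[Worker], x0: float = 0.0, mu: float = 0.5,
          iters: int = 50) -> float:
    """Run a fixed number of preconditioned steps from x0."""
    x = x0
    for _ in range(iters):
        x = preconditioned_step(workers, x, mu)
    return x


# Toy data: curvatures a_j are close to each other, as statistical
# preconditioning assumes for losses built from similar samples.
workers = [(1.0, 2.0), (1.2, 2.5), (0.9, 1.5), (1.1, 2.2)]
x_star = sum(b for _, b in workers) / sum(a for a, _ in workers)
x_hat = solve(workers)
```

In this sketch mu must be at least the gap between the server's curvature and the average curvature for the step to be a valid upper model; in higher dimensions the auxiliary problem no longer has a closed form, which is where an inexact inner solver of the kind the abstract proposes would come in.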

Articles published on Strong Convexity Assumption

Bayesian Federated Learning with Hamiltonian Monte Carlo: Algorithm and Theory

Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and Exp-Concave Games with Gradient Feedback

Exponential Convergence of Primal-Dual Dynamics Under General Conditions and Its Application to Distributed Optimization.

Interior point methods in optimal control

Accelerated Projection Algorithm Based on Smoothing Approximation for Distributed Nonsmooth Optimization

Out-of-sample error estimation for M-estimators with convex penalty

A proximal quasi-Newton method based on memoryless modified symmetric rank-one formula

A Decentralized Primal-Dual Method for Constrained Minimization of a Strongly Convex Function

SOLO FTRL ALGORITHM FOR PRODUCTION MANAGEMENT WITH TRANSFER PRICES

Regularized sample average approximation for high-dimensional stochastic optimization under low-rankness

A globally convergent proximal Newton-type method in nonsmooth convex optimization

Primal-dual algorithms for multi-agent structured optimization over message-passing architectures with bounded communication delays

An inexact accelerated stochastic ADMM for separable convex optimization

Hyperfast second-order local solvers for efficient statistically preconditioned distributed optimization

On the convergence of a randomized block coordinate descent algorithm for a matrix least squares problem

Alternating minimization methods for strongly convex optimization

The finite sample properties of sparse M-estimators with pseudo-observations

Stochastic proximal splitting algorithm for composite minimization

On stochastic accelerated gradient with non-strongly convexity

Stability analysis of solutions to equilibrium problems and applications in economics
