Step-size Rule Research Articles

In this paper, a nonsmooth optimization method for locally Lipschitz functions on real algebraic varieties is developed. To this end, the set-valued map $\varepsilon$-conditional subdifferential $x\to \partial_{\varepsilon}^{N} f(x):= \partial_{\varepsilon}f(x)+N(x)$ is introduced, where $\partial_{\varepsilon}f(x)$ is the Goldstein-$\varepsilon$-subdifferential and $N(x)$ is a closed convex cone at $x$. It is proved that negative of the shortest $\varepsilon$-conditional subgradient provides a descent direction in $T(x)$, which denotes the polar of $N(x)$. The $\varepsilon$-conditional subdifferential at an iterate $x_{\ell}$ can be approximated by a convex hull of a finite set of projected gradients at sampling points in $x_\ell+\varepsilon_{\ell} B_{T(x_{\ell})}(0,1)$ to $T(x_{\ell})$, where $T(x_{\ell})$ is a linear space in the Bouligand tangent cone and $ B_{T(x_{\ell})}(0,1)$ denotes the unit ball in $T(x_{\ell})$. The negative of the shortest vector in this convex hull is shown to be a descent direction in the Bouligand tangent cone at $x_{\ell}$. The proposed algorithm makes a step along this descent direction with a certain step-size rule, followed by a retraction to lift back to points on the algebraic variety $\mathcal{M}$. The convergence of the resulting algorithm to a critical point is proved. For numerical illustration, the considered method is applied to some nonsmooth problems on varieties of low-rank matrices $\mathcal{M}_{\leq r}$ of real $M\times N$ matrices of rank at most $r$, specifically robust low-rank matrix approximation and recovery in the presence of outliers.

Read full abstract

The incremental gradient method is a prominent algorithm for minimizing a finite sum of smooth convex functions, used in many contexts including large-scale data processing applications and distributed optimization over networks. It is a first-order method that processes the functions one at a time based on their gradient information. The incremental Newton method, on the other hand, is a second-order variant which exploits additionally the curvature information of the underlying functions and can therefore be faster. In this paper, we focus on the case when the objective function is strongly convex and present fast convergence results for the incremental gradient and incremental Newton methods under the constant and diminishing stepsizes. For a decaying stepsize rule $\alpha_k = \Theta(1/k^s)$ with $s \in (0,1]$, we show that the distance of the IG iterates to the optimal solution converges at rate ${\cal O}(1/k^{s})$ (which translates into ${\cal O}(1/k^{2s})$ rate in the suboptimality of the objective value). For $s>1/2$, this improves the previous ${\cal O}(1/\sqrt{k})$ results in distances obtained for the case when functions are non-smooth. We show that to achieve the fastest ${\cal O}(1/k)$ rate, incremental gradient needs a stepsize that requires tuning to the strong convexity parameter whereas the incremental Newton method does not. The results are based on viewing the incremental gradient method as a gradient descent method with gradient errors, devising efficient upper bounds for the gradient error to derive inequalities that relate distances of the consecutive iterates to the optimal solution and finally applying Chung's lemmas from the stochastic approximation literature to these inequalities to determine their asymptotic behavior. In addition, we construct examples to show tightness of our rate results.

Read full abstract

Step-size Rule Research Articles

Related Topics

Articles published on Step-size Rule

A linearly convergent doubly stochastic Gauss–Seidel algorithm for solving linear equations and a certain class of over-parameterized optimization problems

Non-stationary Douglas–Rachford and alternating direction method of multipliers: adaptive step-sizes and convergence

Modified extragradient-like algorithms with new stepsizes for variational inequalities

A Gradient Sampling Method on Algebraic Varieties and Application to Nonsmooth Low-Rank Optimization

Extremum Seeking Algorithms based on Non-Commutative Maps

Convergence Rate of Incremental Gradient and Incremental Newton Methods

Self-adaptive iterative method for solving boundedly Lipschitz continuous and strongly monotone variational inequalities

Tseng type methods for solving inclusion problems and its applications

A convergence analysis of the method of codifferential descent

Membership overlay design optimization with resource constraints (accelerated on GPU)

String-averaging incremental stochastic subgradient algorithms

Totally relaxed, self-adaptive algorithm for solving variational inequalities over the intersection of sub-level sets

Abstract convergence theorem for quasi-convex optimization problems with applications

Strong convergence of a double projection-type method for monotone variational inequalities in Hilbert spaces

An Incremental Subgradient Method on Riemannian Manifolds

Decentralized Frank–Wolfe Algorithm for Convex and Nonconvex Problems

Weak and strong convergence theorems for variational inequality problems

On the convergence of s-dependent GFR conjugate gradient method for unconstrained optimization

On the worst-case evaluation complexity of non-monotone line search algorithms

Numerical algorithms for scatter-to-attenuation reconstruction in PET: empirical comparison of convergence, acceleration, and the effect of subsets.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Step-size Rule Research Articles

Related Topics

Articles published on Step-size Rule

A linearly convergent doubly stochastic Gauss–Seidel algorithm for solving linear equations and a certain class of over-parameterized optimization problems

Non-stationary Douglas–Rachford and alternating direction method of multipliers: adaptive step-sizes and convergence

Modified extragradient-like algorithms with new stepsizes for variational inequalities

A Gradient Sampling Method on Algebraic Varieties and Application to Nonsmooth Low-Rank Optimization

Extremum Seeking Algorithms based on Non-Commutative Maps

Convergence Rate of Incremental Gradient and Incremental Newton Methods

Self-adaptive iterative method for solving boundedly Lipschitz continuous and strongly monotone variational inequalities

Tseng type methods for solving inclusion problems and its applications

A convergence analysis of the method of codifferential descent

Membership overlay design optimization with resource constraints (accelerated on GPU)

String-averaging incremental stochastic subgradient algorithms

Totally relaxed, self-adaptive algorithm for solving variational inequalities over the intersection of sub-level sets

Abstract convergence theorem for quasi-convex optimization problems with applications

Strong convergence of a double projection-type method for monotone variational inequalities in Hilbert spaces

An Incremental Subgradient Method on Riemannian Manifolds

Decentralized Frank–Wolfe Algorithm for Convex and Nonconvex Problems

Weak and strong convergence theorems for variational inequality problems

On the convergence of s-dependent GFR conjugate gradient method for unconstrained optimization

On the worst-case evaluation complexity of non-monotone line search algorithms

Numerical algorithms for scatter-to-attenuation reconstruction in PET: empirical comparison of convergence, acceleration, and the effect of subsets.