Composite Convex Research Articles

Stochastic gradient descent (SGD) holds as a classical method to build large scale machine learning models over big data. A stochastic gradient is typically calculated from a limited number of samples (known as mini-batch), which potentially incurs a high variance and causes the estimated parameters to bounce around the optimal solution. To improve the stability of stochastic gradient, recent years have witnessed the proposal of several semi-stochastic gradient descent algorithms, which distinguish themselves from standard SGD by incorporating global information into gradient computation. In this paper, we contribute a novel stratified semi-stochastic gradient descent (S3GD) algorithm to this nascent research area, accelerating the optimization of a large family of composite convex functions. Though theoretically converging faster, prior semi-stochastic algorithms are found to suffer from high iteration complexity, which makes them even slower than SGD in practice on many datasets. In our proposed S3GD, the semi-stochastic gradient is calculated based on efficient manifold propagation, which can be numerically accomplished by sparse matrix multiplications. This way S3GD is able to generate a highly-accurate estimate of the exact gradient from each mini-batch with largely-reduced computational complexity. Theoretic analysis reveals that the proposed S3GD elegantly balances the geometric algorithmic convergence rate against the space and time complexities during the optimization. The efficacy of S3GD is also experimentally corroborated on several large-scale benchmark datasets.

Read full abstract

Motivated by big data applications, first-order methods have been extremely popular in recent years. However, naive gradient methods generally converge slowly. Hence, much effort has been made to accelerate various first-order methods. This paper proposes two accelerated methods towards solving structured linearly constrained convex programming, for which we assume composite convex objective that is the sum of a differentiable function and a possibly nondifferentiable one. The first method is the accelerated linearized augmented Lagrangian method (LALM). At each update to the primal variable, it allows linearization to the differentiable function and also the augmented term, and thus it enables easy subproblems. Assuming merely convexity, we show that LALM owns $O(1/t)$ convergence if parameters are kept fixed during all the iterations and can be accelerated to $O(1/t^2)$ if the parameters are adapted, where $t$ is the number of total iterations. The second method is the accelerated linearized alternating direction method of multipliers (LADMM). In addition to the composite convexity, it further assumes two-block structure on the objective. Different from classic alternating direction method of multipliers, our method allows linearization to the objective and also augmented term to make the update simple. Assuming strong convexity on one block variable, we show that LADMM also enjoys $O(1/t^2)$ convergence with adaptive parameters. This result is a significant improvement over that in [Goldstein et. al, SIAM J. Imag. Sci., 7 (2014), pp. 1588--1623], which requires strong convexity on both block variables and no linearization to the objective or augmented term. Numerical experiments are performed on quadratic programming, image denoising, and support vector machine. The proposed accelerated methods are compared to nonaccelerated ones and also existing accelerated methods. The results demonstrate the validity of acceleration and superior performance of the proposed methods over existing ones.

Read full abstract

Composite Convex Research Articles

Related Topics

Articles published on Composite Convex

Stochastic Gradient Made Stable: A Manifold Propagation Approach for Large-Scale Optimization

Accelerated First-Order Primal-Dual Proximal Methods for Linearly Constrained Composite Convex Programming

Exact Worst-Case Performance of First-Order Methods for Composite Convex Optimization

An accelerated non-Euclidean hybrid proximal extragradient-type algorithm for convex–concave saddle-point problems

A splitting primal-dual proximity algorithm for solving composite optimization problems

Approximate Optimality Conditions for Composite Convex Optimization Problems

Adaptive smoothing algorithms for nonsmooth composite convex minimization

The stable duality of DC programs for composite convex functions

MAGMA: Multilevel Accelerated Gradient Mirror Descent Algorithm for Large-Scale Convex Composite Minimization

An adaptive accelerated first-order method for convex optimization

The Stable Farkas Lemma for composite convex functions in infinite dimensional spaces

A Parallel Line Search Subspace Correction Method for Composite Convex Optimization

Accelerating Block-Decomposition First-Order Methods for Solving Composite Saddle-Point and Two-Player Nash Equilibrium Problems

A total variation based nonrigid image registration by combining parametric and non-parametric transformation models

Fast First-Order Methods for Composite Convex Optimization with Backtracking

Comments on: Farkas’ lemma: three decades of generalizations for mathematical optimization

From the Farkas Lemma to the Hahn--Banach Theorem

The Toland-Fenchel-Lagrange duality of DC programs for composite convex functions

Characterizations of Asymptotic Cone of the Solution Set of a Composite Convex Optimization Problem

An Inexact Accelerated Proximal Gradient Method for Large Scale Linearly Constrained Convex SDP

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Composite Convex Research Articles

Related Topics

Articles published on Composite Convex

Stochastic Gradient Made Stable: A Manifold Propagation Approach for Large-Scale Optimization

Accelerated First-Order Primal-Dual Proximal Methods for Linearly Constrained Composite Convex Programming

Exact Worst-Case Performance of First-Order Methods for Composite Convex Optimization

An accelerated non-Euclidean hybrid proximal extragradient-type algorithm for convex–concave saddle-point problems

A splitting primal-dual proximity algorithm for solving composite optimization problems

Approximate Optimality Conditions for Composite Convex Optimization Problems

Adaptive smoothing algorithms for nonsmooth composite convex minimization

The stable duality of DC programs for composite convex functions

MAGMA: Multilevel Accelerated Gradient Mirror Descent Algorithm for Large-Scale Convex Composite Minimization

An adaptive accelerated first-order method for convex optimization

The Stable Farkas Lemma for composite convex functions in infinite dimensional spaces

A Parallel Line Search Subspace Correction Method for Composite Convex Optimization

Accelerating Block-Decomposition First-Order Methods for Solving Composite Saddle-Point and Two-Player Nash Equilibrium Problems

A total variation based nonrigid image registration by combining parametric and non-parametric transformation models

Fast First-Order Methods for Composite Convex Optimization with Backtracking

Comments on: Farkas’ lemma: three decades of generalizations for mathematical optimization

From the Farkas Lemma to the Hahn--Banach Theorem

The Toland-Fenchel-Lagrange duality of DC programs for composite convex functions

Characterizations of Asymptotic Cone of the Solution Set of a Composite Convex Optimization Problem

An Inexact Accelerated Proximal Gradient Method for Large Scale Linearly Constrained Convex SDP