Abstract Discrete gradient methods are geometric integration techniques that can preserve the dissipative structure of gradient flows. Due to the monotonic decay of the function values, they are well suited for general convex and nonconvex optimization problems. Both zero- and first-order algorithms can be derived from the discrete gradient method by selecting different discrete gradients. In this paper, we present a thorough analysis of the discrete gradient method for optimization that provides a solid theoretical foundation. We show that the discrete gradient method is well-posed by proving the existence of iterates for any positive time step, as well as uniqueness in some cases, and propose an efficient method for solving the associated discrete gradient equation. Moreover, we establish an $\text{O}(1/k)$ convergence rate for convex objectives and prove linear convergence if instead the Polyak–Łojasiewicz inequality is satisfied. The analysis is carried out for three discrete gradients—the Gonzalez discrete gradient, the mean value discrete gradient, and the Itoh–Abe discrete gradient—as well as for a randomised Itoh–Abe method. Our theoretical results are illustrated with a variety of numerical experiments, and we furthermore demonstrate that the methods are robust with respect to stiffness.
Read full abstract