Saddle Property Research Articles

The matrix sensing problem is an important low-rank optimization problem that has found a wide range of applications, such as matrix completion, phase synchornization/retrieval, robust principal component analysis (PCA), and power system state estimation. In this work, we focus on the general matrix sensing problem with linear measurements that are corrupted by random noise. We investigate the scenario where the search rank r is equal to the true rank [Formula: see text] of the unknown ground truth (the exact parametrized case), as well as the scenario where r is greater than [Formula: see text] (the overparametrized case). We quantify the role of the restricted isometry property (RIP) in shaping the landscape of the nonconvex factorized formulation and assisting with the success of local search algorithms. First, we develop a global guarantee on the maximum distance between an arbitrary local minimizer of the nonconvex problem and the ground truth under the assumption that the RIP constant is smaller than [Formula: see text]. We then present a local guarantee for problems with an arbitrary RIP constant, which states that any local minimizer is either considerably close to the ground truth or far away from it. More importantly, we prove that this noisy, overparametrized problem exhibits the strict saddle property, which leads to the global convergence of perturbed gradient descent algorithm in polynomial time. The results of this work provide a comprehensive understanding of the geometric landscape of the matrix sensing problem in the noisy and overparametrized regime. Funding: This work was supported by grants from the National Science Foundation, Office of Naval Research, Air Force Office of Scientific Research, and Army Research Office.

Read full abstract

This paper considers general rank-constrained optimization problems that minimize a general objective function <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">${f}( {X})$ </tex-math></inline-formula> over the set of rectangular <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">${n}\times {m}$ </tex-math></inline-formula> matrices that have rank at most r. To tackle the rank constraint and also to reduce the computational burden, we factorize <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$ {X}$ </tex-math></inline-formula> into <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$ {U} {V} ^{\mathrm {T}}$ </tex-math></inline-formula> where <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$ {U}$ </tex-math></inline-formula> and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$ {V}$ </tex-math></inline-formula> are <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">${n}\times {r}$ </tex-math></inline-formula> and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">${m}\times {r}$ </tex-math></inline-formula> matrices, respectively, and then optimize over the small matrices <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$ {U}$ </tex-math></inline-formula> and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$ {V}$ </tex-math></inline-formula> . We characterize the global optimization geometry of the nonconvex factored problem and show that the corresponding objective function satisfies the robust strict saddle property as long as the original objective function f satisfies restricted strong convexity and smoothness properties, ensuring global convergence of many local search algorithms (such as noisy gradient descent) in polynomial time for solving the factored problem. We also provide a comprehensive analysis for the optimization geometry of a matrix factorization problem where we aim to find <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">${n}\times {r}$ </tex-math></inline-formula> and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">${m}\times {r}$ </tex-math></inline-formula> matrices <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$ {U}$ </tex-math></inline-formula> and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$ {V}$ </tex-math></inline-formula> such that <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$ {U} {V} ^{\mathrm {T}}$ </tex-math></inline-formula> approximates a given matrix <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$ {X}^\star $ </tex-math></inline-formula> . Aside from the robust strict saddle property, we show that the objective function of the matrix factorization problem has no spurious local minima and obeys the strict saddle property not only for the exact-parameterization case where <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$\mathrm {rank}( {X}^\star) = {r}$ </tex-math></inline-formula> , but also for the over-parameterization case where <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$\mathrm {rank}( {X}^\star) < {r}$ </tex-math></inline-formula> and the under-parameterization case where <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$\mathrm {rank}( {X}^\star) > {r}$ </tex-math></inline-formula> . These geometric properties imply that a number of iterative optimization algorithms (such as gradient descent) converge to a global solution with random initialization.

Read full abstract

Saddle Property Research Articles

Articles published on Saddle Property

The effect of smooth parametrizations on nonconvex optimization landscapes

Linear Regularizers Enforce the Strict Saddle Property

Geometric Analysis of Noisy Low-Rank Matrix Recovery in the Exact Parametrized and the Overparametrized Regimes

Sharp Restricted Isometry Property Bounds for Low-Rank Matrix Recovery Problems with Corrupted Measurements

Local and Global Linear Convergence of General Low-Rank Matrix Recovery Problems

Proximal Methods Avoid Active Strict Saddles of Weakly Convex Functions

The Global Optimization Geometry of Low-Rank Matrix Optimization

The Global Geometry of Centralized and Distributed Low-rank Matrix Recovery Without Regularization

Global Optimality in Low-Rank Matrix Optimization

The equilibrium stability for a smooth and discontinuous oscillator with dry friction

Invariant Manifolds and Orbit Control in the Solar Sail Three-Body Problem

Investigation of saddle trajectories for cardiac CT imaging in cone-beam geometry

Saddles and dynamics in a solvable mean-field model

Quasisaddles of liquids: Computational study of a bulk Lennard-Jones system

Implementing bounds-based approximations in convex-concave two-stage stochastic programming

Extremum principles for a general class of saddle functionals

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Saddle Property Research Articles

Articles published on Saddle Property

The effect of smooth parametrizations on nonconvex optimization landscapes

Linear Regularizers Enforce the Strict Saddle Property

Geometric Analysis of Noisy Low-Rank Matrix Recovery in the Exact Parametrized and the Overparametrized Regimes

Sharp Restricted Isometry Property Bounds for Low-Rank Matrix Recovery Problems with Corrupted Measurements

Local and Global Linear Convergence of General Low-Rank Matrix Recovery Problems

Proximal Methods Avoid Active Strict Saddles of Weakly Convex Functions

The Global Optimization Geometry of Low-Rank Matrix Optimization

The Global Geometry of Centralized and Distributed Low-rank Matrix Recovery Without Regularization

Global Optimality in Low-Rank Matrix Optimization

The equilibrium stability for a smooth and discontinuous oscillator with dry friction

Invariant Manifolds and Orbit Control in the Solar Sail Three-Body Problem

Investigation of saddle trajectories for cardiac CT imaging in cone-beam geometry

Saddles and dynamics in a solvable mean-field model

Quasisaddles of liquids: Computational study of a bulk Lennard-Jones system

Implementing bounds-based approximations in convex-concave two-stage stochastic programming

Extremum principles for a general class of saddle functionals