Convex Loss Research Articles

This research paper presents an innovative approach to gradient descent known as ‘‘Sample Gradient Descent’’. This method is a modification of the conventional batch gradient descent algorithm, which is often associated with space and time complexity issues. The proposed approach involves the selection of a representative sample of data, which is subsequently subjected to batch gradient descent. The selection of this sample is a crucial task, as it must accurately represent the entire dataset. To achieve this, the study employs the use of Principle Component Analysis (PCA), which is applied to the training data, with a condition that only those rows and columns of data that explain 90% of the overall variance are retained. This approach results in a convex loss function, where a global minimum can be readily attained. Our results indicate that the proposed method offers faster convergence rates, with reduced computation times, when compared to the conventional batch gradient descent algorithm. These findings demonstrate the potential utility of the ‘‘Sample Gradient Descent’’ technique in various domains, ranging from machine learning to optimization problems. In our experiments, both approaches were run for 30 epochs, with each epoch taking approximately 3.41 s. Notably, our ‘‘Sample Gradient Descent’’ approach exhibited remarkable performance, converging in just 8 epochs, while the conventional batch gradient descent algorithm required 20 epochs to achieve convergence. This substantial difference in convergence rates, along with reduced computation times, highlights the superior efficiency of our proposed method. These findings underscore the potential utility of the ‘‘Sample Gradient Descent’’ technique across diverse domains, ranging from machine learning to optimization problems. The significant improvements in convergence rates and computation times make our algorithm particularly appealing to practitioners and researchers seeking enhanced efficiency in gradient descent optimization.

In this paper, we study the Differentially Private Empirical Risk Minimization (DP-ERM) problem, considering both convex and non-convex loss functions. For cases where DP-ERM involves smooth (strongly) convex loss functions with or without non-smooth regularization, we propose several novel methods. These methods achieve (near) optimal expected excess risks (i.e., utility bounds) while reducing the gradient complexity compared to existing approaches. When dealing with DP-ERM and smooth convex loss functions in high dimensions, we introduce an algorithm that achieves a superior upper bound with lower gradient complexity than previous solutions.In the second part of the paper, for DP-ERM with non-convex loss functions, we explore both low and high dimensional spaces. In the low dimensional case with a non-smooth regularizer, we extend an existing approach by measuring the utility using the ℓ2 norm of the projected gradient. Furthermore, we introduce a novel error bound measurement, transitioning from empirical risk to population risk by employing the expected ℓ2 norm of the gradient. For the high dimensional case, we demonstrate that by measuring utility with the Frank-Wolfe gap, we can bound the utility using the Gaussian Width of the constraint set, instead of the dimensionality (p) of the underlying space. We also show that the advantages of this approach can be achieved by measuring the ℓ2 norm of the projected gradient. Finally, we reveal that the utility of certain special non-convex loss functions can be reduced to a level (depending only on log⁡p) similar to that of convex loss functions.

Convex Loss Research Articles

Articles published on Convex Loss

Deterministic equivalent and error universality of deep random features learning*

FedADMM-InSa: An inexact and self-adaptive ADMM for federated learning

An error analysis for deep binary classification with sigmoid loss

Group inference of high-dimensional single-index models

Large-scale robust regression with truncated loss via majorization-minimization algorithm

Large-Scale Non-convex Stochastic Constrained Distributionally Robust Optimization

Model Change Active Learning in Graph-Based Semi-supervised Learning

High–dimensional local linear regression under sparsity and convex losses

Linearized alternating direction method of multipliers for elastic-net support vector machines

Fluctuations, bias, variance and ensemble of learners: exact asymptotics for convex losses in high-dimension *

From big data to smart data: a sample gradient descent approach for machine learning

Gradient complexity and non-stationary views of differentially private empirical risk minimization

Recurrent Neural Network Training With Convex Loss and Regularization Functions by Extended Kalman Filtering

Generalization Analysis of CNNs for Classification on Spheres.

Online distributed detection of sensor networks with delayed information

Asymptotic linear convergence of fully-corrective generalized conditional gradient methods

Distributed Projection-Free Online Learning for Smooth and Convex Losses

Meta-Inductive Probability Aggregation

Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization

Clustered Federated Learning Based on Momentum Gradient Descent for Heterogeneous Data

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Convex Loss Research Articles

Articles published on Convex Loss

Deterministic equivalent and error universality of deep random features learning*

FedADMM-InSa: An inexact and self-adaptive ADMM for federated learning

An error analysis for deep binary classification with sigmoid loss

Group inference of high-dimensional single-index models

Large-scale robust regression with truncated loss via majorization-minimization algorithm

Large-Scale Non-convex Stochastic Constrained Distributionally Robust Optimization

Model Change Active Learning in Graph-Based Semi-supervised Learning

High–dimensional local linear regression under sparsity and convex losses

Linearized alternating direction method of multipliers for elastic-net support vector machines

Fluctuations, bias, variance and ensemble of learners: exact asymptotics for convex losses in high-dimension *

From big data to smart data: a sample gradient descent approach for machine learning

Gradient complexity and non-stationary views of differentially private empirical risk minimization

Recurrent Neural Network Training With Convex Loss and Regularization Functions by Extended Kalman Filtering

Generalization Analysis of CNNs for Classification on Spheres.

Online distributed detection of sensor networks with delayed information

Asymptotic linear convergence of fully-corrective generalized conditional gradient methods

Distributed Projection-Free Online Learning for Smooth and Convex Losses

Meta-Inductive Probability Aggregation

Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization

Clustered Federated Learning Based on Momentum Gradient Descent for Heterogeneous Data