Theoretical Convergence Guarantees Research Articles

The main problem of using standard optimization methods is the need to change all parameters in same-size steps, regardless of the behavior of the gradient. A more efficient way to optimize a neural network is to set adaptive step sizes for each parameter. Standard methods are based on the square roots of exponential estimates of the moments of the squares of past gradients and do not use the local variation in gradients. The paper presents methods of adaptive non-convex and belief-based optimization with a positive-negative estimate of the moments with the corresponding theoretical guarantees of convergence. These approaches allow the loss function to more accurately converge in the neighborhood of the global minimum in a smaller number of iterations. The utilization of transformed positive-negative moment estimates and an additional parameter that controls the step size allows one to avoid local extremes for achieving higher performance, compared to similar methods. The introduction of the developed algorithms into the learning process of various architectures of multimodal neural network systems for analyzing heterogeneous data has made it possible to increase the accuracy of recognizing pigmented skin lesions by 2.33 – 5.69 percentage points, compared to the original optimization methods. Multimodal neural network systems for analyzing heterogeneous dermatological data, using the proposed optimization algorithms, can be applied as a tool for auxiliary medical diagnostics, which will reduce the consumption of financial and labor resources involved in the medical industry, as well as increase the chance of early detection of pigmentary oncopathologies.

Read full abstract

Unsupervised domain adaptation (UDA) is an emerging learning paradigm that models on unlabeled datasets by leveraging model knowledge built on other labeled datasets, in which the statistical distributions of these datasets are usually not identical. Formally, UDA is to leverage knowledge from a labeled source domain to promote an unlabeled target domain. Although there have been a variety of methods proposed to address the UDA problem, most of them are dedicated to single-source-to-single-target domain, while the works on single-source-to-multitarget domain are relatively rare. Compared to the single-source domain with single-target domain scenario, the UDA from single-source domain to multitarget domain is more challenging since it needs to consider not only the relationships between the source and the target domains but also those among the target domains. To this end, this article proposes a kind of dictionary learning-based unsupervised multitarget domain adaptation method (DL-UMTDA). In DL-UMTDA, a common dictionary is constructed to correlate the single-source and multitarget domains, while individual dictionaries are designed to exploit the private knowledge for the target domains. Through learning the corresponding dictionary representation coefficients in the UDA process, the correlations from the source to the target domains as well as these potential relationships between the target domains can be effectively exploited. In addition, we design an alternating algorithm to solve the DL-UMTDA model with theoretical convergence guarantee. Finally, extensive experiments on benchmark (Office + Caltech) and real datasets (AgeDB, Morph, and CACD) validate the superiority of the proposed method.

Read full abstract

Theoretical Convergence Guarantees Research Articles

Related Topics

Articles published on Theoretical Convergence Guarantees

Scalable Clustering: Large Scale Unsupervised Learning of Gaussian Mixture Models with Outliers

Distributed Nonconvex Optimization for Control of Water Networks with Time-coupling Constraints

Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in mixed cooperative and competitive environments

Cube is a good form: Hyperspectral band selection via multi-dimensional and high-order structure preserved clustering

Graph-based semi-supervised learning with non-convex graph total variation regularization

A novel framework for online supervised learning with feature selection

Constrained Bayesian Optimization with Lower Confidence Bound

Non-convex optimization with using positive-negative moment estimation and its application for skin cancer recognition with a neural network

A Generalized Neural Diffusion Framework on Graphs

Dual auto-weighted multi-view clustering via autoencoder-like nonnegative matrix factorization

Unsupervised Multitarget Domain Adaptation With Dictionary-Bridged Knowledge Exploitation.

Variance Reduced Domain Randomization for Reinforcement Learning With Policy Gradient.

Disturbance feedback-based model predictive control in uncertain dynamic environments

Multiple Riemannian Kernel Hashing for Large-Scale Image Set Classification and Retrieval.

Feedback linearization control for uncertain nonlinear systems via generative adversarial networks

Randomized low rank approximation for nonnegative pure quaternion matrices

Image reconstruction through compressive sampling matching pursuit and curvelet transform

Global and local similarity learning in multi-kernel space for nonnegative matrix factorization

Deep Reinforcement Learning‐Based Air Defense Decision‐Making Using Potential Games

A Derivative-Incorporated Adaptive Gradient Method for Federated Learning

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Theoretical Convergence Guarantees Research Articles

Related Topics

Articles published on Theoretical Convergence Guarantees

Scalable Clustering: Large Scale Unsupervised Learning of Gaussian Mixture Models with Outliers

Distributed Nonconvex Optimization for Control of Water Networks with Time-coupling Constraints

Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in mixed cooperative and competitive environments

Cube is a good form: Hyperspectral band selection via multi-dimensional and high-order structure preserved clustering

Graph-based semi-supervised learning with non-convex graph total variation regularization

A novel framework for online supervised learning with feature selection

Constrained Bayesian Optimization with Lower Confidence Bound

Non-convex optimization with using positive-negative moment estimation and its application for skin cancer recognition with a neural network

A Generalized Neural Diffusion Framework on Graphs

Dual auto-weighted multi-view clustering via autoencoder-like nonnegative matrix factorization

Unsupervised Multitarget Domain Adaptation With Dictionary-Bridged Knowledge Exploitation.

Variance Reduced Domain Randomization for Reinforcement Learning With Policy Gradient.

Disturbance feedback-based model predictive control in uncertain dynamic environments

Multiple Riemannian Kernel Hashing for Large-Scale Image Set Classification and Retrieval.

Feedback linearization control for uncertain nonlinear systems via generative adversarial networks

Randomized low rank approximation for nonnegative pure quaternion matrices

Image reconstruction through compressive sampling matching pursuit and curvelet transform

Global and local similarity learning in multi-kernel space for nonnegative matrix factorization

Deep Reinforcement Learning‐Based Air Defense Decision‐Making Using Potential Games

A Derivative-Incorporated Adaptive Gradient Method for Federated Learning