Abstract

Smoothed functional (SF) algorithm estimates the gradient of the stochastic optimization problem by convolution with a smoothening kernel. This process helps the algorithm to converge to a global minimum or a point close to it. We study a two-time scale SF based gradient search algorithm with Nesterov’s acceleration for stochastic optimization problems. The main contribution of our work is to prove the convergence of this algorithm using the stochastic approximation theory. We propose a novel Lyapunov function to show the associated second-order ordinary differential equations’ (o.d.e.) stability for a non-autonomous system. We compare our algorithm with other smoothed functional algorithms such as Quasi-Newton SF, Gradient SF and Jacobi Variant of Newton SF on two different optimization problems: first, on a simple stochastic function minimization problem, and second, on the problem of optimal routing in a queueing network. Additionally, we compared the algorithms on real weather data in a weather prediction task. Experimental results show that our algorithm performs significantly better than these baseline algorithms.

Highlights

  • IntroductionOptimization problems deal with minimizing (or maximizing) the value of an objective function [1]

  • Optimization problems deal with minimizing the value of an objective function [1]

  • When parameters of the objective function or the optimization algorithm have randomness, the process of optimization is termed as Stochastic Optimization (SO) [2]

Read more

Summary

Introduction

Optimization problems deal with minimizing (or maximizing) the value of an objective function [1]. When parameters of the objective function or the optimization algorithm have randomness, the process of optimization is termed as Stochastic Optimization (SO) [2] It has applications in various fields such as machine learning [3], finance, supply chain [4], network optimization [5] and optimization with information uncertainty [6], [7]. One of the most popular algorithm in this regard is Stochastic Gradient Descent (SGD) [8] It was evolved from the works of Robbins and Monro [9], and it estimates the gradients of the cost function. Assumption 5. {Z(n)} is a martingale difference sequence with respect to the increasing family of σ-fields.

Objectives
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.