Policy Gradient Approach of Event‐Based Optimization and Its Online Implementation

Li Xia

doi:10.1002/asjc.874

Abstract

AbstractIn the theory of event‐based optimization (EBO), the decision making is triggered by events, which is different from the traditional state‐based control in Markov decision processes (MDP). In this paper, we propose a policy gradient approach of EBO. First, an equation of performance gradient in the event‐based policy space is derived based on a fundamental quantity called Q‐factors of EBO. With the performance gradient, we can find the local optimum of EBO using the gradient‐based algorithm. Compared to the policy iteration approach in EBO, this policy gradient approach does not require restrictive conditions and it has a wider application scenario. The policy gradient approach is further implemented based on the online estimation of Q‐factors. This approach does not require the prior information about the system parameters, such as the transition probability. Finally, we use an EBO model to formulate the admission control problem and demonstrate the main idea of this paper. Such online algorithm provides an effective implementation of the EBO theory in practice.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Policy Gradient Approach of Event‐Based Optimization and Its Online Implementation

Abstract

Talk to us

Similar Papers

More From: Asian Journal of Control

Lead the way for us

Journal: Asian Journal of Control	Publication Date: Mar 21, 2014
Citations: 26

Similar Papers

Basic Ideas for Event-Based Optimization of Markov Systems
Xi-Ren Cao
Discrete Event Dynamic Systems | VOL. 15
Xi-Ren CaoXi-Ren Cao
01 Jun 2005
Discrete Event Dynamic Systems | VOL. 15

Quickest change detection approach to optimal control in Markov decision processes with model changes
Taposh Banerjee ... Miao Liu
-
Taposh Banerjee, et. al.Taposh Banerjee ... Miao Liu
01 May 2017
01 May 2017

On age of information for remote control of Markov decision processes over multiple access channels
Minha Mubarak ... B S Vineeth
-
Minha Mubarak, et. al.Minha Mubarak ... B S Vineeth
23 Feb 2023
23 Feb 2023

Local and global event-based optimization: Performace and complexity
Zijian Wu ... Xiaohong Guan
-
Zijian Wu, et. al.Zijian Wu ... Xiaohong Guan
01 Aug 2015
01 Aug 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Policy Gradient Approach of Event‐Based Optimization and Its Online Implementation

Abstract

Talk to us

Similar Papers

More From: Asian Journal of Control