SHP-VI Method of Solving DEC-POMDP Problem

Xiao Ping Wan,Shu Yu Li

doi:10.4028/www.scientific.net/amr.926-930.3245

Abstract

DEC-POMDP(Distributed Partially Observable Markov Decision Process) model is a multi-agent model of collaborative decision-making is important, but due to an alarming number of DEC-POMDP problem state space and great strategy solution space, so DEC-POMDP solution of the problem becomes very difficult. The agent from the initial state to the target state during the interaction with the environment, the system's maximum benefit is often only with some small amount of a higher reward states. This article by searching from the initial belief state to the target state to get a shortest Hamiltonian path, according to the corresponding sequence of actions on the path forward search to get faith belief state space trajectory, and then along the trajectory reverse convictions value function iteration, thus forming the state with the largest gains beliefs trajectory corresponding optimal strategy. In this paper, shortest Hamiltonian path-based value iteration to search the optimal path of faith so as to solve the state Hamiltonian larger DEC-POMDP problem.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SHP-VI Method of Solving DEC-POMDP Problem

Abstract

Talk to us

Similar Papers

More From: Advanced Materials Research

Lead the way for us

Journal: Advanced Materials Research	Publication Date: May 1, 2014
Citations: 1

Similar Papers

Mitigating Nonattendance Using Clinic-Resourced Incentives Can Be Mutually Beneficial: A Contingency Management-Inspired Partially Observable Markov Decision Process Model
Yunxiang Bai ... Bjorn P Berg
Value in Health | VOL. 24
Yunxiang Bai, et. al.Yunxiang Bai ... Bjorn P Berg
28 Jun 2021
Value in Health | VOL. 24

A Discrete Partially Observable Markov Decision Process Model for the Maintenance Optimization of Oil and Gas Pipelines
Ezra Wari ... Weihang Zhu
Algorithms | VOL. 16
Ezra Wari, et. al.Ezra Wari ... Weihang Zhu
12 Jan 2023
Algorithms | VOL. 16

A reactive power optimization partially observable Markov decision process with data uncertainty using multi-agent actor-attention-critic algorithm
Yaru Gu ... Xueliang Huang
International Journal of Electrical Power & Energy Systems | VOL. 147
Yaru Gu, et. al.Yaru Gu ... Xueliang Huang
05 Dec 2022
International Journal of Electrical Power & Energy Systems | VOL. 147

Decision making framework for autonomous vehicle navigation
Augie Widyotriatmo ... Keum-Shik Hong
-
Augie Widyotriatmo, et. al.Augie Widyotriatmo ... Keum-Shik Hong
01 Aug 2008
01 Aug 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SHP-VI Method of Solving DEC-POMDP Problem

Abstract

Talk to us

Similar Papers

More From: Advanced Materials Research