Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems Part 2—Applications in Transportation, Industries, Communications and Networking and More Topics

Xuanchen Xiang, Simon Foo, Huanyu Zang

doi:10.3390/make3040043

Abstract

This two-part series of papers surveys recent advances in Deep Reinforcement Learning (DRL) for solving partially observable Markov decision process (POMDP) problems. Reinforcement Learning (RL) is an approach that simulates the natural human learning process; its key idea is to let the agent learn by interacting with a stochastic environment. Because the agent needs only limited access to information about the environment, AI can be applied efficiently in most fields that require self-learning. An organized investigation is essential: it allows good comparisons and helps in choosing the best structures or algorithms when applying DRL in various applications. The first part of the overview introduces Markov Decision Process (MDP) problems, Reinforcement Learning, and applications of DRL for solving POMDP problems in games, robotics, and natural language processing. In part two, we continue with applications in transportation, industries, communications and networking, and other topics, and discuss the limitations of DRL.

Highlights

Reinforcement Learning (RL) is an approach that simulates the natural human learning process; its key idea is to let the agent learn by interacting with a stochastic environment.
Navigation is a fundamental task in autonomous driving, and Deep Reinforcement Learning (DRL) has proven effective in navigation problems: Fayjie et al. [33] presented a Deep Q-Network (DQN)-based approach for navigation in urban environments, and Isele et al. [34] used a DQN-based method for navigating occluded intersections.
Mobile Edge Computing (MEC) is a promising technology for extending services to the edge of Internet of Things (IoT) systems, and DRL has been successfully applied in MEC networks in recent years [74,75,76].

Summary

Transportation

An intelligent transportation system (ITS) [1] is an application that aims to provide safe, efficient, and innovative services for transport and traffic management and to construct more intelligent transport networks. The first format is an image-like representation called Discrete Traffic State Encoding (DTSE). It acquires high-resolution, practical information from the intersection. Genders and Razavi [14] proposed the discrete traffic state encoding, which is information-dense, as the input to the DQN for a traffic signal control agent (DQTSCA), and evaluated state representations from low to high resolution using Asynchronous Advantage Actor-Critic (A3C) in [15]. Xu et al. [24] used a data-driven approach to find critical nodes, which can cause a reduction in traffic efficiency. They introduced a policy gradient method on these nodes. In 2020, Haydari and Yilmaz [2] provided tables outlining single-agent and multi-agent RL approaches for Traffic Signal Control (TSC), DRL methods for TSC, and DRL solutions for other ITS applications.
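To make the image-like state representation more concrete, the following is a minimal sketch of a DTSE-style encoder. It assumes each incoming lane is cut into fixed-length cells and that the state consists of a binary presence grid and a normalized speed grid; the function name `encode_dtse`, its parameters, and the cell layout are illustrative assumptions, not the exact formulation used in [14].

```python
from typing import List, Tuple

Vehicle = Tuple[int, float, float]  # (lane index, distance to stop line in m, speed in m/s)


def encode_dtse(
    vehicles: List[Vehicle],
    num_lanes: int,
    lane_length_m: float,
    cell_length_m: float,
    speed_limit_mps: float,
) -> Tuple[List[List[int]], List[List[float]]]:
    """Build hypothetical DTSE-style grids: one row per lane, one column per cell.

    Returns (presence, speed), where presence[lane][cell] is 1 if a vehicle
    occupies the cell and speed[lane][cell] is its speed scaled to [0, 1].
    """
    num_cells = int(lane_length_m // cell_length_m)
    presence = [[0] * num_cells for _ in range(num_lanes)]
    speed = [[0.0] * num_cells for _ in range(num_lanes)]

    for lane, distance, velocity in vehicles:
        # Skip vehicles outside the observed lanes or beyond the encoded length.
        if not 0 <= lane < num_lanes:
            continue
        if not 0.0 <= distance < num_cells * cell_length_m:
            continue
        cell = int(distance // cell_length_m)
        presence[lane][cell] = 1
        # If several vehicles share a cell, the last one processed sets the speed.
        speed[lane][cell] = min(max(velocity / speed_limit_mps, 0.0), 1.0)

    return presence, speed


# Illustrative values only: two lanes, 50 m observed per lane, 5 m cells.
example_vehicles = [(0, 3.0, 0.0), (0, 12.5, 5.0), (1, 40.0, 10.0)]
presence_grid, speed_grid = encode_dtse(
    example_vehicles,
    num_lanes=2,
    lane_length_m=50.0,
    cell_length_m=5.0,
    speed_limit_mps=20.0,
)
```

Shorter cells give a higher-resolution state at the cost of a larger input to the network. In a sketch like this, the two grids would typically be stacked as channels and passed to a convolutional Q-network.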

Autonomous Driving

Other Applications in ITS

Industrial Applications

Smart Grid

Communications and Networking

Connected Vehicles

Resources Management

Healthcare

Education

Finance

Aerospace

Deep Reinforcement Learning Limitations

Summary


Journal: Machine Learning and Knowledge Extraction
Publication Date: Oct 28, 2021
Citations: 3
License type: CC BY 4.0
