Abstract

This article addresses the robot pathfinding problem with environmental disturbances, where a solution must account for the risks inherent in an uncertain and stochastic environment. For example, the movements of an underwater robot can be seriously disturbed by ocean currents, so control actions applied to the robot cannot be guaranteed to lead exactly to the desired locations. Reinforcement learning is a formal methodology that has been extensively studied in many sequential decision-making domains with uncertainty, but most reinforcement learning algorithms consider only a single objective encoded by a scalar reward. The robot pathfinding problem with environmental disturbances, however, naturally involves multiple conflicting objectives. Specifically, in this work, the robot has to minimise its moving distance so as to save energy, and, at the same time, it has to stay as far away from unsafe regions as possible. To this end, we first propose a multiobjective model-free learning framework, and then investigate an appropriate action selection strategy by improving a baseline with respect to two dimensions. To demonstrate the effectiveness of the proposed learning framework and evaluate the performance of three action selection strategies, we also carry out an empirical study in a simulated environment.
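To make the multiobjective setting concrete, the following is a minimal sketch of vector-valued Q-learning with linear scalarisation, one common way to handle two conflicting objectives such as moving distance and safety. The state/action sizes, the weight vector, and the reward design here are illustrative assumptions, not the paper's actual formulation.

```python
import numpy as np

# Assumed toy dimensions: a 5x5 grid world, 4 move actions,
# 2 objectives (distance cost, safety cost).
N_STATES, N_ACTIONS, N_OBJ = 25, 4, 2
ALPHA, GAMMA = 0.1, 0.95
WEIGHTS = np.array([0.5, 0.5])  # assumed trade-off between the objectives

# One Q-vector per (state, action) pair, one component per objective.
Q = np.zeros((N_STATES, N_ACTIONS, N_OBJ))

def greedy_action(state):
    """Pick the action maximising the linearly scalarised Q-vector."""
    scalarised = Q[state] @ WEIGHTS   # shape: (N_ACTIONS,)
    return int(np.argmax(scalarised))

def update(state, action, reward_vec, next_state):
    """Standard Q-learning update applied component-wise to the Q-vector."""
    best_next = greedy_action(next_state)
    td_target = reward_vec + GAMMA * Q[next_state, best_next]
    Q[state, action] += ALPHA * (td_target - Q[state, action])
```

Keeping the Q-values as vectors (rather than pre-mixing the rewards into one scalar) preserves per-objective information, so the trade-off weights can be changed without relearning from scratch.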

Highlights

  • The pathfinding problem has been extensively studied in the robot domain, where the robot has to generate a collision-free path in a given or an unknown environment.[1,2] In this work, we seek to address the robot pathfinding problem with environmental disturbances.

  • We investigate an improved action selection strategy in which the learning agent pays more attention to potentially promising actions, so that the learning process can quickly converge to the optimal policy

  • We focus on the robot pathfinding problem in an initially unknown and stochastic environment with conflicting objectives


Summary


Our contributions are threefold: a multiobjective model-free learning framework that can handle multiple conflicting objectives; an action selection strategy for the pathfinding problem that improves a baseline with respect to two dimensions (the second of which, D2, requires the learning agent to pay more attention to potentially promising actions so that the learning process can quickly converge to the optimal policy); and an empirical study that demonstrates the effectiveness of the proposed model-free learning framework and evaluates the performance of three action selection strategies in a simulated environment.
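One standard way to bias exploration toward promising actions, in the spirit of dimension D2, is Boltzmann (softmax) exploration: actions are sampled in proportion to exp(Q/T), so higher-valued actions are chosen more often while every action keeps a nonzero probability of being tried. This is a generic sketch, not necessarily one of the three strategies evaluated in the paper; the temperature value is an assumption.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax_action(q_values, temperature=1.0):
    """Sample an action with probability proportional to exp(Q / T).

    Lower temperatures concentrate probability on the highest-valued
    ("promising") actions; higher temperatures approach uniform
    exploration.
    """
    prefs = np.asarray(q_values, dtype=float) / temperature
    prefs -= prefs.max()          # shift for numerical stability
    probs = np.exp(prefs)
    probs /= probs.sum()
    return int(rng.choice(len(probs), p=probs))
```

Compared with plain epsilon-greedy, which explores all non-greedy actions uniformly, this rule ranks the non-greedy actions by their current value estimates, which tends to speed up convergence when those estimates are informative.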

