Abstract

This work presents a reinforcement learning approach for intelligent decision-making of a Cutter Suction Dredger (CSD), which is a special type of vessel for deepening harbors, constructing ports or navigational channels, and reclaiming landfills. Currently, CSDs are usually controlled by human operators, and the production rate is mainly determined by the so-called cutting process (i.e., cutting the underwater soil into fragments). Long-term manual operation is likely to cause driving fatigue, resulting in operational accidents and inefficiencies. To reduce the labor intensity of the operator, we seek an intelligent controller the can manipulate the cutting process to replace human operators. To this end, our proposed reinforcement learning approach consists of two parts. In the first part, we employ a neural network model to construct a virtual environment based on the historical dredging data. In the second part, we develop a reinforcement learning model that can lean the optimal control policy by interacting with the virtual environment to obtain human experience. The results show that the proposed learning approach can successfully imitate the dredging behavior of an experienced human operator. Moreover, the learning approach can outperform the operator in a way that can make quick responses to the change in uncertain environments.

Highlights

  • The Cutter Suction Dredger (CSD) is a special vessel designed for the maintenance of rivers, lakes, or ports

  • Afterwards, we present the Reinforcement Learning (RL) model to learn the optimal control policy by interacting with the virtual environment, and a reward scheme is designed to guarantee that the state transitions are safe for the dredging operation

  • To evaluate the control policy, the human and the learning system will start from the same initial state and control the cutting process, respectively

Read more

Summary

Introduction

The Cutter Suction Dredger (CSD) is a special vessel designed for the maintenance of rivers, lakes, or ports. The CSD is usually controlled by human operators, and it is a daunting challenge for them to maintain a good production rate during the so-called cutting process. This is because the geological conditions of the working site are uncertain and dynamic for the operators. Afterwards, we present the RL model to learn the optimal control policy by interacting with the virtual environment, and a reward scheme is designed to guarantee that the state transitions are safe for the dredging operation.

Cutting Process Analysis
General Layout of a CSD
Problem Statement
Problem Formulation
Neural Network Model to Construct Virtual Environment
Representation of State Space
Representation of Action Space
State Transitions in Neural Networks
Reinforcement Learning to Find Optimal Control Policy
Reward Scheme
Model-Free Learning Algorithm
Evaluation and Results
Prediction of the Dynamics of the Environment
Intelligent Control of the Cutting Process
Results and Analysis
Comparison of Production
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call