Research and Application of Reinforcement Learning Based on Constraint MDP in Coal Mine

Zhao Xiao-Hu, Ma Fang-Qing, Wang Qing-Qing, Zhao Ke-Ke

doi:10.1109/csie.2009.587

Abstract

Reinforcement learning is a model-free approach in which an agent learns what to do, that is, how to map situations to actions, so as to maximize a numerical reward signal. It offers a practical method for systems in complex environments where accurate models are very difficult to build. Many practical problems, however, require maximizing reward while keeping cost (expense) limited. In coal mining, for example, production is closely tied to safety: output can be increased only within the limits that the safety situation allows. Building on the Markov decision process (MDP) and reinforcement learning, the paper introduces the constrained Markov decision process into reinforcement learning. It extends the Q-learning algorithm with a cost factor, yielding a new Q-learning algorithm based on the constrained MDP. Finally, based on the constraint between production and safety in coal mines, the paper simulates action control of a coal shearer at a coal mine working face. The simulation results verify the validity of the method.
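
The abstract does not state the modified update rule, so the following is only a minimal sketch of one common way to add a cost factor to tabular Q-learning, not the paper's algorithm. It assumes a second table for expected cost and a fixed penalty weight `lam` (a Lagrangian-style trade-off); the function names, the state and action labels, and the reward and cost values are all hypothetical.

```python
from collections import defaultdict


def penalised_value(q_r, q_c, state, action, lam):
    """Reward estimate minus lam times cost estimate (assumed trade-off)."""
    return q_r[(state, action)] - lam * q_c[(state, action)]


def greedy_action(q_r, q_c, state, actions, lam=1.0):
    """Pick the action with the highest penalised value in this state."""
    return max(actions, key=lambda a: penalised_value(q_r, q_c, state, a, lam))


def constrained_q_update(q_r, q_c, s, a, r, c, s_next, actions,
                         alpha=0.1, gamma=0.9, lam=1.0):
    """One tabular update of the reward table q_r and the cost table q_c.

    Both tables bootstrap from the same next action, chosen greedily
    under the penalised value, so reward and cost estimates stay paired.
    """
    a_next = greedy_action(q_r, q_c, s_next, actions, lam)
    q_r[(s, a)] += alpha * (r + gamma * q_r[(s_next, a_next)] - q_r[(s, a)])
    q_c[(s, a)] += alpha * (c + gamma * q_c[(s_next, a_next)] - q_c[(s, a)])
    return q_r[(s, a)], q_c[(s, a)]


# Hypothetical shearer example: cutting "fast" earns more reward (output)
# but incurs a safety cost; cutting "slow" earns less and costs nothing.
q_r, q_c = defaultdict(float), defaultdict(float)
actions = ["slow", "fast"]
constrained_q_update(q_r, q_c, "face", "fast", r=2.0, c=3.0,
                     s_next="face", actions=actions, alpha=0.5)
print(greedy_action(q_r, q_c, "face", actions, lam=1.0))
print(greedy_action(q_r, q_c, "face", actions, lam=0.0))
```

With `lam=0.0` the cost table is ignored and the rule reduces to ordinary Q-learning; a larger `lam` makes the agent give up reward to avoid cost. A fixed weight is the simplest choice here; constrained-MDP methods often adapt it so that expected cost stays under a stated budget.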
