Deep Reinforcement Learning Network Research Articles

Reinforcement learning, a branch of machine learning, is increasingly applied in the control field. In practice, however, the hyperparameters of deep reinforcement learning networks are still set by the trial-and-error approach inherited from traditional machine learning (supervised and unsupervised learning). This approach ignores part of the information that agents generate while exploring the environment, information that is contained in the updates of the reinforcement learning value function, and this omission degrades both convergence and cumulative return. The reinforcement learning algorithm based on dynamic parameter adjustment is a new method for setting the learning rate of deep reinforcement learning. Building on the traditional method of setting parameters for reinforcement learning, it analyzes the advantages of different learning rates at different stages of training and adjusts the learning rate dynamically according to the temporal-difference (TD) error, so that each stage benefits from the learning rate best suited to it and the algorithm becomes more practical to apply. In addition, by combining the Robbins–Monro approximation algorithm with the deep reinforcement learning algorithm, it is proved that the algorithm with a dynamically adjusted learning rate theoretically meets the convergence requirements of an intelligent control algorithm. In the experiments, the method is evaluated on a continuous control scenario in the standard reinforcement learning environment "Car-on-the-Hill", and the results verify that it achieves better results than traditional reinforcement learning in practical application.
According to the model characteristics of deep reinforcement learning, a more suitable method for setting the learning rate of the deep reinforcement learning network is proposed. The feasibility of the method has been shown both in theory and in application. The method of setting the learning rate parameter is therefore worthy of further development and research.
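
The idea of adjusting the learning rate from the TD error can be sketched as follows. This is a minimal illustration, not the algorithm from the article: the function names, the tanh-based boost, the decay exponent of 0.6, and the five-state random walk used as a test bed are all assumptions chosen for the example. The step size decays as a power of the per-state visit count with an exponent in (0.5, 1], which satisfies the Robbins–Monro step-size conditions (the step sizes sum to infinity while their squares sum to a finite value), and the TD-error boost stays between 1 and 2, so those conditions are preserved.

```python
import math
import random


def dynamic_learning_rate(step, td_error, base=0.5, decay=0.6):
    """Decaying step size that is scaled up when the TD error is large.

    Hypothetical schedule: base / (1 + step) ** decay, multiplied by a
    boost in [1, 2] that grows with |td_error|.
    """
    boost = 1.0 + math.tanh(abs(td_error))
    return base / (1.0 + step) ** decay * boost


def td0_random_walk(episodes=2000, n_states=5, gamma=1.0, seed=0):
    """TD(0) value estimation on a random walk (assumed test bed).

    The walk starts in the middle state and moves left or right with equal
    probability. Leaving on the right pays reward 1, leaving on the left
    pays 0, so the true value of state i is (i + 1) / (n_states + 1).
    """
    rng = random.Random(seed)
    values = [0.5] * n_states
    visits = [0] * n_states
    for _ in range(episodes):
        state = n_states // 2
        while True:
            nxt = state + rng.choice((-1, 1))
            done = nxt < 0 or nxt >= n_states
            reward = 1.0 if nxt >= n_states else 0.0
            bootstrap = 0.0 if done else values[nxt]
            td_error = reward + gamma * bootstrap - values[state]
            # The learning rate depends on both the visit count and the TD error.
            alpha = dynamic_learning_rate(visits[state], td_error)
            values[state] += alpha * td_error
            visits[state] += 1
            if done:
                break
            state = nxt
    return values


if __name__ == "__main__":
    estimates = td0_random_walk()
    for i, v in enumerate(estimates):
        print(f"state {i}: estimate {v:.3f}, true value {(i + 1) / 6:.3f}")
```

Early in training, when TD errors are large, the boost lets the estimates move quickly; as the errors shrink, the schedule falls back to the plain decaying step size.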

Supervised object detection models require fully annotated data for training the network. However, labeling large datasets is very time-consuming, so weakly supervised object detection (WSOD) offers an alternative to fully supervised learning for the object detection task. Although many WSOD methods have been proposed to date, their performance is still lower than that of supervised approaches because WSOD is a very challenging task. The major problems with existing WSOD methods are partial object detection and false detection in clusters of objects of the same category. The majority of WSOD methods follow multiple instance learning approaches, which do not guarantee the completeness of detected objects. To address these issues, we propose a three-fold strategy for refining proposals so that complete instances are learned. We generate class-specific localization maps by fusing class activation maps obtained from fused complementary classification networks. These localization maps are used to amend the detected proposals from the instance classification branch (detection network). Deep reinforcement learning networks based on the policy gradient algorithm are proposed to learn a decisive agent and a rectifying agent that further refine the proposals. The refined bounding boxes are then fed to the instance classification network. The refinement operations result in learning complete objects and greatly improve detection performance. Experimental results show that the proposed WSOD method achieves better detection performance than state-of-the-art methods on the PASCAL VOC2007 and VOC2012 benchmarks.
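
The step of amending a partial proposal with a fused localization map can be illustrated with a small sketch. It is not the procedure from the article: fusing by element-wise maximum, the 0.5 threshold, the grid-cell box format `(x_min, y_min, x_max, y_max)`, and all function names are assumptions made for the example, and the reinforcement learning agents are left out entirely.

```python
def fuse_maps(map_a, map_b):
    """Element-wise maximum of two equally sized 2-D activation maps."""
    return [[max(a, b) for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(map_a, map_b)]


def box_from_map(act_map, threshold=0.5):
    """Tightest (x_min, y_min, x_max, y_max) box around cells >= threshold."""
    cells = [(x, y) for y, row in enumerate(act_map)
             for x, value in enumerate(row) if value >= threshold]
    if not cells:
        return None
    xs = [x for x, _ in cells]
    ys = [y for _, y in cells]
    return (min(xs), min(ys), max(xs), max(ys))


def amend_proposal(proposal, map_box):
    """Grow a proposal to cover the map box when the two boxes overlap."""
    if map_box is None:
        return proposal
    px0, py0, px1, py1 = proposal
    mx0, my0, mx1, my1 = map_box
    overlaps = px0 <= mx1 and mx0 <= px1 and py0 <= my1 and my0 <= py1
    if not overlaps:
        return proposal
    return (min(px0, mx0), min(py0, my0), max(px1, mx1), max(py1, my1))


# Two hypothetical class activation maps that each highlight half of an object.
cam_a = [[0.0, 0.0, 0.0, 0.0],
         [0.0, 0.9, 0.8, 0.0],
         [0.0, 0.1, 0.2, 0.0],
         [0.0, 0.0, 0.0, 0.0]]
cam_b = [[0.0, 0.0, 0.0, 0.0],
         [0.0, 0.1, 0.2, 0.0],
         [0.0, 0.7, 0.9, 0.0],
         [0.0, 0.0, 0.0, 0.0]]

fused = fuse_maps(cam_a, cam_b)
object_box = box_from_map(fused)            # covers both halves of the object
partial = (1, 1, 2, 1)                      # proposal covering only the top half
amended = amend_proposal(partial, object_box)
```

In this toy case each map alone would yield only half of the object, while the fused map recovers the full extent and the partial proposal is expanded to match it.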

Articles published on Deep Reinforcement Learning Network

Feasibility Analysis and Application of Reinforcement Learning Algorithm Based on Dynamic Parameter Adjustment

A Survey on Deep Reinforcement Learning Network for Traffic Light Cycle Control

UAV navigation in high dynamic environments: A deep reinforcement learning approach

Goal-Oriented Obstacle Avoidance with Deep Reinforcement Learning in Continuous Action Space

Collision avoidance for an unmanned surface vehicle using deep reinforcement learning

A Robust Context-Aware Proposal Refinement Method for Weakly Supervised Object Detection

Reinforcement Learning With Low-Complexity Liquid State Machines

A Deep Reinforcement Learning Network for Traffic Light Cycle Control

Multi-agent deep learning for simultaneous optimization for time and energy in distributed routing system

A Deep Hierarchical Approach to Lifelong Learning in Minecraft
