Entropy-Aware Model Initialization for Effective Exploration In Deep Reinforcement Learning

Sooyoung Jang,Hyung-Il Kim

doi:10.2139/ssrn.4047895

Entropy-Aware Model Initialization for Effective Exploration In Deep Reinforcement Learning

Sooyoung Jang, Hyung-Il Kim

Open Access

PDF Available

https://doi.org/10.2139/ssrn.4047895

Copy DOI

Export

Save

Cite

Journal: SSRN Electronic Journal	Publication Date: Jan 1, 2022
Citations: 1

Affiliation: Electronics and Telecommunications Research Institute

#Deep Reinforcement Learning #Initial Entropy #Learning Speed #Learning Strategy #Deep Learning #Critical Issue #Issue In Learning #Effect Of Entropy #Strategy For Exploration #Low Value

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Encouraging exploration is a critical issue in deep reinforcement learning. We investigate the effect of initial entropy that significantly influences the exploration, especially at the earlier stage. Our main observations are as follows: 1) low initial entropy increases the probability of learning failure, and 2) this initial entropy is biased towards a low value that inhibits exploration. Inspired by the investigations, we devise entropy-aware model initialization, a simple yet powerful learning strategy for effective exploration. We show that the devised learning strategy significantly reduces learning failures and enhances performance, stability, and learning speed through experiments.

Full Text

Submitted Version (Free)

View/Download pdf

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: SSRN Electronic Journal

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

Entropy-Aware Model Initialization for Effective Exploration In Deep Reinforcement Learning