Inverse Reinforcement Learning with Agents’ Biased Exploration Based on Sub-Optimal Sequential Action Data

Fumito Uwano, Keiki Takadama, Satoshi Hasegawa

doi:10.20965/jaciii.2024.p0380

Open Access

https://doi.org/10.20965/jaciii.2024.p0380

Journal: Journal of Advanced Computational Intelligence and Intelligent Informatics
Publication Date: Mar 20, 2024
License type: cc-by-nd

Affiliation: Okayama University, University of Electro-Communications

#Inverse Reinforcement Learning #Sub-optimal Data

Abstract

Inverse reinforcement learning (IRL) estimates a reward function that leads an agent to behave in accordance with expert data, such as human operation data. However, expert data usually contain redundant parts, which degrade the agent’s performance. This study extends IRL to sub-optimal action data, including data with missing parts (lack) and detours. The proposed method searches for new actions to determine the optimal expert action data. Maze problems with sub-optimal expert action data were used to investigate the performance of the proposed method. The experimental results show that the proposed method finds the optimal expert data better than the conventional method, and that the proposed search mechanisms perform better than random search.
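
As an illustration of what "lack" and "detour" can mean for expert action data in a maze, the following is a minimal sketch and not the authors' method. The maze layout, the action encoding (U, D, L, R), and the function names `rollout`, `shortest_path_length`, and `diagnose` are all assumptions made for this example. It replays an action sequence, checks whether it reaches the goal (a sequence that stops short is treated here as having a "lack"), and counts how many steps it takes beyond the shortest path (treated here as a "detour").

```python
from collections import deque

# Hypothetical 4x4 maze (an assumption for illustration):
# 'S' start, 'G' goal, '#' wall, '.' free cell.
MAZE = [
    "S..#",
    ".#..",
    "...#",
    "#..G",
]
MOVES = {"U": (-1, 0), "D": (1, 0), "L": (0, -1), "R": (0, 1)}


def find(ch):
    """Return the (row, col) of the first cell containing ch."""
    for r, row in enumerate(MAZE):
        c = row.find(ch)
        if c != -1:
            return (r, c)
    raise ValueError(f"{ch!r} not in maze")


def step(pos, action):
    """Apply one action; moves into walls or off the grid leave pos unchanged."""
    dr, dc = MOVES[action]
    r, c = pos[0] + dr, pos[1] + dc
    if 0 <= r < len(MAZE) and 0 <= c < len(MAZE[0]) and MAZE[r][c] != "#":
        return (r, c)
    return pos


def rollout(actions):
    """Replay an action sequence from the start and return the visited states."""
    pos = find("S")
    states = [pos]
    for a in actions:
        pos = step(pos, a)
        states.append(pos)
    return states


def shortest_path_length(start, goal):
    """Breadth-first search over cells; returns the minimum number of steps."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        pos = queue.popleft()
        if pos == goal:
            return dist[pos]
        for a in MOVES:
            nxt = step(pos, a)
            if nxt not in dist:
                dist[nxt] = dist[pos] + 1
                queue.append(nxt)
    return None  # goal unreachable


def diagnose(actions):
    """Summarize how an action sequence deviates from an optimal one."""
    goal = find("G")
    reaches_goal = rollout(actions)[-1] == goal
    optimal = shortest_path_length(find("S"), goal)
    # Excess steps are only meaningful when the sequence actually reaches the goal.
    excess = len(actions) - optimal if reaches_goal else None
    return {"reaches_goal": reaches_goal, "steps": len(actions), "excess_steps": excess}


if __name__ == "__main__":
    print(diagnose(list("RRDDLRDR")))  # reaches the goal with a left-right detour
    print(diagnose(list("DDRR")))      # stops before the goal
```

A shortest-path comparison like this requires knowing the maze and the goal in advance, so it only serves to show what sub-optimal data look like; the paper instead has the agent search for new actions to arrive at optimal expert action data.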
