Abstract
This study proposes a novel intelligent-agent model that introduces a dynamic emotion model into the conventional action selection policy of reinforcement learning. Compared with conventional Q-learning, the proposed method adds two emotional factors to the state-action value function: an “arousal value” factor, which affects the motivation for action, and a “pleasure value” factor, which influences the probability of action selection. When multiple agents are present in an agent's perception area, its emotional factors are affected by those other agents. Computer simulations of pursuit problems with static and dynamic prey were performed, and all results showed the effectiveness of the proposed method: learning converged faster than with conventional Q-learning.
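To make the idea concrete, the following is a minimal sketch of how two emotional factors could be attached to tabular Q-learning. It is not the formulation used in the study: the abstract does not give the equations, so the additive pleasure bias inside a softmax, the arousal threshold that gates whether the agent acts, and every function and parameter name here (`q_update`, `action_probabilities`, `select_action`, `threshold`, `temperature`) are assumptions made for illustration. Only `q_update` is the standard Q-learning rule.

```python
import math
import random


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update (the conventional baseline)."""
    best_next = max(q[next_state])
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])


def action_probabilities(q_row, pleasure_row, temperature=1.0):
    """Softmax over Q-values plus a pleasure bias.

    ASSUMPTION: adding the pleasure value to the Q-value is a hypothetical
    combination rule, chosen only to show pleasure shifting the probabilities.
    """
    logits = [(qv + pv) / temperature for qv, pv in zip(q_row, pleasure_row)]
    peak = max(logits)  # subtract the maximum for numerical stability
    weights = [math.exp(logit - peak) for logit in logits]
    total = sum(weights)
    return [w / total for w in weights]


def select_action(q_row, pleasure_row, arousal, threshold=0.5, rng=random):
    """Return an action index, or None when the agent is not motivated to act.

    ASSUMPTION: gating on an arousal threshold is a hypothetical stand-in for
    "arousal affects the motivation of action".
    """
    if arousal < threshold:
        return None
    probs = action_probabilities(q_row, pleasure_row)
    return rng.choices(range(len(q_row)), weights=probs, k=1)[0]
```

In this sketch, a higher pleasure value for an action raises its selection probability without changing the learned Q-value, and a low arousal value suppresses action altogether. How the two factors are updated from other agents in the perception area is left out, since the abstract does not specify it.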