Abstract
This paper presents a novel game environment, GuessWhat+, for visual dialogue research, and proposes an efficient deep reinforcement learning algorithm, MRRB, for optimizing visual questions. GuessWhat+ is an extended version of the existing visual dialogue environment, GuessWhat?!. In order to overcome the limitations of GuessWhat?!, it enables the participating agents to utilize immediate rewards from games. The proposed deep reinforcement learning algorithm, MRRB(Mini-Batch REINFORCE with Return Baseline) is a new policy gradient algorithm to meet both the data inefficiency problem and the unstable convergence problem. Experiments showed the usefulness of the GuessWhat+ environment and the high performance of the proposed MRRB algorithm.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have