Abstract

AbstractThis paper analyzes how individuals resolve an exploration versus exploitation trade‐off in a laboratory experiment. The experiment implements the single‐agent exponential bandit model. We analyze how subjects respond to changes in the prior belief, safe action, and discount factor. We find that subjects respond in the predicted direction to these changes. However, we find that subjects under‐respond to the prior belief, under‐respond to the safe action, and typically explore less than predicted. Our results suggest that neither risk aversion nor the random termination probability are driving under‐experimentation. Our results are consistent with subjects having incorrect beliefs about exploration.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call