Abstract

UCT (Upper Confidence bounds applied to Trees) has been applied successfully as a selection policy in MCTS (Monte Carlo Tree Search) for imperfect-information games such as poker. By using risk dominance as a complement to payoff dominance in the decision method, opponent strategies can be better characterized by their risk factors, such as bluffing in Texas Hold’em poker. This paper provides a method for estimating the influence of risk factors on the computation of game equilibria. Based on this risk estimation method, a novel algorithm, UCT-risk, is proposed as a modification of the UCT algorithm. To verify the performance of the new algorithm, Texas Hold’em, a popular test-bed for AI research, is chosen as the experimental platform. In experiments, an agent using the UCT-risk algorithm performs as well as or better than the best previous approaches. The algorithm was also applied in a poker agent named HITSZ_CS_13 in the 2013 AAAI Computer Poker Competition, which confirms the effectiveness of the UCT-risk algorithm presented in this paper.
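
For orientation, the baseline that UCT-risk modifies can be sketched as follows. This is a minimal illustration of the standard UCB1-based UCT selection rule only; it is not the UCT-risk algorithm, whose risk term is not specified in this abstract. The function names, the dictionary layout for child statistics, and the default exploration constant `c` are assumptions made for the example.

```python
import math

def uct_score(total_reward, visits, parent_visits, c=math.sqrt(2)):
    """Standard UCB1 score for one child node (baseline UCT, not UCT-risk)."""
    if visits == 0:
        # Unvisited children are tried before any visited one.
        return math.inf
    mean_reward = total_reward / visits                        # exploitation term
    exploration = c * math.sqrt(math.log(parent_visits) / visits)
    return mean_reward + exploration

def select_child(children, c=math.sqrt(2)):
    """Pick the action with the highest UCB1 score.

    children: dict mapping action -> (total_reward, visits).
    This layout is hypothetical, chosen for brevity.
    """
    parent_visits = sum(visits for _, visits in children.values())
    return max(
        children,
        key=lambda a: uct_score(children[a][0], children[a][1], parent_visits, c),
    )
```

With `c = 0` the rule reduces to picking the highest average payoff; a larger `c` favors rarely visited actions. A risk-aware variant would presumably adjust this score using an estimate of the opponent's risk factors, but that adjustment is left to the full paper.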
