Abstract

Unsupervised online learning in commercial computer games allows computer-controlled opponents to adapt to the way the game is being played. As such it provides a mechanism to deal with weaknesses in the game AI and to respond to changes in human player tactics. In prior work we designed a novel technique called “dynamic scripting” that is able to create successful adaptive opponents. However, experimental evaluations indicated that, occasionally, the time needed for dynamic scripting to generate effective opponents becomes unacceptably long. We investigated two different countermeasures against these long adaptation times (which we call “outliers”), namely a better balance between rewards and penalties, and a history-fallback mechanism. Experimental results indicate that a combination of these two countermeasures is able to reduce the number of outliers significantly. We therefore conclude that the performance of dynamic scripting is enhanced by these counter-measures.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.