Monte Carlo Tree Search with Options for General Video Game Playing

M. de Waard; D.M. Roijers; S.C.J. Bakkes

doi:https://doi.org/10.1109/CIG.2016.7860383

Monte Carlo Tree Search with Options for General Video Game Playing

Authors	M. de Waard D.M. Roijers S.C.J. Bakkes
Publication date	2016
Book title	2016 IEEE Conference on Computational Intelligence and Games (CIG 2016)
Book subtitle	Santorini, Greece, 20-23 September 2016
ISBN	9781509018840
ISBN (electronic)	9781509018833 9781509018826
Event	2016 IEEE Conference on Computational Intelligence and Games
Pages (from-to)	47-54
Number of pages	8
Publisher	Piscataway, NJ: IEEE
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	General video game playing is a challenging research area in which the goal is to find one algorithm that can play many games successfully. “Monte Carlo Tree Search” (MCTS) is a popular algorithm that has often been used for this purpose. It incrementally builds a search tree based on observed states after applying actions. However, the MCTS algorithm always plans over actions and does not incorporate any higher level planning, as one would expect from a human player. Furthermore, although many games have similar game dynamics, often no prior knowledge is available to general video game playing algorithms. In this paper, we introduce a new algorithm called “Option Monte Carlo Tree Search” (O-MCTS). It offers general video game knowledge and high level planning in the form of “options”, which are action sequences aimed at achieving a specific subgoal. Additionally, we introduce “Option Learning MCTS” (OL-MCTS), which applies a progressive widening technique to the expected returns of options in order to focus exploration on fruitful parts of the search tree. Our new algorithms are compared to MCTS on a diverse set of twenty-eight games from the general video game AI competition. Our results indicate that by using MCTS's efficient tree searching technique on options, O-MCTS outperforms MCTS on most of the games, especially those in which a certain subgoal has to be reached before the game can be won. Lastly, we show that OL-MCTS improves its performance on specific games by learning expected values for options and moving a bias to higher valued options.
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.1109/CIG.2016.7860383
Other links	http://www.proceedings.com/33414.html
Downloads	Monte Carlo Tree Search with Options for General Video Game Playing (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Monte Carlo Tree Search with Options for General Video Game Playing