The Cost of Reinforcement Learning for Game Engines: The AZ-Hive Case-study

Authors
Publication date 2022
Book title ICPE '22
Book subtitle proceedings of the 2022 ACM/SPEC International Conference on Performance Engineering : April 9-13, 2022, Bejing, China
ISBN (electronic)
  • 9781450391436
Event 13th ACM/SPEC International Conference on Performance Engineering
Pages (from-to) 145-152
Publisher New York, NY: The Association for Computing Machinery
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Although utilising computers to play board games has been a topic of research for many decades, the recent rapid developments in the field of reinforcement learning - like AlphaZero and variants - brought unprecedented progress in games such as chess and Go. However, the efficiency of this process remains unknown. In this work, we analyse the cost and efficiency of the AlphaZero approach when building a new game engine. Thus, we present our experience building AZ-Hive, an AlphaZero-based playing engine for the game of Hive. Using only the rules of the game and a quality of play assessment, AZ-Hive learns to play the game from scratch. Getting AZ-Hive up and running requires encoding the game in AlphaZero, i.e., capturing the board, the game state, the rules and the assessment of play-quality. And different encodings lead to significantly different AZ-Hive engines, with very different performance results. Thus, we propose a design space for configuring AZ-Hive, and demonstrate the costs and benefits of different configurations in this space. We find that different configurations lead to a less or more competitive playing-engine, but the training and evaluation for different such engines is prohibitively expensive. Moreover, no systematic, efficient exploration or pruning of the space is possible. In turn, an exhaustive exploration can easily take tens of training-years.
Document type Conference contribution
Language English
Published at https://doi.org/10.1145/3489525.3511685
Permalink to this page
Back