Point-Based Planning for Multi-Objective POMDPs

D.M. Roijers; S. Whiteson; F.A. Oliehoek

Point-Based Planning for Multi-Objective POMDPs

Authors	D.M. Roijers S. Whiteson F.A. Oliehoek
Publication date	2015
Host editors	Q. Yang M. Wooldridge
Book title	Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence
Book subtitle	Buenos Aires, Argentina, 25-31 July 2015
ISBN	9781577357384
Event	24th International Joint Conference on Artificial Intelligence
Pages (from-to)	1666-1672
Publisher	Palo Alto, CA : AAAI Press
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	Many sequential decision-making problems require an agent to reason about both multiple objectives and uncertainty regarding the environment's state. Such problems can be naturally modelled as multi-objective partially observable Markov decision processes (MOPOMDPs). We propose optimistic linear support with alpha reuse (OLSAR), which computes a bounded approximation of the optimal solution set for all possible weightings of the objectives. The main idea is to solve a series of scalarized single-objective POMDPs, each corresponding to a different weighting of the objectives. A key insight underlying OLSAR is that the policies and value functions produced when solving scalarized POMDPs in earlier iterations can be reused to more quickly solve scalarized POMDPs in later iterations. We show experimentally that OLSAR outperforms, both in terms of runtime and approximation quality, alternative methods and a variant of OLSAR that does not leverage reuse.
Document type	Conference contribution
Language	English
Published at	https://www.ijcai.org/Abstract/15/238 (Final published version)
Downloads	238 (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Point-Based Planning for Multi-Objective POMDPs