Simple Strategies in Multi-Objective MDPs

European Joint Conferences on Theory and Practice of Software - ETAPS - e. V.

Quatmann, Tim

Formale Metadaten

Titel

Serientitel

26th International Conference on Tools and Algorithms for the Construction and Analysis of Systems, TACAS 2020

Anzahl der Teile

Autor

Mitwirkende

Lizenz

CC-Namensnennung 3.0 Deutschland:
Sie dürfen das Werk bzw. den Inhalt zu jedem legalen Zweck nutzen, verändern und in unveränderter oder veränderter Form vervielfältigen, verbreiten und öffentlich zugänglich machen, sofern Sie den Namen des Autors/Rechteinhabers in der von ihm festgelegten Weise nennen.

Identifikatoren

10.5446/55030 (DOI)

Herausgeber

European Joint Conferences on Theory and Practice of Software - ETAPS - e. V.

Erscheinungsjahr

2021

Sprache

Englisch

Inhaltliche Metadaten

Fachgebiet

Informatik Mathematik

Genre

Konferenz/Talk

Abstract

We consider the verification of multiple expected reward objectives at once on Markov decision processes (MDPs). This enables a trade-off analysis among multiple objectives by obtaining the Pareto front. We focus on strategies that are easy to employ and implement. That is, strategies that are pure (no randomization) and have bounded memory. We show that checking whether a point is achievable by a pure stationary strategy is NP-complete, even for two objectives, and we provide an MILP encoding to solve the corresponding problem. The bounded memory case can be reduced to the stationary one by a product construction. Experimental results using Storm and Gurobi show the feasibility of our algorithms.

Schlagwörter

Strategy Synthesis

Multi-objective optimization

Markov Decision Processes

Probabilistic Systems