Unifying Two Views on Multiple Mean-Payoff Objectives in Markov
Decision Processes

CHATTERJEE, Krishnendu, Zuzana KOMÁRKOVÁ and Jan KŘETÍNSKÝ. Unifying Two Views on Multiple Mean-Payoff Objectives in Markov Decision Processes. In Thirtieth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS). Los Alamitos, California: IEEE, 2015, p. 244-256. ISBN 978-1-4799-8875-4. Available from: https://dx.doi.org/10.1109/LICS.2015.32.

Other formats: BibTeX LaTeX RIS

Basic information
Original name	Unifying Two Views on Multiple Mean-Payoff Objectives in Markov Decision Processes
Authors	CHATTERJEE, Krishnendu (356 India), Zuzana KOMÁRKOVÁ (203 Czech Republic, belonging to the institution) and Jan KŘETÍNSKÝ (203 Czech Republic, guarantor, belonging to the institution).
Edition	Los Alamitos, California, Thirtieth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), p. 244-256, 13 pp. 2015.
Publisher	IEEE

Other information
Original language	English
Type of outcome	Proceedings paper
Field of Study	10201 Computer sciences, information science, bioinformatics
Country of publisher	United States of America
Confidentiality degree	is not subject to a state or trade secret
Publication form	printed version "print"
RIV identification code	RIV/00216224:14330/15:00080917
Organization unit	Faculty of Informatics
ISBN	978-1-4799-8875-4
ISSN	1043-6871
Doi	http://dx.doi.org/10.1109/LICS.2015.32
UT WoS	000380427100024
Keywords in English	Markov decision process; mean payoff; optimization; probability
Tags	core_A, firank_1, formela-conference
Tags	International impact, Reviewed
Changed by	Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 23/10/2017 12:46.

Abstract

We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives. There exist two different views: (i)~the expectation semantics, where the goal is to optimize the expected mean-payoff objective, and (ii)~the satisfaction semantics, where the goal is to maximize the probability of runs such that the mean-payoff value stays above a given vector. We consider optimization with respect to both objectives at once, thus unifying the existing semantics. Precisely, the goal is to optimize the expectation while ensuring the satisfaction constraint. Our problem captures the notion of optimization with respect to strategies that are risk-averse (i.e., ensure certain probabilistic guarantee). Our main results are as follows: First, we present algorithms for the decision problems, which are always polynomial in the size of the MDP. We also show that an approximation of the Pareto curve can be computed in time polynomial in the size of the MDP, and the approximation factor, but exponential in the number of dimensions. Second, we present a complete characterization of the strategy complexity (in terms of memory bounds and randomization) required to solve our problem.

Links
GBP202/12/G061, research and development project	Name: Centrum excelence - Institut teoretické informatiky (CE-ITI) (Acronym: CE-ITI)
GBP202/12/G061, research and development project	Investor: Czech Science Foundation

PrintDisplayed: 26/4/2024 19:04

Unifying Two Views on Multiple Mean-Payoff Objectives in Markov Decision Processes

Other applications