Unifying Two Views on Multiple Mean-Payoff Objectives in Markov
Decision Processes

CHATTERJEE, Krishnendu, Zuzana KOMÁRKOVÁ a Jan KŘETÍNSKÝ. Unifying Two Views on Multiple Mean-Payoff Objectives in Markov Decision Processes. In Thirtieth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS). Los Alamitos, California: IEEE, 2015, s. 244-256. ISBN 978-1-4799-8875-4. Dostupné z: https://dx.doi.org/10.1109/LICS.2015.32.

Další formáty: BibTeX LaTeX RIS

Základní údaje
Originální název	Unifying Two Views on Multiple Mean-Payoff Objectives in Markov Decision Processes
Autoři	CHATTERJEE, Krishnendu (356 Indie), Zuzana KOMÁRKOVÁ (203 Česká republika, domácí) a Jan KŘETÍNSKÝ (203 Česká republika, garant, domácí).
Vydání	Los Alamitos, California, Thirtieth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), od s. 244-256, 13 s. 2015.
Nakladatel	IEEE

Další údaje
Originální jazyk	angličtina
Typ výsledku	Stať ve sborníku
Obor	10201 Computer sciences, information science, bioinformatics
Stát vydavatele	Spojené státy
Utajení	není předmětem státního či obchodního tajemství
Forma vydání	tištěná verze "print"
Kód RIV	RIV/00216224:14330/15:00080917
Organizační jednotka	Fakulta informatiky
ISBN	978-1-4799-8875-4
ISSN	1043-6871
Doi	http://dx.doi.org/10.1109/LICS.2015.32
UT WoS	000380427100024
Klíčová slova anglicky	Markov decision process; mean payoff; optimization; probability
Štítky	core_A, firank_1, formela-conference
Příznaky	Mezinárodní význam, Recenzováno
Změnil	Změnil: RNDr. Pavel Šmerk, Ph.D., učo 3880. Změněno: 23. 10. 2017 12:46.

Anotace

We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives. There exist two different views: (i)~the expectation semantics, where the goal is to optimize the expected mean-payoff objective, and (ii)~the satisfaction semantics, where the goal is to maximize the probability of runs such that the mean-payoff value stays above a given vector. We consider optimization with respect to both objectives at once, thus unifying the existing semantics. Precisely, the goal is to optimize the expectation while ensuring the satisfaction constraint. Our problem captures the notion of optimization with respect to strategies that are risk-averse (i.e., ensure certain probabilistic guarantee). Our main results are as follows: First, we present algorithms for the decision problems, which are always polynomial in the size of the MDP. We also show that an approximation of the Pareto curve can be computed in time polynomial in the size of the MDP, and the approximation factor, but exponential in the number of dimensions. Second, we present a complete characterization of the strategy complexity (in terms of memory bounds and randomization) required to solve our problem.

Návaznosti
GBP202/12/G061, projekt VaV	Název: Centrum excelence - Institut teoretické informatiky (CE-ITI) (Akronym: CE-ITI)
GBP202/12/G061, projekt VaV	Investor: Grantová agentura ČR, Centrum excelence - Institut teoretické informatiky

VytisknoutZobrazeno: 25. 4. 2024 13:21

Unifying Two Views on Multiple Mean-Payoff Objectives in Markov Decision Processes

Další aplikace