BRÁZDIL, Tomáš, Krishnendu CHATTERJEE, Vojtěch FOREJT and Antonín KUČERA. Trading performance for stability in Markov decision processes. Journal of Computer and System Sciences. SAN DIEGO: Elsevier, 2017, vol. 84, No 2017, p. 144-170. ISSN 0022-0000. Available from: https://dx.doi.org/10.1016/j.jcss.2016.09.009.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Trading performance for stability in Markov decision processes
Authors BRÁZDIL, Tomáš (203 Czech Republic, belonging to the institution), Krishnendu CHATTERJEE (40 Austria), Vojtěch FOREJT (203 Czech Republic, belonging to the institution) and Antonín KUČERA (203 Czech Republic, guarantor, belonging to the institution).
Edition Journal of Computer and System Sciences, SAN DIEGO, Elsevier, 2017, 0022-0000.
Other information
Original language English
Type of outcome Article in a journal
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher United States of America
Confidentiality degree is not subject to a state or trade secret
Impact factor Impact factor: 1.497
RIV identification code RIV/00216224:14330/17:00094587
Organization unit Faculty of Informatics
Doi http://dx.doi.org/10.1016/j.jcss.2016.09.009
UT WoS 000388430000011
Keywords in English Markov decision processes; Mean payoff; Stability; Stochastic systems; Controller synthesis
Tags formela-journal
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 27/4/2018 10:34.
Abstract
We study controller synthesis problems for finite-state Markov decision processes, where the objective is to optimize the expected mean-payoff performance and stability (also known as variability in the literature). We argue that the basic notion of expressing the stability using the statistical variance of the mean payoff is sometimes insufficient, and propose an alternative definition. We show that a strategy ensuring both the expected mean payoff and the variance below given bounds requires randomization and memory, under both the above definitions. We then show that the problem of finding such a strategy can be expressed as a set of constraints.
Links
GA15-17564S, research and development projectName: Teorie her jako prostředek pro formální analýzu a verifikaci počítačových systémů
Investor: Czech Science Foundation
PrintDisplayed: 23/7/2024 02:36