BRÁZDIL, Tomáš, Krishnendu CHATTERJEE, Vojtěch FOREJT and Antonín KUČERA. Trading Performance for Stability in Markov Decision Processes. In Proceedings of 28th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS 2013). London: IEEE Computer Society, 2013, p. 331-340. ISBN 978-1-4799-0413-6. Available from: https://dx.doi.org/10.1109/LICS.2013.39.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Trading Performance for Stability in Markov Decision Processes
Authors BRÁZDIL, Tomáš (203 Czech Republic, belonging to the institution), Krishnendu CHATTERJEE (356 India), Vojtěch FOREJT (203 Czech Republic, belonging to the institution) and Antonín KUČERA (203 Czech Republic, guarantor, belonging to the institution).
Edition London, Proceedings of 28th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS 2013), p. 331-340, 10 pp. 2013.
Publisher IEEE Computer Society
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher United States of America
Confidentiality degree is not subject to a state or trade secret
Publication form storage medium (CD, DVD, flash disk)
RIV identification code RIV/00216224:14330/13:00066541
Organization unit Faculty of Informatics
ISBN 978-1-4799-0413-6
ISSN 1043-6871
Doi http://dx.doi.org/10.1109/LICS.2013.39
UT WoS 000326815000038
Keywords (in Czech) Markovovy rozhodovací procesy; optimalizace
Keywords in English Markov decision processes; optimization
Tags core_A, firank_1, formela-conference
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 24/4/2014 18:43.
Abstract
We study the complexity of central controller synthesis problems for finite-state Markov decision processes, where the objective is to optimize both the expected mean-payoff performance of the system and its stability. We argue that the basic theoretical notion of expressing the stability in terms of the variance of the mean-payoff (called global variance in our paper) is not always sufficient, since it ignores possible instabilities on respective runs. For this reason we propose alernative definitions of stability, which we call local and hybrid variance, and which express how rewards on each run deviate from the run's own mean-payoff and from the expected mean-payoff, respectively.
Links
GPP202/12/P612, research and development projectName: Formální verifikace stochastických systémů s reálným časem (Acronym: Formální verifikace stochastických systémů s reáln)
Investor: Czech Science Foundation
PrintDisplayed: 14/6/2024 14:15