D 2013

Trading Performance for Stability in Markov Decision Processes

BRÁZDIL, Tomáš, Krishnendu CHATTERJEE, Vojtěch FOREJT and Antonín KUČERA

Basic information

Original name

Trading Performance for Stability in Markov Decision Processes

Authors

BRÁZDIL, Tomáš (203 Czech Republic, belonging to the institution), Krishnendu CHATTERJEE (356 India), Vojtěch FOREJT (203 Czech Republic, belonging to the institution) and Antonín KUČERA (203 Czech Republic, guarantor, belonging to the institution)

Edition

London, Proceedings of 28th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS 2013), p. 331-340, 10 pp. 2013

Publisher

IEEE Computer Society

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

United States of America

Confidentiality degree

není předmětem státního či obchodního tajemství

Publication form

storage medium (CD, DVD, flash disk)

RIV identification code

RIV/00216224:14330/13:00066541

Organization unit

Faculty of Informatics

ISBN

978-1-4799-0413-6

ISSN

UT WoS

000326815000038

Keywords (in Czech)

Markovovy rozhodovací procesy; optimalizace

Keywords in English

Markov decision processes; optimization

Tags

International impact, Reviewed
Změněno: 24/4/2014 18:43, RNDr. Pavel Šmerk, Ph.D.

Abstract

V originále

We study the complexity of central controller synthesis problems for finite-state Markov decision processes, where the objective is to optimize both the expected mean-payoff performance of the system and its stability. We argue that the basic theoretical notion of expressing the stability in terms of the variance of the mean-payoff (called global variance in our paper) is not always sufficient, since it ignores possible instabilities on respective runs. For this reason we propose alernative definitions of stability, which we call local and hybrid variance, and which express how rewards on each run deviate from the run's own mean-payoff and from the expected mean-payoff, respectively.

Links

GPP202/12/P612, research and development project
Name: Formální verifikace stochastických systémů s reálným časem (Acronym: Formální verifikace stochastických systémů s reáln)
Investor: Czech Science Foundation