Verification of Markov Decision Processes using Learning
Algorithms

BRÁZDIL, Tomáš, Krishnendu CHATTERJEE, Martin CHMELÍK, Vojtěch FOREJT, Jan KŘETÍNSKÝ, Marta KWIATKOWSKA, David PARKER and Mateusz UJMA. Verification of Markov Decision Processes using Learning Algorithms. In Automated Technology for Verification and Analysis - 12th International Symposium, ATVA 2014. Heidelberg Dordrecht London New York: Springer, 2014, p. 98-114. ISBN 978-3-319-11935-9. Available from: https://dx.doi.org/10.1007/978-3-319-11936-6_8.

Other formats: BibTeX LaTeX RIS

Basic information
Original name	Verification of Markov Decision Processes using Learning Algorithms
Authors	BRÁZDIL, Tomáš (203 Czech Republic, belonging to the institution), Krishnendu CHATTERJEE (356 India), Martin CHMELÍK (203 Czech Republic), Vojtěch FOREJT (203 Czech Republic), Jan KŘETÍNSKÝ (203 Czech Republic, guarantor, belonging to the institution), Marta KWIATKOWSKA (616 Poland), David PARKER (826 United Kingdom of Great Britain and Northern Ireland) and Mateusz UJMA (616 Poland).
Edition	Heidelberg Dordrecht London New York, Automated Technology for Verification and Analysis - 12th International Symposium, ATVA 2014, p. 98-114, 17 pp. 2014.
Publisher	Springer

Other information
Original language	English
Type of outcome	Proceedings paper
Field of Study	10201 Computer sciences, information science, bioinformatics
Country of publisher	Germany
Confidentiality degree	is not subject to a state or trade secret
Publication form	printed version "print"
Impact factor	Impact factor: 0.402 in 2005
RIV identification code	RIV/00216224:14330/14:00075875
Organization unit	Faculty of Informatics
ISBN	978-3-319-11935-9
ISSN	0302-9743
Doi	http://dx.doi.org/10.1007/978-3-319-11936-6_8
Keywords in English	stochastic systems; verification; machine learning; statistical model checking; reinforcement learning
Tags	core_A, firank_A, formela-conference
Tags	International impact, Reviewed
Changed by	Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 27/4/2015 05:45.

Abstract

We present a general framework for applying machine-learning algorithms to the verification of Markov decision processes (MDPs). The primary goal of these techniques is to improve performance by avoiding an exhaustive exploration of the state space. Our framework focuses on probabilistic reachability, which is a core property for verification, and is illustrated through two distinct instantiations. The first assumes that full knowledge of the MDP is available, and performs a heuristic-driven partial exploration of the model, yielding precise lower and upper bounds on the required probability. The second tackles the case where we may only sample the MDP, and yields probabilistic guarantees, again in terms of both the lower and upper bounds, which provides efficient stopping criteria for the approximation. The latter is the first extension of statistical model checking for unbounded properties in MDPs. In contrast with other related approaches, we do not restrict our attention to time-bounded (finite-horizon) or discounted properties, nor assume any particular properties of the MDP. We also show how our techniques extend to LTL objectives. We present experimental results showing the performance of our framework on several examples.

Links
MUNI/A/0765/2013, interní kód MU	Name: Zapojení studentů Fakulty informatiky do mezinárodní vědecké komunity (Acronym: SKOMU)
MUNI/A/0765/2013, interní kód MU	Investor: Masaryk University, Category A
MUNI/A/0855/2013, interní kód MU	Name: Rozsáhlé výpočetní systémy: modely, aplikace a verifikace III. (Acronym: FI MAV III.)
MUNI/A/0855/2013, interní kód MU	Investor: Masaryk University, Category A

PrintDisplayed: 12/5/2024 09:47

Verification of Markov Decision Processes using Learning Algorithms

Other applications