CHATTERJEE, Krishnendu, Martin CHMELÍK, Deep KARKHANIS, Petr NOVOTNÝ and Amélie ROYER. Multiple-Environment Markov Decision Processes: Efficient Analysis and Applications. Online. In Proceedings of the International Conference on Automated Planning and Scheduling. Palo Alto: AAAI Press, 2020, p. 48-56. ISBN 978-1-57735-824-4.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Multiple-Environment Markov Decision Processes: Efficient Analysis and Applications
Authors CHATTERJEE, Krishnendu (356 India), Martin CHMELÍK (203 Czech Republic), Deep KARKHANIS (356 India), Petr NOVOTNÝ (203 Czech Republic, guarantor, belonging to the institution) and Amélie ROYER (250 France).
Edition Palo Alto, Proceedings of the International Conference on Automated Planning and Scheduling, p. 48-56, 9 pp. 2020.
Publisher AAAI Press
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10200 1.2 Computer and information sciences
Country of publisher United States of America
Confidentiality degree is not subject to a state or trade secret
Publication form electronic version available online
WWW URL
RIV identification code RIV/00216224:14330/20:00114616
Organization unit Faculty of Informatics
ISBN 978-1-57735-824-4
ISSN 2334-0835
Keywords in English decision making; Markov decision processes; contextual recommendations
Tags core_A, firank_1, formela-dec
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 29/4/2021 08:12.
Abstract
Multiple-environment Markov decision processes (MEMDPs) are MDPs equipped with not one, but multiple probabilistic transition functions, which represent the various possible unknown environments. While the previous research on MEMDPs focused on theoretical properties for long-run average payoff, we study them with discounted-sum payoff and focus on their practical advantages and applications. MEMDPs can be viewed as a special case of Partially observable and Mixed observability MDPs: the state of the system is perfectly observable, but not the environment. We show that the specific structure of MEMDPs allows for more efficient algorithmic analysis, in particular for faster belief updates. We demonstrate the applicability of MEMDPs in several domains. In particular, we formalize the sequential decision-making approach to contextual recommendation systems as MEMDPs and substantially improve over the previous MDP approach.
Links
GA19-15134Y, interní kód MUName: Verifikace a analýza pravděpodobnostních programů
Investor: Czech Science Foundation
GJ19-15134Y, research and development projectName: Verifikace a analýza pravděpodobnostních programů
PrintDisplayed: 22/5/2024 07:37