Detailed Information on Publication Record
2023
Efficient Strategy Synthesis for MDPs With Resource Constraints
BLAHOUDEK, Fratišek, Petr NOVOTNÝ, Melkior ORNIK, Pranay THANGEDA, Ufuk TOPCU et. al.Basic information
Original name
Efficient Strategy Synthesis for MDPs With Resource Constraints
Authors
BLAHOUDEK, Fratišek (203 Czech Republic), Petr NOVOTNÝ (203 Czech Republic, guarantor, belonging to the institution), Melkior ORNIK (191 Croatia), Pranay THANGEDA (356 India) and Ufuk TOPCU (792 Turkey)
Edition
IEEE Transactions on Automatic Control, 2023, 0018-9286
Other information
Language
English
Type of outcome
Článek v odborném periodiku
Field of Study
10201 Computer sciences, information science, bioinformatics
Country of publisher
United States of America
Confidentiality degree
není předmětem státního či obchodního tajemství
References:
Impact factor
Impact factor: 6.800 in 2022
RIV identification code
RIV/00216224:14330/23:00134423
Organization unit
Faculty of Informatics
UT WoS
001041305400007
Keywords in English
Consumption Markov decision process (CMDP); planning; resource constraints; strategy synthesis
Změněno: 7/4/2024 23:48, RNDr. Pavel Šmerk, Ph.D.
Abstract
V originále
We consider qualitative strategy synthesis for the formalism called consumption Markov decision processes. This formalism can model the dynamics of an agent that operates under resource constraints in a stochastic environment. The presented algorithms work in time polynomial with respect to the representation of the model and they synthesize strategies ensuring that a given set of goal states will be reached (once or infinitely many times) with probability 1 without resource exhaustion. In particular, when the amount of resource becomes too low to safely continue in the mission, the strategy changes course of the agent toward one of a designated set of reload states where the agent replenishes the resource to full capacity; with a sufficient amount of resource, the agent attempts to fulfill the mission again. We also present two heuristics that attempt to reduce the expected time that the agent needs to fulfill the given mission, a parameter important in practical planning. The presented algorithms were implemented, and the numerical examples demonstrate the effectiveness (in terms of computation time) of the planning approach based on consumption Markov decision processes and the positive impact of the two heuristics on planning in a realistic example.
Links
GA21-24711S, research and development project |
|