Efficient Strategy Synthesis for MDPs With Resource Constraints
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F23%3A00134423" target="_blank" >RIV/00216224:14330/23:00134423 - isvavai.cz</a>
Result on the web
<a href="https://ieeexplore.ieee.org/document/9903331" target="_blank" >https://ieeexplore.ieee.org/document/9903331</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/TAC.2022.3209612" target="_blank" >10.1109/TAC.2022.3209612</a>
Alternative languages
Result language
angličtina
Original language name
Efficient Strategy Synthesis for MDPs With Resource Constraints
Original language description
We consider qualitative strategy synthesis for the formalism called consumption Markov decision processes. This formalism can model the dynamics of an agent that operates under resource constraints in a stochastic environment. The presented algorithms work in time polynomial with respect to the representation of the model and they synthesize strategies ensuring that a given set of goal states will be reached (once or infinitely many times) with probability 1 without resource exhaustion. In particular, when the amount of resource becomes too low to safely continue in the mission, the strategy changes course of the agent toward one of a designated set of reload states where the agent replenishes the resource to full capacity; with a sufficient amount of resource, the agent attempts to fulfill the mission again. We also present two heuristics that attempt to reduce the expected time that the agent needs to fulfill the given mission, a parameter important in practical planning. The presented algorithms were implemented, and the numerical examples demonstrate the effectiveness (in terms of computation time) of the planning approach based on consumption Markov decision processes and the positive impact of the two heuristics on planning in a realistic example.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/GA21-24711S" target="_blank" >GA21-24711S: Efficient Analysis and Optimization for Probabilistic Systems and Games</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2023
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
IEEE Transactions on Automatic Control
ISSN
0018-9286
e-ISSN
1558-2523
Volume of the periodical
68
Issue of the periodical within the volume
8
Country of publishing house
US - UNITED STATES
Number of pages
16
Pages from-to
4586-4601
UT code for WoS article
001041305400007
EID of the result in the Scopus database
2-s2.0-85139456742