Second Order Optimality in Markov and Semi-Markov Decision Processes
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F67985556%3A_____%2F19%3A00517875" target="_blank" >RIV/67985556:_____/19:00517875 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Second Order Optimality in Markov and Semi-Markov Decision Processes
Original language description
Semi-Markov decision processes can be considered as an extension of discrete- and continuous-time Markov reward models. Unfortunately, traditional optimality criteria as long-run average reward per time may be quite insufficient to characterize the problem from the point of a decision maker. To this end it may be preferable if not necessary to select more sophisticated criteria that also reflect variability-risk features of the problem. Perhaps the best known approaches stem from the classical work of Markowitz on mean-variance selection rules, i.e. we optimize the weighted sum of average or total reward and its variance. Such approach has been already studied for very special classes of semi-Markov decision processes, in particular, for Markov decision processes in discrete - and continuous-time setting. In this note these approaches are summarized and possible extensions to the wider class of semi-Markov decision processes is discussed. Attention is mostly restricted to uncontrolled models in which the chain is aperiodic and contains a single class of recurrent states. Considering finite time horizons, explicit formulas for the first and second moments of total reward as well as for the corresponding variance are produced.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10103 - Statistics and probability
Result continuities
Project
<a href="/en/project/GA18-02739S" target="_blank" >GA18-02739S: Stochastic Optimization in Economic Processes</a><br>
Continuities
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace
Others
Publication year
2019
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Conference Proceedings. 37th International Conference on Mathematical Methods in Economics 2019
ISBN
978-80-7394-760-6
ISSN
—
e-ISSN
—
Number of pages
6
Pages from-to
338-343
Publisher name
University of South Bohemia in České Budějovice, Faculty of Economics
Place of publication
České Budějovice
Event location
České Budějovice
Event date
Sep 11, 2019
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—