Approximate Dynamic Programming Based on High Dimensional Model Representation

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F67985556%3A_____%2F13%3A00399560" target="_blank" >RIV/67985556:_____/13:00399560 - isvavai.cz</a>
Výsledek na webu
—
DOI - Digital Object Identifier
—

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Approximate Dynamic Programming Based on High Dimensional Model Representation
Popis výsledku v původním jazyce
This article introduces an algorithm for implicit High Dimensional Model Representation (HDMR) of the Bellman equation. This approximation technique reduces memory demands of the algorithm considerably. Moreover, we show that HDMR enables fast approximate min- imization which is essential for evaluation of the Bellman function. In each time step, the problem of parametrized HDMR minimization is relaxed into trust region problems, all sharing the same matrix. Finding its eigenvalue decomposition, we effectively achieve estimates of all minima. Their full-domain representation is avoided by HDMR and then the same approach is used recursively in the next time step. An illustrative example of N-armed bandit problem is included. We assume that the newly established connection between approximate HDMR minimization and the trust region problem can be beneficial also to many other applications.
Název v anglickém jazyce
Approximate Dynamic Programming Based on High Dimensional Model Representation
Popis výsledku anglicky
This article introduces an algorithm for implicit High Dimensional Model Representation (HDMR) of the Bellman equation. This approximation technique reduces memory demands of the algorithm considerably. Moreover, we show that HDMR enables fast approximate min- imization which is essential for evaluation of the Bellman function. In each time step, the problem of parametrized HDMR minimization is relaxed into trust region problems, all sharing the same matrix. Finding its eigenvalue decomposition, we effectively achieve estimates of all minima. Their full-domain representation is avoided by HDMR and then the same approach is used recursively in the next time step. An illustrative example of N-armed bandit problem is included. We assume that the newly established connection between approximate HDMR minimization and the trust region problem can be beneficial also to many other applications.

Klasifikace

Druh
J<sub>x</sub> - Nezařazeno - Článek v odborném periodiku (Jimp, Jsc a Jost)
CEP obor
BC - Teorie a systémy řízení
OECD FORD obor
—

Návaznosti výsledku

Projekt
<a href="/cs/project/GAP102%2F11%2F0437" target="_blank" >GAP102/11/0437: Regulace a identifikace parametrů střídavých elektrických pohonů v kritických provozních stavech</a><br>
Návaznosti
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace

Ostatní

Rok uplatnění
2013
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název periodika
Kybernetika
ISSN
0023-5954
e-ISSN
—
Svazek periodika
49
Číslo periodika v rámci svazku
5
Stát vydavatele periodika
CZ - Česká republika
Počet stran výsledku
18
Strana od-do
720-737
Kód UT WoS článku
000328665200004
EID výsledku v databázi Scopus
—

Podobné výsledky(10)

Approximation Methods for Optimal Active Fault Detection Minimizing Expected Termination Time in One-Counter Markov Decision Processes Interval Trust Representation

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Approximate Dynamic Programming Based on High Dimensional Model Representation

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)