Q-Learning: From Discrete to Continuous Representation
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26210%2F04%3APU44552" target="_blank" >RIV/00216305:26210/04:PU44552 - isvavai.cz</a>
Alternative codes found
RIV/61388998:_____/04:00103681
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Q-Learning: From Discrete to Continuous Representation
Original language description
The standard Q-learning algorithm is restricted to discrete states and actions. In this case the Q-function is usually represented as a discrete table of Q-values. Converting continuous variables to adequate discrete variables causes some problems. These problems can be avoided if a continuous version of the Q-learning algorithm is used. In this paper we discuss a method for converting the discrete algorithm to a continuous one. The method uses a suitable approximator to replace the discrete table. We choose a local approximator called Locally Weighted Regression (LWR) (Atkeson, Moore & Schaal, 1996) from the group of memory-based approximators.
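The idea in the description, replacing the Q-table with a memory-based local approximator, can be illustrated with a minimal Python sketch. It is not the paper's implementation. It assumes a one-dimensional continuous state, a small finite set of actions, a Gaussian kernel, and a memory that stores every visited sample. The names `lwr_predict` and `LWRQFunction` and the parameters `alpha`, `gamma` and `bandwidth` are hypothetical.

```python
import math


def lwr_predict(xs, ys, query, bandwidth=0.5):
    """Locally weighted linear regression at one 1-D query point."""
    if not xs:
        return 0.0  # assumption: an empty memory yields Q = 0
    # Gaussian kernel: stored samples near the query get larger weights.
    ws = [math.exp(-((x - query) ** 2) / (2.0 * bandwidth ** 2)) for x in xs]
    sw = sum(ws)
    if sw < 1e-12:
        return 0.0  # assumption: no sample is close enough to matter
    mx = sum(w * x for w, x in zip(ws, xs)) / sw
    my = sum(w * y for w, y in zip(ws, ys)) / sw
    var = sum(w * (x - mx) ** 2 for w, x in zip(ws, xs))
    if var < 1e-12:
        return my  # degenerate fit: fall back to the weighted mean
    cov = sum(w * (x - mx) * (y - my) for w, x, y in zip(ws, xs, ys))
    # Evaluate the locally fitted line at the query point.
    return my + (cov / var) * (query - mx)


class LWRQFunction:
    """Q-function kept as stored samples instead of a discrete table."""

    def __init__(self, actions, alpha=0.5, gamma=0.9, bandwidth=0.5):
        # One memory of (states, Q-values) per action (an assumption).
        self.memory = {a: ([], []) for a in actions}
        self.alpha = alpha
        self.gamma = gamma
        self.bandwidth = bandwidth

    def q(self, state, action):
        xs, ys = self.memory[action]
        return lwr_predict(xs, ys, state, self.bandwidth)

    def update(self, state, action, reward, next_state):
        # Standard Q-learning target, with LWR queries replacing table lookups.
        best_next = max(self.q(next_state, b) for b in self.memory)
        target = reward + self.gamma * best_next
        old = self.q(state, action)
        xs, ys = self.memory[action]
        xs.append(state)
        ys.append(old + self.alpha * (target - old))
```

A call such as `LWRQFunction([0, 1]).update(0.0, 1, 1.0, 0.1)` stores one sample, and later queries near state 0.0 for action 1 are interpolated from it. The bandwidth controls how far that sample's influence extends.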
Czech name
Modifikace metody Q-učení z diskrétní na spojitou
Czech description
Tento článek pojednává o způsobu převedení standardní metody Q-učení, která je pouze diskrétní, na spojitou. K tomuto účelu je použit jednoduchý lokální aproximátor Lokálně vážená regrese (LVR). Tento aproximátor slouží k převedení diskrétní tabulky Q-hodnot na spojitou Q-funkci.
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
BC - Theory and management systems
OECD FORD branch
—
Result continuities
Project
—
Continuities
Z - Research plan (with a link to CEZ)
Others
Publication year
2004
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Name of the periodical
Elektronika
ISSN
0033-2089
e-ISSN
—
Volume of the periodical
XVL
Issue of the periodical within the volume
8
Country of publishing house
PL - POLAND
Number of pages
3
Pages from-to
12-14
UT code for WoS article
—
EID of the result in the Scopus database
—