All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Q-Learning: From Discrete to Continuous Representation

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26210%2F04%3APU44552" target="_blank" >RIV/00216305:26210/04:PU44552 - isvavai.cz</a>

  • Alternative codes found

    RIV/61388998:_____/04:00103681

  • Result on the web

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    angličtina

  • Original language name

    Q-Learning: From Discrete to Continuous Representation

  • Original language description

    Q-learning standard algorithm is restricted by using discrete states and actions. In this case Q-function is usually represented as a discrete table of Q-values. Conversion of continuous variables to adequate discrete variables evokes some problems. Problems can be avoided if the continuous algorithm of Q-learning is used. In this paper we discus method, which is used to convert discrete to continuous algorithm. The method used suitable approximator to replace the discrete table. We choose local approxiimator called Locally Weighted Regression (LWR) (Atketson &Moore & Shaal, 1996) from the group of memory based approximators.

  • Czech name

    Modifikace metody Q-učení z diskrétní na spojitou

  • Czech description

    Tento článek pojednává o způsobu převedení standardní metody Q-učení, která je pouze diskrétní, na spojitou. K tomuto účelu je použit jednoduchý lokální aproximátor Lokálně vážená regrese (LVR). Tento aproximátor slouží k převedení diskrétní tabulky Q-hodnot na spojitou Q-funkci.

Classification

  • Type

    J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)

  • CEP classification

    BC - Theory and management systems

  • OECD FORD branch

Result continuities

  • Project

  • Continuities

    Z - Vyzkumny zamer (s odkazem do CEZ)

Others

  • Publication year

    2004

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Name of the periodical

    Elektronika

  • ISSN

    0033-2089

  • e-ISSN

  • Volume of the periodical

    XVL

  • Issue of the periodical within the volume

    8

  • Country of publishing house

    PL - POLAND

  • Number of pages

    3

  • Pages from-to

    12-14

  • UT code for WoS article

  • EID of the result in the Scopus database