All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Data-driven Construction of Symbolic Process Models for Reinforcement Learning

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F18%3A00324236" target="_blank" >RIV/68407700:21230/18:00324236 - isvavai.cz</a>

  • Alternative codes found

    RIV/68407700:21730/18:00324236

  • Result on the web

    <a href="http://dx.doi.org/10.1109/ICRA.2018.8461182" target="_blank" >http://dx.doi.org/10.1109/ICRA.2018.8461182</a>

  • DOI - Digital Object Identifier

    <a href="http://dx.doi.org/10.1109/ICRA.2018.8461182" target="_blank" >10.1109/ICRA.2018.8461182</a>

Alternative languages

  • Result language

    angličtina

  • Original language name

    Data-driven Construction of Symbolic Process Models for Reinforcement Learning

  • Original language description

    Reinforcement learning (RL) is a suitable approach for controlling systems with unknown or time-varying dynamics. RL in principle does not require a model of the system, but before it learns an acceptable policy, it needs many unsuccessful trials, which real robots usually cannot withstand. It is well known that RL can be sped up and made safer by using models learned online. In this paper, we propose to use symbolic regression to construct compact, parsimonious models described by analytic equations, which are suitable for real-time robot control. Single node genetic programming (SNGP) is employed as a tool to automatically search for equations fitting the available data. We demonstrate the approach on two benchmark examples: a simulated mobile robot and the pendulum swing-up problem; the latter both in simulations and real-time experiments. The results show that through this approach we can find accurate models even for small batches of training data. Based on the symbolic model found, RL can control the system well.

  • Czech name

  • Czech description

Classification

  • Type

    D - Article in proceedings

  • CEP classification

  • OECD FORD branch

    20204 - Robotics and automatic control

Result continuities

  • Project

    Result was created during the realization of more than one project. More information in the Projects tab.

  • Continuities

    P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach

Others

  • Publication year

    2018

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Article name in the collection

    Proceedings of the 2018 IEEE International Conference on Robotics and Automation

  • ISBN

    978-1-5386-3081-5

  • ISSN

    1050-4729

  • e-ISSN

  • Number of pages

    8

  • Pages from-to

    5105-5112

  • Publisher name

    IEEE

  • Place of publication

    Piscataway, NJ

  • Event location

    Brisbane

  • Event date

    May 21, 2018

  • Type of event by nationality

    WRD - Celosvětová akce

  • UT code for WoS article

    000446394503126