Control of Magnetic Manipulator Using Reinforcement Learning Based on Incrementally Adapted Local Linear Models
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21730%2F21%3A00353630" target="_blank" >RIV/68407700:21730/21:00353630 - isvavai.cz</a>
Alternative codes found
RIV/00216305:26210/21:PU144837
Result on the web
<a href="https://doi.org/10.1155/2021/6617309" target="_blank" >https://doi.org/10.1155/2021/6617309</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1155/2021/6617309" target="_blank" >10.1155/2021/6617309</a>
Alternative languages
Result language
angličtina
Original language name
Control of Magnetic Manipulator Using Reinforcement Learning Based on Incrementally Adapted Local Linear Models
Original language description
Reinforcement learning (RL) agents can learn to control a nonlinear system without using a model of the system. However, having a model brings benefits, mainly in terms of a reduced number of unsuccessful trials before achieving acceptable control performance. Several modelling approaches have been used in the RL domain, such as neural networks, local linear regression, or Gaussian processes. In this article, we focus on techniques that have not been used much so far: symbolic regression (SR), based on genetic programming and local modelling. Using measured data, symbolic regression yields a nonlinear, continuous-time analytic model. We benchmark two state-of-the-art methods, SNGP (single-node genetic programming) and MGGP (multigene genetic programming), against a standard incremental local regression method called RFWR (receptive field weighted regression). We have introduced modifications to the RFWR algorithm to better suit the low-dimensional continuous-time systems we are mostly dealing with. The benchmark is a nonlinear, dynamic magnetic manipulation system. The results show that using the RL framework and a suitable approximation method, it is possible to design a stable controller of such a complex system without the necessity of any haphazard learning. While all of the approximation methods were successful, MGGP achieved the best results at the cost of higher computational complexity. Index Terms–AI-based methods, local linear regression, nonlinear systems, magnetic manipulation, model learning for control, optimal control, reinforcement learning, symbolic regression.
Czech name
—
Czech description
—
Classification
Type
J<sub>SC</sub> - Article in a specialist periodical, which is included in the SCOPUS database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace
Others
Publication year
2021
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Complexity
ISSN
1076-2787
e-ISSN
1099-0526
Volume of the periodical
2021
Issue of the periodical within the volume
December
Country of publishing house
GB - UNITED KINGDOM
Number of pages
12
Pages from-to
—
UT code for WoS article
—
EID of the result in the Scopus database
2-s2.0-85122754808