Gaussian Process Based Model-free Control with Q-Learning

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F19%3A00335629" target="_blank" >RIV/68407700:21230/19:00335629 - isvavai.cz</a>
Výsledek na webu
<a href="https://doi.org/10.1016/j.ifacol.2019.09.147" target="_blank" >https://doi.org/10.1016/j.ifacol.2019.09.147</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1016/j.ifacol.2019.09.147" target="_blank" >10.1016/j.ifacol.2019.09.147</a>

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Gaussian Process Based Model-free Control with Q-Learning
Popis výsledku v původním jazyce
The aim of this paper is to demonstrate a new algorithm for Machine Learning (ML) based on Gaussian Process Regression (GPR) and how it can be used as a practical control design technique. An optimized control law for a nonlinear process is found directly by training the algorithm on noisy data collected from the process when controlled by a suboptimal controller. A simplified nonlinear Fan Coil Unit (FCU) model is used as an example for which the fan speed control is designed using the off-policy Q -learning algorithm. Additionally, the algorithm properties are discussed, i.e. learning process robustness, Gaussian Process (GP) kernel functions choice. The simulation results are compared to a simple PI design based on a linearized model. (C) 2019, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.
Název v anglickém jazyce
Gaussian Process Based Model-free Control with Q-Learning
Popis výsledku anglicky
The aim of this paper is to demonstrate a new algorithm for Machine Learning (ML) based on Gaussian Process Regression (GPR) and how it can be used as a practical control design technique. An optimized control law for a nonlinear process is found directly by training the algorithm on noisy data collected from the process when controlled by a suboptimal controller. A simplified nonlinear Fan Coil Unit (FCU) model is used as an example for which the fan speed control is designed using the off-policy Q -learning algorithm. Additionally, the algorithm properties are discussed, i.e. learning process robustness, Gaussian Process (GP) kernel functions choice. The simulation results are compared to a simple PI design based on a linearized model. (C) 2019, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.

Klasifikace

Druh
D - Stať ve sborníku
CEP obor
—
OECD FORD obor
20205 - Automation and control systems

Návaznosti výsledku

Projekt
<a href="/cs/project/GA18-26278S" target="_blank" >GA18-26278S: Zahrnutí apriorní informace při identifikaci nelineárních systémů</a><br>
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Ostatní

Rok uplatnění
2019
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název statě ve sborníku
IFAC PAPERSONLINE
ISBN
—
ISSN
2405-8963
e-ISSN
2405-8963
Počet stran výsledku
8
Strana od-do
236-243
Název nakladatele
IFAC
Místo vydání
Laxenburg
Místo konání akce
Belfast
Datum konání akce
21. 8. 2019
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
000493064700041

Podobné výsledky(10)

Control of Magnetic Manipulator Using Reinforcement Learning Based on Incrementally Adapted Local Linear Models Functional Dual Adaptive Control with Recursive Gaussian Process Model Nelineární predikující regulátor s lokálně váženým aproximátorem pro řížení aktivního magnetického ložiska

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Gaussian Process Based Model-free Control with Q-Learning

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)