Federated Reinforcement Learning for Collective Navigation of Robotic Swarms
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F23%3A00373781" target="_blank" >RIV/68407700:21230/23:00373781 - isvavai.cz</a>
Result on the web
<a href="https://doi.org/10.1109/TCDS.2023.3239815" target="_blank" >https://doi.org/10.1109/TCDS.2023.3239815</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/TCDS.2023.3239815" target="_blank" >10.1109/TCDS.2023.3239815</a>
Alternative languages
Result language
angličtina
Original language name
Federated Reinforcement Learning for Collective Navigation of Robotic Swarms
Original language description
The recent advancement of deep reinforcement learning (DRL) contributed to robotics by allowing automatic controller design. The automatic controller design is a crucial approach for designing swarm robotic systems, which require more complex controllers than a single robot system to lead a desired collective behavior. Although the DRL-based controller design method showed its effectiveness for swarm robotic systems, the reliance on the central training server is a critical problem in real-world environments where robot-server communication is unstable or limited. We propose a novel federated learning (FL)-based DRL training strategy federated learning DDPG (FLDDPG) for use in swarm robotic applications. Through the comparison with baseline strategies under a limited communication bandwidth scenario, it is shown that the FLDDPG method resulted in higher robustness and generalization ability into a different environment and real robots, while the baseline strategies suffer from the limitation of communication bandwidth. This result suggests that the proposed method can benefit swarm robotic systems operating in environments with limited communication bandwidth, e.g., in high radiation, underwater, or subterranean environments.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2023
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
IEEE Transactions on Cognitive and Developmental Systems
ISSN
2379-8920
e-ISSN
2379-8939
Volume of the periodical
15
Issue of the periodical within the volume
4
Country of publishing house
US - UNITED STATES
Number of pages
10
Pages from-to
2122-2131
UT code for WoS article
001126639000051
EID of the result in the Scopus database
2-s2.0-85147295543