Reinforcement Learning-based Aggregation for Robot Swarms
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F24%3A00376056" target="_blank" >RIV/68407700:21230/24:00376056 - isvavai.cz</a>
Result on the web
<a href="https://doi.org/10.1177/10597123231202593" target="_blank" >https://doi.org/10.1177/10597123231202593</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1177/10597123231202593" target="_blank" >10.1177/10597123231202593</a>
Alternative languages
Result language
angličtina
Original language name
Reinforcement Learning-based Aggregation for Robot Swarms
Original language description
Aggregation, the gathering of individuals into a single group as observed in animals such as birds, bees, and amoeba, is known to provide protection against predators or resistance to adverse environmental conditions for the whole. Cue-based aggregation, where environmental cues determine the location of aggregation, is known to be challenging when the swarm density is low. Here, we propose a novel aggregation method applicable to real robots in low-density swarms. Previously, Landmark-Based Aggregation (LBA) method had used odometric dead-reckoning coupled with visual landmarks and yielded better aggregation in low-density swarms. However, the method's performance was affected adversely by odometry drift, jeopardizing its application in real-world scenarios. In this article, a novel Reinforcement Learning-based Aggregation method, RLA, is proposed to increase aggregation robustness, thus making aggregation possible for real robots in low-density swarm settings. Systematic experiments conducted in a kinematic-based simulator and on real robots have shown that the RLA method yielded larger aggregates, is more robust to odometry noise than the LBA method, and adapts better to environmental changes while not being sensitive to parameter tuning, making it better deployable under real-world conditions.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/EF16_019%2F0000765" target="_blank" >EF16_019/0000765: Research Center for Informatics</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2024
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Adaptive Behavior
ISSN
1059-7123
e-ISSN
1741-2633
Volume of the periodical
32
Issue of the periodical within the volume
3
Country of publishing house
GB - UNITED KINGDOM
Number of pages
17
Pages from-to
265-281
UT code for WoS article
001066117700001
EID of the result in the Scopus database
2-s2.0-85171266829