Baselines for Reinforcement Learning in Text Games
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F18%3A10387543" target="_blank" >RIV/00216208:11320/18:10387543 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1109/ICTAI.2018.00058" target="_blank" >http://dx.doi.org/10.1109/ICTAI.2018.00058</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/ICTAI.2018.00058" target="_blank" >10.1109/ICTAI.2018.00058</a>
Alternative languages
Result language
English
Original language name
Baselines for Reinforcement Learning in Text Games
Original language description
The ability to learn optimal control policies in systems where the action space is defined by sentences in natural language would allow many interesting real-world applications such as automatic optimisation of dialogue systems. Text-based games with multiple endings and rewards are a promising platform for this task, since their feedback allows us to employ reinforcement learning techniques to jointly learn text representations and control policies. We argue that the key property of AI agents, especially in the text-games context, is their ability to generalise to previously unseen games. We present a minimalistic text-game playing agent, testing its generalisation and transfer learning performance and showing its ability to play multiple games at once. We also present pyfiction, an open-source library for universal access to different text games that could, together with our agent that implements its interface, serve as a baseline for future research.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/GJ17-17125Y" target="_blank" >GJ17-17125Y: Balancing Deliberative and Reactive Behaviour of Intelligent Agents</a>
Continuities
P - Research and development project financed from public funds (with a link to CEP)
Others
Publication year
2018
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Proceedings of ICTAI 2018 : International Conference on Tools with Artificial Intelligence
ISBN
978-1-5386-7449-9
ISSN
1082-3409
e-ISSN
not stated
Number of pages
8
Pages from-to
320-327
Publisher name
IEEE
Place of publication
Volos, Greece
Event location
Volos, Greece
Event date
Nov 5, 2018
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
000457750200048