Solvers for Mathematical Word Problems in Czech
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F20%3A00345998" target="_blank" >RIV/68407700:21230/20:00345998 - isvavai.cz</a>
Result on the web
<a href="http://ceur-ws.org/Vol-2718/paper09.pdf" target="_blank" >http://ceur-ws.org/Vol-2718/paper09.pdf</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Solvers for Mathematical Word Problems in Czech
Original language description
We study the task of an automatic evaluation of mathematical word problems, which belongs to the category of natural language processing and has become popular in recent years. Since all the so far published methods were developed for inputs in English, our goal is to review them and propose solutions able to cope with inputs in the Czech language. We face the question whether we can achieve a competitive accuracy for a natural language with flexible word order, and with the assumption that only a relatively small dataset of training and testing data is available. We propose and evaluate two methods. One relies on a rule-based processing of dependency trees computed by UDPipe. The other method builds on machine learning. It transforms word problems into numeric vectors and trains SVM to classify them. We also show that it improves in a combination with a search for predefined sequences of words and word classes, achieving 75% accuracy on our dataset of 500 Czech word problems.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/GA19-21198S" target="_blank" >GA19-21198S: Complex prediction models and their learning from weakly annotated data</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2020
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 20th Conference Information Technologies - Applications and Theory (ITAT 2020)
ISBN
—
ISSN
1613-0073
e-ISSN
1613-0073
Number of pages
8
Pages from-to
18-25
Publisher name
CEUR Workshop Proceedings
Place of publication
Aachen
Event location
hotel Tyrapol, Oravská Lesná
Event date
Sep 18, 2020
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—