Word Formation Analyzer for Czech: Automatic Parent Retrieval and Classification of Word Formation Processes
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F22%3A10456878" target="_blank" >RIV/00216208:11320/22:10456878 - isvavai.cz</a>
Result on the web
<a href="https://verso.is.cuni.cz/pub/verso.fpl?fname=obd_publikace_handle&handle=aPNRh9i34E" target="_blank" >https://verso.is.cuni.cz/pub/verso.fpl?fname=obd_publikace_handle&handle=aPNRh9i34E</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.14712/00326585.019" target="_blank" >10.14712/00326585.019</a>
Alternative languages
Result language
angličtina
Original language name
Word Formation Analyzer for Czech: Automatic Parent Retrieval and Classification of Word Formation Processes
Original language description
We present a deep-learning tool called Word Formation Analyzer for Czech, which, given an input lexeme, automatically retrieves the lemma or lemmas from which the input lexeme was formed. We call this task parent retrieval. Furthermore, based on the number of words in the output sequence and its comparison to the input, the input word is classified into one of three categories: compound, derivative or unmotivated. We call this task word formation classification. In the task of parent retrieval, Word Formation Analyzer for Czech achieved an accuracy of 71%. In word formation classification, the tool achieved an accuracy of 87%.
Czech name
—
Czech description
—
Classification
Type
J<sub>ost</sub> - Miscellaneous article in a specialist periodical
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2022
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
The Prague Bulletin of Mathematical Linguistics
ISSN
0032-6585
e-ISSN
1804-0462
Volume of the periodical
Neuveden
Issue of the periodical within the volume
118
Country of publishing house
CZ - CZECH REPUBLIC
Number of pages
19
Pages from-to
55-73
UT code for WoS article
—
EID of the result in the Scopus database
—