Comparison of various approaches to tagging for the inflectional Slovak language
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F25%3AQU27UJUV" target="_blank" >RIV/00216208:11320/25:QU27UJUV - isvavai.cz</a>
Alternative codes found
RIV/00216275:25410/24:39922254
Result on the web
<a href="https://www.scopus.com/inward/record.uri?eid=2-s2.0-85196088719&doi=10.7717%2fPEERJ-CS.2026&partnerID=40&md5=d86c5ff910c37031dd275ab7f425af41" target="_blank" >https://www.scopus.com/inward/record.uri?eid=2-s2.0-85196088719&doi=10.7717%2fPEERJ-CS.2026&partnerID=40&md5=d86c5ff910c37031dd275ab7f425af41</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.7717/PEERJ-CS.2026" target="_blank" >10.7717/PEERJ-CS.2026</a>
Alternative languages
Result language
English
Original language name
Comparison of various approaches to tagging for the inflectional Slovak language
Original language description
Morphological tagging provides essential insights into the grammar, structure, and mutual relationships of words within a sentence. Tagging text in a highly inflectional language is challenging because of word ambiguity. This research compares six automatic taggers for the inflectional Slovak language, seeking the most accurate tagger for literary and non-literary texts. Our results indicate that it is useful to separate texts into literary and non-literary and then to choose a tagger based on the text style. For literary texts, UDPipe2 outperformed the others in seven out of nine examined tagset positions. Conversely, for non-literary texts, RNNTagger exhibited the highest performance in eight out of nine examined tagset positions. RNNTagger is recommended for both types of text because it best captures the inflection of the Slovak language, although UDPipe2 demonstrates higher accuracy for literary texts. Despite dataset size limitations, this study emphasizes the suitability of various taggers for inflectional languages like Slovak. © Copyright 2024 Benko et al.
Czech name
—
Czech description
—
Classification
Type
J<sub>SC</sub> - Article in a specialist periodical, which is included in the SCOPUS database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
—
Others
Publication year
2024
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Name of the periodical
PeerJ Computer Science
ISSN
2376-5992
e-ISSN
—
Volume of the periodical
10
Issue of the periodical within the volume
2024
Country of publishing house
US - UNITED STATES
Number of pages
31
Pages from-to
1-31
UT code for WoS article
—
EID of the result in the Scopus database
2-s2.0-85196088719