LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications
Result identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3AIZJUEJX9" target="_blank" >RIV/00216208:11320/23:IZJUEJX9 - isvavai.cz</a>
Result on the web
<a href="https://www.degruyter.com/document/doi/10.1515/lpp-2023-0016/html" target="_blank" >https://www.degruyter.com/document/doi/10.1515/lpp-2023-0016/html</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1515/lpp-2023-0016" target="_blank" >10.1515/lpp-2023-0016</a>
Alternative languages
Result language
English
Title in the original language
LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications
Result description in the original language
"The goal of the paper is to present a Simplified Offensive Language (SOL) Taxonomy, its application and testing in the Second Annotation Campaign conducted between March-May 2023 on four languages: English, Czech, Lithuanian, and Polish to be verified and located in LLOD. Making reference to the previous Offensive Language taxonomic models proposed mostly by the same COST Action Nexus Linguarum WG 4.1.1 team, the number and variety of the categories underwent the definitional revision, and the present typology was tested in the annotation on the publicly available offensive language datasets of each of the four languages. The results of the annotation are presented and as they are contained within the accepted statistical values on the inter-annotator agreement in the SOL categories and their aspects, we propose this taxonomy as a core ontology which represents the encoding of the supported offensive languages and justify its use on new data in terms of a more universal Linguistic Linked Open Data (LLOD) schema."
Title in English
LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications
Result description in English
"The goal of the paper is to present a Simplified Offensive Language (SOL) Taxonomy, its application and testing in the Second Annotation Campaign conducted between March-May 2023 on four languages: English, Czech, Lithuanian, and Polish to be verified and located in LLOD. Making reference to the previous Offensive Language taxonomic models proposed mostly by the same COST Action Nexus Linguarum WG 4.1.1 team, the number and variety of the categories underwent the definitional revision, and the present typology was tested in the annotation on the publicly available offensive language datasets of each of the four languages. The results of the annotation are presented and as they are contained within the accepted statistical values on the inter-annotator agreement in the SOL categories and their aspects, we propose this taxonomy as a core ontology which represents the encoding of the supported offensive languages and justify its use on new data in terms of a more universal Linguistic Linked Open Data (LLOD) schema."
Classification
Type
J<sub>ost</sub> - Other articles in peer-reviewed periodicals
CEP field
—
OECD FORD field
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result links
Project
—
Links
—
Other
Year of implementation
2023
Data confidentiality code
S - Complete and accurate project data are not subject to protection under special legal regulations
Data specific to the result type
Periodical name
"Lodz Papers in Pragmatics"
ISSN
1898-4436
e-ISSN
—
Periodical volume
19
Periodical issue within the volume
2
Country of the periodical's publisher
US - United States of America
Number of pages of the result
24
Pages from-to
301-324
UT WoS code of the article
—
EID of the result in the Scopus database
—