LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14410%2F23%3A00133087" target="_blank" >RIV/00216224:14410/23:00133087 - isvavai.cz</a>
Výsledek na webu
<a href="https://www.degruyter.com/document/doi/10.1515/lpp-2023-0016/html" target="_blank" >https://www.degruyter.com/document/doi/10.1515/lpp-2023-0016/html</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1515/lpp-2023-0016" target="_blank" >10.1515/lpp-2023-0016</a>
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications
Popis výsledku v původním jazyce
The goal of the paper is to present a Simplified Offensive Language (SOL) Taxonomy, its application and testing in the Second Annotation Campaign conducted between March-May 2023 on four languages: English, Czech, Lithuanian, and Polish to be verified and located in LLOD. Making reference to the previous Offensive Language taxonomic models proposed mostly by the same COST Action Nexus Linguarum WG 4.1.1 team, the number and variety of the categories underwent the definitional revision, and the present typology was tested in the annotation on the publicly available offensive language datasets of each of the four languages. The results of the annotation are presented and as they are contained within the accepted statistical values on the inter-annotator agreement in the SOL categories and their aspects, we propose this taxonomy as a core ontology which represents the encoding of the supported offensive languages and justify its use on new data in terms of a more universal Linguistic Linked Open Data (LLOD) schema.
Název v anglickém jazyce
LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications
Popis výsledku anglicky
The goal of the paper is to present a Simplified Offensive Language (SOL) Taxonomy, its application and testing in the Second Annotation Campaign conducted between March-May 2023 on four languages: English, Czech, Lithuanian, and Polish to be verified and located in LLOD. Making reference to the previous Offensive Language taxonomic models proposed mostly by the same COST Action Nexus Linguarum WG 4.1.1 team, the number and variety of the categories underwent the definitional revision, and the present typology was tested in the annotation on the publicly available offensive language datasets of each of the four languages. The results of the annotation are presented and as they are contained within the accepted statistical values on the inter-annotator agreement in the SOL categories and their aspects, we propose this taxonomy as a core ontology which represents the encoding of the supported offensive languages and justify its use on new data in terms of a more universal Linguistic Linked Open Data (LLOD) schema.
Klasifikace
Druh
J<sub>SC</sub> - Článek v periodiku v databázi SCOPUS
CEP obor
—
OECD FORD obor
60203 - Linguistics
Návaznosti výsledku
Projekt
—
Návaznosti
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace
Ostatní
Rok uplatnění
2023
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název periodika
Lodz Papers in Pragmatics
ISSN
1895-6106
e-ISSN
1898-4436
Svazek periodika
19
Číslo periodika v rámci svazku
2
Stát vydavatele periodika
DE - Spolková republika Německo
Počet stran výsledku
24
Strana od-do
301-324
Kód UT WoS článku
—
EID výsledku v databázi Scopus
2-s2.0-85180448082