An integrated explicit and implicit offensive language taxonomy
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3AGMCVCV8M" target="_blank" >RIV/00216208:11320/23:GMCVCV8M - isvavai.cz</a>
Result on the web
<a href="https://www.degruyter.com/document/doi/10.1515/lpp-2023-0002/html?casa_token=_Ejp2AFEf-IAAAAA%3AEZd-DnTZnhT3ccIlkq7_gzJQqu6WEkvFqCdFrGUsOzxNBybnu-VnvRsrc37mzTUEUAxR_wR_S6q_BE0" target="_blank" >https://www.degruyter.com/document/doi/10.1515/lpp-2023-0002/html?casa_token=_Ejp2AFEf-IAAAAA%3AEZd-DnTZnhT3ccIlkq7_gzJQqu6WEkvFqCdFrGUsOzxNBybnu-VnvRsrc37mzTUEUAxR_wR_S6q_BE0</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1515/lpp-2023-0002" target="_blank" >10.1515/lpp-2023-0002</a>
Alternative languages
Result language
angličtina
Original language name
An integrated explicit and implicit offensive language taxonomy
Original language description
"The current study represents an integrated model of explicit and implicit offensive language taxonomy. First, it focuses on a definitional revision and enrichment of the explicit offensive language taxonomy by reviewing the collection of available corpora and comparing tagging schemas applied there. The study relies mainly on the categories originally proposed by Zampieri et al. (2019) in terms of offensive language categorization schemata. After the explanation of semantic differences between particular concepts used in the tagging systems and the analysis of theoretical frameworks, a finite set of classes is presented, which cover aspects of offensive language representation along with linguistically sound explanations (Lewandowska-Tomaszczyk et al. 2021). In the analytic procedure, offensive from non-offensive discourse is first distinguished, with the question of offence Target and the following categorization levels and sublevels. Based on the relevant data generated from Sketch Engine (https://www.sketchengine.eu/ententen-english-corpus/), we propose the concept of offensive language as a superordinate category in our system with a number of hierarchically arranged 17 subcategories. The categories are taxonomically structured into 4 levels and verified with the use of neural-based (lexical) embeddings. Together with a taxonomy of implicit offensive language and its subcategorization levels which has received little scholarly attention until now, the categorization is exemplified in samples of offensive discourses in selected English social media materials, i.e., publicly available 25 web-based hate speech datasets (consult Appendix 1 for a complete list). The offensive category levels (types of offence, targets, etc.) and aspects (offensive language property clusters) as well as the categories of explicitness and implicitness are discussed in the study and the computationally verified integrated explicit and implicit offensive language taxonomy proposed in the study."
Czech name
—
Czech description
—
Classification
Type
J<sub>ost</sub> - Miscellaneous article in a specialist periodical
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
—
Others
Publication year
2023
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
"Lodz Papers in Pragmatics"
ISSN
1898-4436
e-ISSN
—
Volume of the periodical
19
Issue of the periodical within the volume
1
Country of publishing house
US - UNITED STATES
Number of pages
42
Pages from-to
7-48
UT code for WoS article
—
EID of the result in the Scopus database
—