Benchmark Dataset for Propaganda Detection in Czech Newspaper Texts
Result identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F19%3A00110579" target="_blank" >RIV/00216224:14330/19:00110579 - isvavai.cz</a>
Result on the web
<a href="https://www.aclweb.org/anthology/R19-1010.pdf" target="_blank" >https://www.aclweb.org/anthology/R19-1010.pdf</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.26615/978-954-452-056-4_010" target="_blank" >10.26615/978-954-452-056-4_010</a>
Alternative languages
Result language
English
Title in original language
Benchmark Dataset for Propaganda Detection in Czech Newspaper Texts
Result description in original language
Propaganda of various pressure groups, ranging from big economies to ideological blocs, is often presented in the form of objective newspaper texts. However, real objectivity is obscured here by the support of imbalanced views and distorted attitudes by means of various manipulative stylistic techniques. In the project Manipulative Propaganda Techniques in the Age of Internet, a new resource for the automatic analysis of stylistic mechanisms for influencing readers’ opinions is being developed. In its current version, the resource consists of 7,494 newspaper articles from four selected Czech digital news servers, annotated for the presence of specific manipulative techniques. In this paper, we present the current state of the annotations and describe the structure of the dataset in detail. We also offer an evaluation of bag-of-words classification algorithms for the annotated manipulative techniques.
Title in English
Benchmark Dataset for Propaganda Detection in Czech Newspaper Texts
Result description in English
Propaganda of various pressure groups, ranging from big economies to ideological blocs, is often presented in the form of objective newspaper texts. However, real objectivity is obscured here by the support of imbalanced views and distorted attitudes by means of various manipulative stylistic techniques. In the project Manipulative Propaganda Techniques in the Age of Internet, a new resource for the automatic analysis of stylistic mechanisms for influencing readers’ opinions is being developed. In its current version, the resource consists of 7,494 newspaper articles from four selected Czech digital news servers, annotated for the presence of specific manipulative techniques. In this paper, we present the current state of the annotations and describe the structure of the dataset in detail. We also offer an evaluation of bag-of-words classification algorithms for the annotated manipulative techniques.
Classification
Type
D - Article in proceedings
CEP field
—
OECD FORD field
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
S - Specific university research
Others
Publication year
2019
Data confidentiality code
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific to the result type
Article name in the proceedings
Proceedings of Recent Advances in Natural Language Processing, RANLP 2019
ISBN
9789544520557
ISSN
1313-8502
e-ISSN
2603-2813
Number of pages
7
Pages from-to
77-83
Publisher name
INCOMA Ltd.
Place of publication
Varna, Bulgaria
Event location
Varna, Bulgaria
Event date
1. 1. 2019
Event type by nationality of participants
WRD - Worldwide event
UT WoS code of the article
—