Benchmark Dataset for Propaganda Detection in Czech Newspaper Texts
The result's identifiers
Result code in IS VaVaI
RIV/00216224:14330/19:00110579 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F19%3A00110579)
Result on the web
https://www.aclweb.org/anthology/R19-1010.pdf
DOI - Digital Object Identifier
10.26615/978-954-452-056-4_010 (http://dx.doi.org/10.26615/978-954-452-056-4_010)
Alternative languages
Result language
English
Original language name
Benchmark Dataset for Propaganda Detection in Czech Newspaper Texts
Original language description
Propaganda of various pressure groups, ranging from big economies to ideological blocs, is often presented in the form of objective newspaper texts. However, their actual objectivity is compromised by support for imbalanced views and distorted attitudes, conveyed through various manipulative stylistic techniques. In the project Manipulative Propaganda Techniques in the Age of Internet, a new resource for the automatic analysis of stylistic mechanisms for influencing the readers’ opinion is being developed. In its current version, the resource consists of 7,494 newspaper articles from four selected Czech digital news servers, annotated for the presence of specific manipulative techniques. In this paper, we present the current state of the annotations and describe the structure of the dataset in detail. We also offer an evaluation of bag-of-words classification algorithms for the annotated manipulative techniques.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
S - Specific university research
Others
Publication year
2019
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Proceedings of Recent Advances in Natural Language Processing, RANLP 2019
ISBN
9789544520557
ISSN
1313-8502
e-ISSN
2603-2813
Number of pages
7
Pages from-to
77-83
Publisher name
INCOMA Ltd.
Place of publication
Varna, Bulgaria
Event location
Varna, Bulgaria
Event date
Jan 1, 2019
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—