Speech-level Sentiment Analysis of Parliamentary Debates using Lexicon-based Approaches
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F22%3ATPQF87C8" target="_blank" >RIV/00216208:11320/22:TPQF87C8 - isvavai.cz</a>
Result on the web
<a href="https://nl.ijs.si/jtdh22/pdf/JTDH2022_Meden_Speech-level-Sentiment-Analysis-of-Parliamentary-Debates-using-Lexicon-based-Approaches.pdf" target="_blank" >https://nl.ijs.si/jtdh22/pdf/JTDH2022_Meden_Speech-level-Sentiment-Analysis-of-Parliamentary-Debates-using-Lexicon-based-Approaches.pdf</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Speech-level Sentiment Analysis of Parliamentary Debates using Lexicon-based Approaches
Original language description
Sentiment analysis or Opinion mining is a widely studied research area in the field of Natural Language Processing (NLP) that involves the identification of polarity (positive, negative or neutral sentiments) of the text, usually done on shorter and emotionally charged text, such as tweets and reviews. Parliamentary debates feature longer paragraphs and a very esoteric speaking style of Members of the Parliament (MPs), making them much more complex. The aim of the paper was to explore how and if lexicon-based approaches can handle the extraction of polarity from parliamentary debates, using the sentiment lexicon VADER (Valence Aware Dictionary and sEntiment Reasoner) and the Liu Hu sentiment lexicon. We performed sentiment analysis with both lexicons, together with topic modelling of positive and negative speeches to gain additional insight into the data. Lastly, we measured the performance of both lexicons, where both performed poorly. Results showed that while both VADER and Liu Hu were able to correctly identify the general sentiment of some topics (i.e., matching positive/negative keywords to positive/negative topics), most speeches themselves are very polarizing in nature, shifting perspectives multiple times. Sentiment lexicons failed to recognise the sentiment in parliamentary speeches that might not be extremely expressive or where a larger sum of intensity-boosting positive words are used to express negativity. We conclude that using lexicon-based approaches (such as VADER and Liu Hu) in their unaltered states alone do not suffice when dealing with data like parliamentary debates, at least not without any modification of lexicons.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
—
Others
Publication year
2022
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Conference on Language Technologies & Digital Humanities
ISBN
978-961-7104-20-2
ISSN
—
e-ISSN
—
Number of pages
8
Pages from-to
323-330
Publisher name
Inštitut za novejšo zgodovino
Place of publication
—
Event location
Ljubljana, Slovenia
Event date
Jan 1, 2022
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—