All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Sentiment Analysis of Tweets using Unsupervised Learning Techniques and the K-Means Algorithm

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F22%3AKYX4R37R" target="_blank" >RIV/00216208:11320/22:KYX4R37R - isvavai.cz</a>

  • Result on the web

    <a href="https://repositorio.uwiener.edu.pe/handle/20.500.13053/7150" target="_blank" >https://repositorio.uwiener.edu.pe/handle/20.500.13053/7150</a>

  • DOI - Digital Object Identifier

    <a href="http://dx.doi.org/10.14569/IJACSA.2022.0130669" target="_blank" >10.14569/IJACSA.2022.0130669</a>

Alternative languages

  • Result language

    angličtina

  • Original language name

    Sentiment Analysis of Tweets using Unsupervised Learning Techniques and the K-Means Algorithm

  • Original language description

    Today, web content such as images, text, speeches, and videos are user-generated, and social networks have become increasingly popular as a means for people to share their ideas and opinions. One of the most popular social media for expressing their feelings towards events that occur is Twitter. The main objective of this study is to classify and analyze the content of the affiliates of the Pension and Funds Administration (AFP) published on Twitter. This study incorporates machine learning techniques for data mining, cleaning, tokenization, exploratory analysis, classification, and sentiment analysis. To apply the study and examine the data, Twitter was used with the hashtag #afp, followed by descriptive and exploratory analysis, including metrics of the tweets. Finally, a content analysis was carried out, including word frequency calculation, lemmatization, and classification of words by sentiment, emotions, and word cloud. The study uses tweets published in the month of May 2022. Sentiment distribution was also performed in three polarity classes: positive, neutral, and negative, representing 22%, 4%, and 74% respectively. Supported by the unsupervised learning method and the K-Means algorithm, we were able to determine the number of clusters using the elbow method. Finally, the sentiment analysis and the clusters formed indicate that there is a very pronounced dispersion, the distances are not very similar, even though the data standardization work was carried out.

  • Czech name

  • Czech description

Classification

  • Type

    J<sub>SC</sub> - Article in a specialist periodical, which is included in the SCOPUS database

  • CEP classification

  • OECD FORD branch

    10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Result continuities

  • Project

  • Continuities

Others

  • Publication year

    2022

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Name of the periodical

    International Journal of Advanced Computer Science and Applications

  • ISSN

    2158-107X

  • e-ISSN

    2156-5570

  • Volume of the periodical

    13

  • Issue of the periodical within the volume

    6

  • Country of publishing house

    US - UNITED STATES

  • Number of pages

    8

  • Pages from-to

    571-578

  • UT code for WoS article

  • EID of the result in the Scopus database

    2-s2.0-85133369433