All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Unsupervised Word Sense Disambiguation Using Word Embeddings

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F19%3A10405583" target="_blank" >RIV/00216208:11320/19:10405583 - isvavai.cz</a>

  • Result on the web

    <a href="https://fruct.org/publications/fruct25/files/Mor.pdf" target="_blank" >https://fruct.org/publications/fruct25/files/Mor.pdf</a>

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    angličtina

  • Original language name

    Unsupervised Word Sense Disambiguation Using Word Embeddings

  • Original language description

    Word sense disambiguation is the task of assigning the correct sense of a polysemous word in the context in which it appears. In recent years, word embeddings have been applied successfully to many NLP tasks. Thanks to their ability to capture distributional semantics, more recent attention have been focused on utilizing word embeddings to disambiguate words. In this paper, a novel unsupervised method is proposed to disambiguate words from the first language by deploying a trained word embeddings model of the second language using only a bilingual dictionary. While the translated words are useful clues for the disambiguation process, the main idea of this work is to use the information provided by English-translated surrounding words to disambiguate Persian words using trained English word2vec; well-known word embeddings model. Each translation of the polysemous word is compared against word embeddings of translated surrounding words to calculate word similarity scores and the most similar word to vec

  • Czech name

  • Czech description

Classification

  • Type

    D - Article in proceedings

  • CEP classification

  • OECD FORD branch

    10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Result continuities

  • Project

    Result was created during the realization of more than one project. More information in the Projects tab.

  • Continuities

    P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Others

  • Publication year

    2019

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Article name in the collection

    Proceedings of the 25th Conference of Open Innovations Association FRUCT 2019

  • ISBN

    978-952-69244-0-3

  • ISSN

    2305-7254

  • e-ISSN

  • Number of pages

    6

  • Pages from-to

    228-233

  • Publisher name

    Finnish-Russian University Cooperation in Telecommunications

  • Place of publication

    Helsinki, Finland

  • Event location

    Helsinki, Finland

  • Event date

    Nov 5, 2019

  • Type of event by nationality

    WRD - Celosvětová akce

  • UT code for WoS article