Unsupervised Word Sense Disambiguation Using Word Embeddings
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F19%3A10405583" target="_blank" >RIV/00216208:11320/19:10405583 - isvavai.cz</a>
Result on the web
<a href="https://fruct.org/publications/fruct25/files/Mor.pdf" target="_blank" >https://fruct.org/publications/fruct25/files/Mor.pdf</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Unsupervised Word Sense Disambiguation Using Word Embeddings
Original language description
Word sense disambiguation is the task of assigning the correct sense of a polysemous word in the context in which it appears. In recent years, word embeddings have been applied successfully to many NLP tasks. Thanks to their ability to capture distributional semantics, more recent attention have been focused on utilizing word embeddings to disambiguate words. In this paper, a novel unsupervised method is proposed to disambiguate words from the first language by deploying a trained word embeddings model of the second language using only a bilingual dictionary. While the translated words are useful clues for the disambiguation process, the main idea of this work is to use the information provided by English-translated surrounding words to disambiguate Persian words using trained English word2vec; well-known word embeddings model. Each translation of the polysemous word is compared against word embeddings of translated surrounding words to calculate word similarity scores and the most similar word to vec
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2019
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 25th Conference of Open Innovations Association FRUCT 2019
ISBN
978-952-69244-0-3
ISSN
2305-7254
e-ISSN
—
Number of pages
6
Pages from-to
228-233
Publisher name
Finnish-Russian University Cooperation in Telecommunications
Place of publication
Helsinki, Finland
Event location
Helsinki, Finland
Event date
Nov 5, 2019
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—