Keyword Extraction for Automatic Evaluation of Machine Translation
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F22%3AC3R9FW4Z" target="_blank" >RIV/00216208:11320/22:C3R9FW4Z - isvavai.cz</a>
Výsledek na webu
<a href="https://nlp.fi.muni.cz/raslan/2022/paper3.pdf" target="_blank" >https://nlp.fi.muni.cz/raslan/2022/paper3.pdf</a>
DOI - Digital Object Identifier
—
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
Keyword Extraction for Automatic Evaluation of Machine Translation
Popis výsledku v původním jazyce
Evaluation plays a key role in the field of machine translation. In general, the evaluation of machine translation can be divided into two types, manual (human) and automatic (machine). Professional human translators can understand and evaluate the text with the best results in terms of measuring quality and analyzing errors but on the other hand, this approach brings a number of disadvantages, including high time consumption, the subjectivity of the translator, and the finan-cial costs associated with hiring professional translators. Automatic evaluation approaches are usually based on the correlation between the sentences or n-grams from human translation and machine translation. The aim of this paper is to capture the semantics of human translation from the English language to the Slovak language and the same text translated by ETransL and Deepl translating engines by extracting the keywords which represent the main phrases from doc-uments to determine how much the machine translations differ from the reference human translation. Based on our results the translations are equal from a seman-tic point of view and the end user should understand the text translated by ETransL and Deepl equally as human translation.
Název v anglickém jazyce
Keyword Extraction for Automatic Evaluation of Machine Translation
Popis výsledku anglicky
Evaluation plays a key role in the field of machine translation. In general, the evaluation of machine translation can be divided into two types, manual (human) and automatic (machine). Professional human translators can understand and evaluate the text with the best results in terms of measuring quality and analyzing errors but on the other hand, this approach brings a number of disadvantages, including high time consumption, the subjectivity of the translator, and the finan-cial costs associated with hiring professional translators. Automatic evaluation approaches are usually based on the correlation between the sentences or n-grams from human translation and machine translation. The aim of this paper is to capture the semantics of human translation from the English language to the Slovak language and the same text translated by ETransL and Deepl translating engines by extracting the keywords which represent the main phrases from doc-uments to determine how much the machine translations differ from the reference human translation. Based on our results the translations are equal from a seman-tic point of view and the end user should understand the text translated by ETransL and Deepl equally as human translation.
Klasifikace
Druh
O - Ostatní výsledky
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Návaznosti výsledku
Projekt
—
Návaznosti
—
Ostatní
Rok uplatnění
2022
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů