Analysis of Edit Operations for Post-editing Systems
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216275%3A25410%2F21%3A39922775" target="_blank" >RIV/00216275:25410/21:39922775 - isvavai.cz</a>
Result on the web
<a href="https://link.springer.com/article/10.1007/s44196-021-00048-3" target="_blank" >https://link.springer.com/article/10.1007/s44196-021-00048-3</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/s44196-021-00048-3" target="_blank" >10.1007/s44196-021-00048-3</a>
Alternative languages
Result language
English
Original language name
Analysis of Edit Operations for Post-editing Systems
Original language description
Post-editing has become an important part not only of translation research but also of the global translation industry. While computer-aided translation tools, such as translation memories, are considered part of a translator's work, machine translation (MT) systems have lately also been accepted by human translators. However, many human translators are still adopting the changes brought by translation technologies to the translation industry. This paper introduces a novel approach for finding suitable pairs of corresponding n-grams between MT output and post-edited MT output, and for recommending them based on the type of text (manual or administrative) and the MT system used for machine translation. A tool that recommends corrections and speeds up the correction of MT output was developed to help post-editors with their work. It is based on the analysis of words with the same lemmas and on the analysis of n-gram recommendations. These recommendations are extracted from sequence patterns of the mismatched words (MisMatch) between MT output and post-edited MT output. The paper aims to show how morphological analysis can be used to recommend post-edit operations, and it describes the use of mismatched words in the n-gram recommendations for the post-edited MT output. The contribution consists of a methodology for seeking suitable pairs of words and n-grams, and of showing the importance of taking metadata (the type and/or style of the text and the MT system) into account when recommending post-edit operations.
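The idea of extracting mismatched word sequences between MT output and post-edited MT output, and counting them per text type and MT system, can be illustrated with a minimal sketch. This is not the authors' tool: it assumes whitespace tokenization, uses Python's standard `difflib.SequenceMatcher` as a stand-in alignment method, and omits the lemma-based morphological analysis the description mentions. The function names `mismatched_ngrams` and `recommend`, the record keys, and the sample sentences are all hypothetical.

```python
from collections import Counter
from difflib import SequenceMatcher


def mismatched_ngrams(mt: str, pe: str) -> list[tuple[str, str]]:
    """Return (MT n-gram, post-edited n-gram) pairs where the two outputs differ.

    Assumption: tokens are whitespace-separated words; no lemmatization is done.
    """
    mt_tokens, pe_tokens = mt.split(), pe.split()
    matcher = SequenceMatcher(a=mt_tokens, b=pe_tokens, autojunk=False)
    pairs = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":  # replace, delete, or insert
            pairs.append((" ".join(mt_tokens[i1:i2]), " ".join(pe_tokens[j1:j2])))
    return pairs


def recommend(corpus: list[dict], text_type: str, mt_system: str, top: int = 3):
    """Count mismatch pairs only among records sharing the given metadata."""
    counts = Counter()
    for record in corpus:
        if record["text_type"] == text_type and record["mt_system"] == mt_system:
            counts.update(mismatched_ngrams(record["mt"], record["pe"]))
    return counts.most_common(top)


# Hypothetical sample data, invented purely for illustration.
corpus = [
    {"text_type": "manual", "mt_system": "A",
     "mt": "the printer are ready", "pe": "the printer is ready"},
    {"text_type": "manual", "mt_system": "A",
     "mt": "the device are ready", "pe": "the device is ready"},
    {"text_type": "administrative", "mt_system": "B",
     "mt": "open a file", "pe": "open the file"},
]

print(recommend(corpus, "manual", "A"))
```

Filtering by `text_type` and `mt_system` before counting reflects the description's point that metadata should be taken into account when recommending post-edit operations; a real system would presumably compare lemmas rather than surface forms.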
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/GA19-15498S" target="_blank" >GA19-15498S: Modelling emotions in verbal and nonverbal managerial communication to predict corporate financial risk</a>
Continuities
P - Research and development project financed from public funds (with a link to CEP)
Others
Publication year
2021
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Name of the periodical
International Journal of Computational Intelligence Systems
ISSN
1875-6891
e-ISSN
1875-6883
Volume of the periodical
14
Issue of the periodical within the volume
1
Country of publishing house
FR - FRANCE
Number of pages
12
Pages from-to
197
UT code for WoS article
000778406100001
EID of the result in the Scopus database
2-s2.0-85119997372