Application of POS Tagging in Machine Translation Evaluation
The result's identifiers
Result code in IS VaVaI
RIV/00216275:25410/16:39902476 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216275%3A25410%2F16%3A39902476)
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Application of POS Tagging in Machine Translation Evaluation
Original language description
The aim of the paper is to present the full process of natural language processing as applied to machine translation from English into Slovak, a representative inflectional language. We concentrate on the data preparation phase for the automatic evaluation of machine translation (MT) through POS tagging. This preparation phase consists of several steps, but only the first step, the creation of the dataset (a parallel corpus), is described in depth. Dataset creation covers the collection of source texts of various styles and genres and the collection of their machine translations. Two machine translation systems are used: the web-based SMT Google Translator API and MT@EC. TreeTagger is used as the morphological analysis tool. The process of dataset creation, which covers not only the creation of parallel corpora but also the creation of an error database of Slovak words with morphological annotation, is analyzed. The main contribution is a novel approach to MT evaluation research based on POS tagging (machine learning methods), used to identify differences between MT output and post-edited MT output. The foundation of the research is the analysis, identification, and classification of errors in machine translation from English into Slovak.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
—
Continuities
S - Specific research at universities
Others
Publication year
2016
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
DIVAI 2016 - 11th International Scientific Conference on Distance Learning in Applied Informatics
ISBN
978-80-7552-249-8
ISSN
2464-7470
e-ISSN
—
Number of pages
9
Pages from-to
471-479
Publisher name
Wolters Kluwer ČR, a. s.
Place of publication
Praha
Event location
Štúrovo
Event date
May 2, 2016
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
000386971000040