Cross-Lingual Semantic Role Labeling with High-Quality Translated Training Corpus

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F20%3A10426927" target="_blank" >RIV/00216208:11320/20:10426927 - isvavai.cz</a>
Result on the web
<a href="https://www.aclweb.org/anthology/2020.acl-main.627" target="_blank" >https://www.aclweb.org/anthology/2020.acl-main.627</a>
DOI - Digital Object Identifier
—

Result language
English
Original language name
Cross-Lingual Semantic Role Labeling with High-Quality Translated Training Corpus
Original language description
Much research is devoted to semantic role labeling (SRL), which is crucial for natural language understanding. Supervised approaches achieve impressive performance when large-scale corpora are available, as they are for resource-rich languages such as English. For low-resource languages with no annotated SRL dataset, however, obtaining competitive performance remains challenging. Cross-lingual SRL is one promising way to address the problem, and it has advanced considerably with the help of model transfer and annotation projection. In this paper, we propose a novel alternative based on corpus translation, constructing high-quality training datasets for the target languages from the source gold-standard SRL annotations. Experimental results on the Universal Proposition Bank show that the translation-based method is highly effective and that the automatically constructed pseudo datasets significantly improve target-language SRL performance.
Czech name
—
Czech description
—

Type
O - Miscellaneous
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)

Publication year
2020
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
