Robust Multilingual Statistical Morphological Generation Models
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F13%3A10194629" target="_blank" >RIV/00216208:11320/13:10194629 - isvavai.cz</a>
Result on the web
<a href="http://aclweb.org/anthology/P/P13/P13-3023.pdf" target="_blank" >http://aclweb.org/anthology/P/P13/P13-3023.pdf</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Robust Multilingual Statistical Morphological Generation Models
Original language description
We present a novel method of statistical morphological generation, i.e. the prediction of inflected word forms given lemma, part-of-peech and morphological features, aimed at robustness to unseen inputs. Our system uses a trainable classifier to predict"edit scripts" that are then used to transform lemmas into inflected word forms. Suffixes of lemmas are included as features to achieve robustness. We evaluate our system on 6 languages with a varying degree of morphological richness. The results show that the system is able to learn most morphological phenomena and generalize to unseen inputs, producing significantly better results than a dictionary-based baseline.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/LK11221" target="_blank" >LK11221: Development of statistical methods for spoken dalogue systems</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2013
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop
ISBN
978-1-937284-53-4
ISSN
—
e-ISSN
—
Number of pages
7
Pages from-to
158-164
Publisher name
Association for Computational Linguistics
Place of publication
Sofija, Bulgaria
Event location
Sofija, Bulgaria
Event date
Aug 5, 2013
Type of event by nationality
CST - Celostátní akce
UT code for WoS article
—