Improvements to Dependency Parsing Using Automatic Simplification of Data
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11210%2F14%3A10289998" target="_blank" >RIV/00216208:11210/14:10289998 - isvavai.cz</a>
Result on the web
<a href="http://www.lrec-conf.org/proceedings/lrec2014/pdf/228_Paper.pdf" target="_blank" >http://www.lrec-conf.org/proceedings/lrec2014/pdf/228_Paper.pdf</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Improvements to Dependency Parsing Using Automatic Simplification of Data
Original language description
The paper presents a method of improving dependency parsing using automatic simplification of data. Language data are often too complex (and too sparse) for parsers to cope with. The paper shows that by means of small, reversible simplifications of the text and of the annotation, a considerable improvement of parsing accuracy can be achieved. In order to facilitate the task of language modeling performed by the parser, I reduce variability of lemmas and word forms in the text. I modify the system of morphological annotation to make it more suitable for parsing. Finally, the dependency annotation scheme is also partially modified. All such modifications are automatic and fully reversible: after the parsing is done, the original data and structures are automatically restored. With MaltParser, I achieve an 8.3% error rate reduction.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
AI - Linguistics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/GA13-27184S" target="_blank" >GA13-27184S: Grammar-based treebank of Czech</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2014
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
ISBN
978-2-9517408-8-4
ISSN
—
e-ISSN
—
Number of pages
5
Pages from-to
73-77
Publisher name
European Language Resources Association (ELRA)
Place of publication
Reykjavík, Island
Event location
Reykjavík, Island
Event date
May 26, 2014
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—