Using Low-Cost Annotation to Train a Reliable Czech Shallow Parser
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14210%2F13%3A00069444" target="_blank" >RIV/00216224:14210/13:00069444 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1007/978-3-642-40585-3_72" target="_blank" >http://dx.doi.org/10.1007/978-3-642-40585-3_72</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-642-40585-3_72" target="_blank" >10.1007/978-3-642-40585-3_72</a>
Alternative languages
Result language
angličtina
Original language name
Using Low-Cost Annotation to Train a Reliable Czech Shallow Parser
Original language description
Bushbank is a relatively new concept - a type of annotated corpus where annotation is driven by use of automatic tools and the task of human annotators is limited to accepting or rejecting parts of their output. This creates a possibility to obtain annotated corpora of considerable size at relatively low cost. In this paper we ask the question if the Czech Bushbank is reliable enough to be used for a NLP task instead of a traditional corpus with high annotation rigour. We perform evaluation of three different parsers using its shallow syntactic annotation, including a CRF chunker made originally for Polish. The results are very promising, showing that many practical applications could benefit from low-cost annotation.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
AI - Linguistics
OECD FORD branch
—
Result continuities
Project
—
Continuities
N - Vyzkumna aktivita podporovana z neverejnych zdroju
Others
Publication year
2013
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Text, Speech, and Dialogue
ISBN
9783642405846
ISSN
0302-9743
e-ISSN
—
Number of pages
8
Pages from-to
575-582
Publisher name
Springer Berling Heidelberg
Place of publication
Plzeň
Event location
Plzeň
Event date
Jan 1, 2013
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—