Evaluating and automating the annotation of a learner corpus
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F13%3A10194812" target="_blank" >RIV/00216208:11320/13:10194812 - isvavai.cz</a>
Alternative codes found
RIV/00216208:11210/13:10194812 RIV/46747885:24510/13:#0001083
Result on the web
<a href="http://dx.doi.org/10.1007/s10579-013-9226-3" target="_blank" >http://dx.doi.org/10.1007/s10579-013-9226-3</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/s10579-013-9226-3" target="_blank" >10.1007/s10579-013-9226-3</a>
Alternative languages
Result language
angličtina
Original language name
Evaluating and automating the annotation of a learner corpus
Original language description
The paper describes a corpus of texts produced by non-native speakers of Czech. We discuss its annotation scheme, consisting of three interlinked tiers, designed to handle a wide range of error types present in the input. Each tier corrects different types of errors; links between the tiers allow capturing errors in word order and complex discontinuous expressions. Errors are not only corrected, but also classified. The annotation scheme is tested on a data set including approx. 175,000 words with fairinter-annotator agreement results. We also explore the possibility of applying automated linguistic annotation tools (taggers, spell checkers and grammar checkers) to the learner text to support or even substitute manual annotation.
Czech name
—
Czech description
—
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
AI - Linguistics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/GPP406%2F10%2FP328" target="_blank" >GPP406/10/P328: Resource-light Morphological Analysis and Tagging</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2013
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Language Resources and Evaluation
ISSN
1574-020X
e-ISSN
—
Volume of the periodical
47
Issue of the periodical within the volume
1
Country of publishing house
NL - THE KINGDOM OF THE NETHERLANDS
Number of pages
2
Pages from-to
1-2
UT code for WoS article
—
EID of the result in the Scopus database
—