Evaluating and automating the annotation of a learner corpus

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24510%2F13%3A%230001084" target="_blank" >RIV/46747885:24510/13:#0001084 - isvavai.cz</a>
Výsledek na webu
—
DOI - Digital Object Identifier
—

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Evaluating and automating the annotation of a learner corpus
Popis výsledku v původním jazyce
The paper describes CzeSL, a learner corpus of Czech, together with basic properties of its design. It starts with a brief introduction of the project within the context of AKCES, a programme addressing Czech acquisition corpora; in connection with the programme we are also concerned with groups of respondents, including differences due to their L1; further we comment on the choice of sociocultural metadata recorded with each text and related both to the learner and the text production task. Next we describe the intended uses of CzeSL. The main parts of the text deal with transcription and annotation. We explain the issues involved in transcription of the handwritten texts and present the concept of a multi-level annotation scheme including taxonomy ofcaptured errors. We conclude by mentioning results from an evaluation of the error annotation and presenting plans for future research.
Název v anglickém jazyce
Evaluating and automating the annotation of a learner corpus
Popis výsledku anglicky
The paper describes CzeSL, a learner corpus of Czech, together with basic properties of its design. It starts with a brief introduction of the project within the context of AKCES, a programme addressing Czech acquisition corpora; in connection with the programme we are also concerned with groups of respondents, including differences due to their L1; further we comment on the choice of sociocultural metadata recorded with each text and related both to the learner and the text production task. Next we describe the intended uses of CzeSL. The main parts of the text deal with transcription and annotation. We explain the issues involved in transcription of the handwritten texts and present the concept of a multi-level annotation scheme including taxonomy ofcaptured errors. We conclude by mentioning results from an evaluation of the error annotation and presenting plans for future research.

Klasifikace

Druh
D - Stať ve sborníku
CEP obor
AI - Jazykověda
OECD FORD obor
—

Návaznosti výsledku

Projekt
—
Návaznosti
V - Vyzkumna aktivita podporovana z jinych verejnych zdroju

Ostatní

Rok uplatnění
2013
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název statě ve sborníku
Twenty Years of Learner Corpus Research: Looking back, Moving ahead
ISBN
978-2-87558-199-0
ISSN
—
e-ISSN
—
Počet stran výsledku
11
Strana od-do
435-446
Název nakladatele
Presses universitaires de Louvain
Místo vydání
Louvain-la-Neuve
Místo konání akce
Louvain-la-Neuve
Datum konání akce
1. 1. 2011
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—

Podobné výsledky(10)

A learner corpus of Czech: current state and future directions Anotace chybových textů v českém žákovském korpusu K přepisu textů nerodilých mluvčích češtiny pro potřeby žákovského korpusu

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Evaluating and automating the annotation of a learner corpus

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)