Automatic evaluation of surface coherence in L2 texts in Czech
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F16%3A10335521" target="_blank" >RIV/00216208:11320/16:10335521 - isvavai.cz</a>
Result on the web
<a href="http://aclweb.org/anthology/O/O16/O16-1021.pdf" target="_blank" >http://aclweb.org/anthology/O/O16/O16-1021.pdf</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Automatic evaluation of surface coherence in L2 texts in Czech
Original language description
We introduce possibilities of automatic evaluation of surface text coherence (cohesion) in texts written by learners of Czech during certified exams for non-native speakers. On the basis of a corpus analysis, we focus on finding and describing relevant distinctive features for automatic detection of A1-C1 levels (established by CEFR - the Common European Framework of Reference for Languages) in terms of surface text coherence. The CEFR levels are evaluated by human assessors and we try to reach this assessment automatically by using several discourse features like frequency and diversity of discourse connectives, density of discourse relations etc. We present experiments with various features using two machine learning algorithms. Our results of automatic evaluation of CEFR coherence/cohesion marks (compared to human assessment) achieved 73.2% success rate for the detection of A1-C1 levels and 74.9% for the detection of A2-B2 levels.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
AM - Pedagogy and education
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/DG16P02B016" target="_blank" >DG16P02B016: Automatic Evaluation of Text Coherence in Czech</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2016
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing ROCLING XXVIII (2016)
ISBN
978-957-30792-9-3
ISSN
—
e-ISSN
—
Number of pages
15
Pages from-to
214-228
Publisher name
The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
Place of publication
Taipei, Taiwan
Event location
Tainan, Taiwan
Event date
Oct 6, 2016
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—