Universal Dependencies and Non-Native Czech
The result's identifiers
Result code in IS VaVaI
RIV/00216208:11320/18:10390152 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F18%3A10390152)
Result on the web
http://www.ep.liu.se/ecp/155/ecp18155.pdf
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Universal Dependencies and Non-Native Czech
Original language description
CzeSL is a learner corpus of texts produced by non-native speakers of Czech. Such corpora are a great source of information about specific features of learners' language, helping language teachers and researchers in the area of second language acquisition. In our project, we have focused on syntactic annotation of the non-native text within the framework of Universal Dependencies. As far as we know, this is the first project annotating a richly inflectional non-native language. Our ideal goal has been to annotate according to the non-native grammar in the mind of the author, not according to the standard grammar. However, this brings many challenges. First, we do not have enough data to get reliable insights into the grammar of each author. Second, many phenomena are far more complicated than they are in native languages. We believe that the most important result of this project is not the actual annotation, but the guidelines and principles that can be used as a basis for other non-native languages.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
The result was created in the course of more than one project. More information is in the Projects tab.
Continuities
P - Research and development project financed from public funds (with a link to CEP)
Others
Publication year
2018
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Proceedings of the 17th International Workshop on Treebanks and Linguistic Theories (TLT 2018)
ISBN
978-91-7685-137-1
ISSN
1650-3740
e-ISSN
not specified
Number of pages
10
Pages from-to
105-114
Publisher name
Linköping University Electronic Press
Place of publication
Linköping, Sweden
Event location
Oslo, Norway
Event date
Dec 13, 2018
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—