All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Compiling and annotating a learner corpus for a morphologically rich language - CzeSL, a corpus of non-native Czech

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11210%2F20%3A10419686" target="_blank" >RIV/00216208:11210/20:10419686 - isvavai.cz</a>

  • Alternative codes found

    RIV/00216208:11320/20:10419686

  • Result on the web

    <a href="https://dspace.cuni.cz/handle/20.500.11956/123103" target="_blank" >https://dspace.cuni.cz/handle/20.500.11956/123103</a>

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    angličtina

  • Original language name

    Compiling and annotating a learner corpus for a morphologically rich language - CzeSL, a corpus of non-native Czech

  • Original language description

    Learner corpora, linguistic collections documenting a language as used by learners, provide an important empirical foundation for language acquisition research and teaching practice. This book presents CzeSL, a corpus of non-native Czech, against the background of theoretical and practical issues in the current learner corpus research. Languages with rich morphology and relatively free word order, including Czech, are particularly challenging for the analysis of learner language. The authors address both the complexity of learner error annotation, describing three complementary annotation schemes, and the complexity of description of non-native Czech in terms of standard linguistic categories. The book discusses in detail practical aspects of the corpus creation: the process of collection and annotation itself, the supporting tools, the resulting data, their formats and search platforms. The chapter on use cases exemplifies the usefulness of learner corpora for teaching, language acquisition research, and computational linguistics. Any researcher developing learner corpora will surely appreciate the concluding chapter listing lessons learned and pitfalls to avoid.

  • Czech name

  • Czech description

Classification

  • Type

    B - Specialist book

  • CEP classification

  • OECD FORD branch

    60203 - Linguistics

Result continuities

  • Project

    Result was created during the realization of more than one project. More information in the Projects tab.

  • Continuities

    P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Others

  • Publication year

    2020

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • ISBN

    978-80-246-4759-3

  • Number of pages

    281

  • Publisher name

    Karolinum

  • Place of publication

    Praha

  • UT code for WoS book