All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

A morphologically annotated longitudinal corpus of spoken Czech child-adult interactions

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11210%2F24%3A10471403" target="_blank" >RIV/00216208:11210/24:10471403 - isvavai.cz</a>

  • Alternative codes found

    RIV/00216208:11320/25:HSRPS3XP

  • Result on the web

    <a href="https://verso.is.cuni.cz/pub/verso.fpl?fname=obd_publikace_handle&handle=O5tdJDeU9b" target="_blank" >https://verso.is.cuni.cz/pub/verso.fpl?fname=obd_publikace_handle&handle=O5tdJDeU9b</a>

  • DOI - Digital Object Identifier

    <a href="http://dx.doi.org/10.1007/s10579-023-09710-y" target="_blank" >10.1007/s10579-023-09710-y</a>

Alternative languages

  • Result language

    angličtina

  • Original language name

    A morphologically annotated longitudinal corpus of spoken Czech child-adult interactions

  • Original language description

    The paper presents a longitudinal corpus of transcribed spontaneous child-adult interactions in Czech. It consists of 99,388 tokens in 42,103 utterances produced by seven children between ca 1.5 to 3.5 years of age, and 238,211 tokens in 61,252 utterances produced by their close caregivers in everyday situations at home. The corpus covers language production of the children from the mean length of 1.01 word per utterance up to 5.33 words per utterance. The length of the recorded period ranges for individual children from 11 to 27 months. The transcripts of both child and adult utterances were lemmatized and tagged using MorphoDiTa, a tool for automatic morphological analysis of Czech. The annotation was transformed into the MOR format used within CHILDES, a database dedicated to corpora of first language acquisition. Detailed manual checking was performed on the annotation of all children&apos;s utterances. Data from three children were used for a comparison of part-of-speech classification before and after manual checking, data from one child was additionally analyzed for differences in morphological tagging proper. The number of differences was rather low, with (expected) limitations in the areas of part-of-speech classification for uninflected words, annotation of homonymous forms, and annotation of child-specific words. The corpus represents an important contribution to the research of child language with special significance for Slavic languages and other morphologically rich inflecting languages, which are still underrepresented in the study of first language acquisition.

  • Czech name

  • Czech description

Classification

  • Type

    J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database

  • CEP classification

  • OECD FORD branch

    60203 - Linguistics

Result continuities

  • Project

    Result was created during the realization of more than one project. More information in the Projects tab.

  • Continuities

    P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach<br>I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace

Others

  • Publication year

    2024

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Name of the periodical

    Language Resources and Evaluation

  • ISSN

    1574-020X

  • e-ISSN

    1574-0218

  • Volume of the periodical

    Neuveden

  • Issue of the periodical within the volume

    30.03.2024

  • Country of publishing house

    NL - THE KINGDOM OF THE NETHERLANDS

  • Number of pages

    24

  • Pages from-to

    1-24

  • UT code for WoS article

    001194629700002

  • EID of the result in the Scopus database