Automatic analysis of caregiver input and child production: Insight into corpus-based research on child language development in Korean
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989592%3A15210%2F22%3A73614145" target="_blank" >RIV/61989592:15210/22:73614145 - isvavai.cz</a>
Result on the web
<a href="https://benjamins.com/catalog/kl.20002.shi" target="_blank" >https://benjamins.com/catalog/kl.20002.shi</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1075/kl.20002.shi" target="_blank" >10.1075/kl.20002.shi</a>
Alternative languages
Result language
English
Original language name
Automatic analysis of caregiver input and child production: Insight into corpus-based research on child language development in Korean
Original language description
The present study explores the applicability of Natural Language Processing (NLP) techniques to the investigation of child corpora in Korean. We use caregiver input and child production data from the CHILDES database, currently the largest open-access Korean child corpus, and apply NLP techniques to the data in two ways: automatic Part-of-Speech tagging by adapting a machine learning algorithm, and (semi-)automatic extraction of constructional patterns expressing a transitive event (active transitive and suffixal passive). As the first empirical report on NLP-assisted analysis of Korean child corpora, this study is expected to reveal the advantages and drawbacks of this approach, thereby opening the way to further corpus-mediated research on child language development in Korean. The findings also have implications for research practice in developmental studies on Korean using child corpora, notably for ensuring the reproducibility of procedures and results, which has often been lacking in previous corpus-based research on child language development in Korean.
Czech name
—
Czech description
—
Classification
Type
J<sub>ost</sub> - Miscellaneous article in a specialist periodical
CEP classification
—
OECD FORD branch
60203 - Linguistics
Result continuities
Project
—
Continuities
I - Institutional support for the long-term conceptual development of a research organisation
Others
Publication year
2022
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Name of the periodical
Korean Linguistics
ISSN
0257-3784
e-ISSN
2212-9731
Volume of the periodical
18
Issue of the periodical within the volume
2
Country of publishing house
KR - KOREA, REPUBLIC OF
Number of pages
34
Pages from-to
125–158
UT code for WoS article
000871382900002
EID of the result in the Scopus database
—