Subjects tend to be coded only once: Corpus-based and grammar-based evidence for an efficiency-driven trade-off
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F20%3A10426937" target="_blank" >RIV/00216208:11320/20:10426937 - isvavai.cz</a>
Výsledek na webu
<a href="https://www.aclweb.org/anthology/2020.tlt-1.8" target="_blank" >https://www.aclweb.org/anthology/2020.tlt-1.8</a>
DOI - Digital Object Identifier
—
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
Subjects tend to be coded only once: Corpus-based and grammar-based evidence for an efficiency-driven trade-off
Popis výsledku v původním jazyce
Using data from the World Atlas of Language Structures and the Universal Dependencies treebanks, we provide converging evidence from linguistic typology and comparative corpus linguistics for an efficiency-based trade-off in the encoding of referentially accessible subjects. Specifically, when familiar subjects are marked as bound elements attaching to the verb, the chancesof having obligatory independent subject pronouns decrease significantly across the world’s languages. At the same time, there is a trend against not encoding the subject at all, leading us topostulate an overall tendency to encode familiar subjects once and only once in a neutral topiccomment utterance. This tendency is mirrored in more fine-grained corpus data from Slavic:East Slavic languages, in contrast to the other members of the genus, have past forms withoutverbal subject encoding, and it is precisely with these (former participle) forms that the use ofindependent subject pronouns is significantly higher than with other, non-participial verb forms.By contrast, the occurrence of independent subject pronouns does not differ across various verbforms in other Slavic languages, as none of them has been affected by a loss of verbal subjectencoding.
Název v anglickém jazyce
Subjects tend to be coded only once: Corpus-based and grammar-based evidence for an efficiency-driven trade-off
Popis výsledku anglicky
Using data from the World Atlas of Language Structures and the Universal Dependencies treebanks, we provide converging evidence from linguistic typology and comparative corpus linguistics for an efficiency-based trade-off in the encoding of referentially accessible subjects. Specifically, when familiar subjects are marked as bound elements attaching to the verb, the chancesof having obligatory independent subject pronouns decrease significantly across the world’s languages. At the same time, there is a trend against not encoding the subject at all, leading us topostulate an overall tendency to encode familiar subjects once and only once in a neutral topiccomment utterance. This tendency is mirrored in more fine-grained corpus data from Slavic:East Slavic languages, in contrast to the other members of the genus, have past forms withoutverbal subject encoding, and it is precisely with these (former participle) forms that the use ofindependent subject pronouns is significantly higher than with other, non-participial verb forms.By contrast, the occurrence of independent subject pronouns does not differ across various verbforms in other Slavic languages, as none of them has been affected by a loss of verbal subjectencoding.
Klasifikace
Druh
O - Ostatní výsledky
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Návaznosti výsledku
Projekt
—
Návaznosti
—
Ostatní
Rok uplatnění
2020
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů