Subjects tend to be coded only once: Corpus-based and grammar-based evidence for an efficiency-driven trade-off
The result's identifiers
Result code in IS VaVaI
RIV/00216208:11320/20:10426937 (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F20%3A10426937)
Result on the web
https://www.aclweb.org/anthology/2020.tlt-1.8
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Subjects tend to be coded only once: Corpus-based and grammar-based evidence for an efficiency-driven trade-off
Original language description
Using data from the World Atlas of Language Structures and the Universal Dependencies treebanks, we provide converging evidence from linguistic typology and comparative corpus linguistics for an efficiency-based trade-off in the encoding of referentially accessible subjects. Specifically, when familiar subjects are marked as bound elements attaching to the verb, the chances of having obligatory independent subject pronouns decrease significantly across the world’s languages. At the same time, there is a trend against not encoding the subject at all, leading us to postulate an overall tendency to encode familiar subjects once and only once in a neutral topic-comment utterance. This tendency is mirrored in more fine-grained corpus data from Slavic: East Slavic languages, in contrast to the other members of the genus, have past forms without verbal subject encoding, and it is precisely with these (former participle) forms that the use of independent subject pronouns is significantly higher than with other, non-participial verb forms. By contrast, the occurrence of independent subject pronouns does not differ across various verb forms in other Slavic languages, as none of them has been affected by a loss of verbal subject encoding.
Czech name
—
Czech description
—
Classification
Type
O - Miscellaneous
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
—
Others
Publication year
2020
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations