What quantifying word order freedom can tell us about dependency corpora
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3AB2TRI3X8" target="_blank" >RIV/00216208:11320/23:B2TRI3X8 - isvavai.cz</a>
Result on the web
<a href="https://www.scopus.com/inward/record.uri?eid=2-s2.0-85175399928&partnerID=40&md5=23b5939f6fdd48a4c16542ccbf1057ff" target="_blank" >https://www.scopus.com/inward/record.uri?eid=2-s2.0-85175399928&partnerID=40&md5=23b5939f6fdd48a4c16542ccbf1057ff</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
What quantifying word order freedom can tell us about dependency corpora
Original language description
"Building upon existing work on word order freedom and syntactic annotation, this paper investigates whether we can differentiate between findings that reveal inherent properties of natural languages and their syntax, and features dependent on annotations used in computing the measures. An existing quantifiable and linguistically interpretable measure of word order freedom in language is applied to take a closer look at the robustness of the basic measure (word order entropy) to variations in dependency corpora used in the analysis. Measures are compared at three levels of generality, applied to corpora annotated according to the Universal Dependencies v1 and v2 annotation guidelines, selecting 31 languages for analysis. Preliminary results show that certain measures, such as subject-object relation order freedom, are sensitive to slight changes in annotation guidelines, while simpler measures are more robust, highlighting aspects of these metrics that should be taken into consideration when using dependency corpora for linguistic analysis and generalisation. © 2023 Association for Computational Linguistics."
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
—
Others
Publication year
2023
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
"Depling 2023 - Int. Conf. Dependency Linguist., Depling (GURT/SyntaxFest 2023), Proc."
ISBN
978-195942932-6
ISSN
—
e-ISSN
—
Number of pages
14
Pages from-to
54-67
Publisher name
Association for Computational Linguistics
Place of publication
—
Event location
Cham
Event date
Jan 1, 2023
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—