Text Structure and Its Ambiguities: Corpus Annotation as a Helpful Guide
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F24%3A10492920" target="_blank" >RIV/00216208:11320/24:10492920 - isvavai.cz</a>
Result on the web
<a href="https://ceur-ws.org/Vol-3792/invited2.pdf" target="_blank" >https://ceur-ws.org/Vol-3792/invited2.pdf</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Text Structure and Its Ambiguities: Corpus Annotation as a Helpful Guide
Original language description
It is typical for natural languages that their texts can be understood differently by individual recipients. A number of scientific disciplines, from cognitive psychology to linguistics, are devoted to this phenomenon. In this study, we focus mainly on linguistic factors, which may lead to different interpretations of coherence relations in the text (simply speaking, what is related to what and how). This work presents a pilot typological survey of disagreements in Czech corpus annotations of coherence relations (discourse relations, coreference, information structure) and their common features. Polysemy (polyfunctionality) and semantic underspecification of coherent expressions (e.g. discourse connectives), generic / abstract meaning of autosemantic words, presence of attribution constructions, word order as a potential marker of information structure and text size appear to be essential factors for disagreement in interpretation. In addition, subjective reception of the relative importance of differ
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/GA24-11132S" target="_blank" >GA24-11132S: Disagreement in corpus annotation and variation in human understanding of text</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2024
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 24th Conference Information Technologies – Applications and Theory (ITAT 2024)
ISBN
—
ISSN
1613-0073
e-ISSN
—
Number of pages
11
Pages from-to
2-12
Publisher name
CEUR-WS.org
Place of publication
Košice, Slovakia
Event location
Drienica, Slovakia
Event date
Sep 20, 2024
Type of event by nationality
CST - Celostátní akce
UT code for WoS article
—