Do UD Trees Match Mention Spans in Coreference Annotations?
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F21%3A10440572" target="_blank" >RIV/00216208:11320/21:10440572 - isvavai.cz</a>
Result on the web
<a href="https://aclanthology.org/2021.findings-emnlp.303/" target="_blank" >https://aclanthology.org/2021.findings-emnlp.303/</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Do UD Trees Match Mention Spans in Coreference Annotations?
Original language description
One can find dozens of data resources for various languages in which coreference (a relation between two or more expressions that refer to the same real-world entity) is manually annotated. One could also assume that such expressions usually constitute syntactically meaningful units; however, in most coreference projects mention spans have been annotated simply by delimiting token intervals, i.e., independently of any syntactic representation. We argue that it could be advantageous to make syntactic and coreference annotations convergent in the long term. We present a pilot empirical study focused on matches and mismatches between hand-annotated linear mention spans and automatically parsed syntactic trees that follow Universal Dependencies conventions. The study includes 8 datasets for 7 different languages.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
The result was created within more than one project.
Continuities
P - Research and development project financed from public funds (with a link to CEP)
Others
Publication year
2021
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Findings of the Association for Computational Linguistics: EMNLP 2021
ISBN
978-1-955917-10-0
ISSN
—
e-ISSN
—
Number of pages
7
Pages from-to
3570-3576
Publisher name
Association for Computational Linguistics
Place of publication
Stroudsburg, PA, USA
Event location
Punta Cana, Dominican Republic
Event date
Nov 7, 2021
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—