Investigating interoperable event corpora: limitations of reusability of resources and portability of models
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3A9M7KNHL5" target="_blank" >RIV/00216208:11320/23:9M7KNHL5 - isvavai.cz</a>
Result on the web
<a href="https://link.springer.com/10.1007/s10579-023-09643-6" target="_blank" >https://link.springer.com/10.1007/s10579-023-09643-6</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/s10579-023-09643-6" target="_blank" >10.1007/s10579-023-09643-6</a>
Alternative languages
Result language
angličtina
Original language name
Investigating interoperable event corpora: limitations of reusability of resources and portability of models
Original language description
"Abstractn Studies on the applicability of heterogeneous semantically interoperable corpora are rare. We investigate to what extent reusability (both of systems and of annotations) is entailed by corpora whose interoperability is based on compliance to standards. In particular, we look at event detection in English texts, supported by the ISO-TimeML annotation scheme. We run two sets of experiments using a common neural network architecture and extensively evaluate our results on both in-distribution and out-of-distribution settings. In all experimental settings, systems obtain state-of-the-art results on the in-distribution data and underperform out-of-distribution ones, setting limits to the benefits of semantically interoperable corpora. By means of a detailed error analysis, we show that while being compliant to a standard guarantees semantic interoperability, this becomes only a necessary condition for reusability, with factors such as differences in the quality of the annotations having a much stronger impact."
Czech name
—
Czech description
—
Classification
Type
J<sub>ost</sub> - Miscellaneous article in a specialist periodical
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
—
Others
Publication year
2023
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
"Language Resources and Evaluation"
ISSN
1574-020X
e-ISSN
—
Volume of the periodical
57
Issue of the periodical within the volume
3
Country of publishing house
US - UNITED STATES
Number of pages
31
Pages from-to
1107-1137
UT code for WoS article
—
EID of the result in the Scopus database
—