Towards Multilingual Event Extraction Evaluation: A Case Study for the Czech Language
The result's identifiers
Result code in IS VaVaI
RIV/49777513:23520/15:43927662 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F15%3A43927662)
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Towards Multilingual Event Extraction Evaluation: A Case Study for the Czech Language
Original language description
This paper presents a multilingual corpus of news, annotated with event metadata. The events in our corpus come from the domains of violence and of natural and man-made disasters. The main goal of the corpus is the automatic evaluation of event detection and extraction systems in different languages. As a use case, we take a rule-based event extraction system, extend it to cover a new language, Czech in our case, and evaluate it on the corpus. We explain what needs to be done to cover a new language, especially learning domain-specific dictionaries and event extraction patterns. The evaluation of the Czech system can be viewed as a starting point for further research into the evaluation of multilingual event extraction systems, which is an important stage in the development of such systems. The comparison of the performance of the Czech and English systems indicates the importance of multilingual event extraction evaluation.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
—
Continuities
R - Project of the EC Framework Programme
Others
Publication year
2015
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE RECENT ADVANCES IN NATURAL LANGUAGE PROCESSING'2015
ISBN
—
ISSN
1313-8502
e-ISSN
—
Number of pages
9
Pages from-to
627-635
Publisher name
INCOMA Ltd.
Place of publication
Shoumen, Bulgaria
Event location
Hissar, Bulgaria
Event date
Sep 7, 2015
Type of event by nationality
EUR - European event
UT code for WoS article
—