Coreference Resolution System Not Only for Czech
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F17%3A10372164" target="_blank" >RIV/00216208:11320/17:10372164 - isvavai.cz</a>
Result on the web
<a href="http://ceur-ws.org/Vol-1885/193.pdf" target="_blank" >http://ceur-ws.org/Vol-1885/193.pdf</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Coreference Resolution System Not Only for Czech
Original language description
The paper introduces Treex CR, a coreference resolution (CR) system not only for Czech. As its name suggests, it has been implemented as an integral part of the Treex NLP framework. The main feature that distinguishes it from other CR systems is that it operates on the tectogrammatical layer, a representation of deep syntax. This feature allows for natural handling of elided expressions, e.g. unexpressed subjects in Czech as well as generally ignored English anaphoric expression - relative pronouns and zeros. The system implements a sequence of mention ranking models specialized at particular types of coreferential expressions (relative, reflexive, personal pronouns etc.). It takes advantage of rich feature set extracted from the data linguistically preprocessed with Treex. We evaluated Treex CR on Czech and English datasets and compared it with other systems as well as with modules used in Treex so far.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/GA16-05394S" target="_blank" >GA16-05394S: Structure of coreferential chains in parallel language data</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2017
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 17th conference ITAT 2017: Slovenskočeský NLP workshop (SloNLP 2017)
ISBN
978-1-974274-74-1
ISSN
1613-0073
e-ISSN
neuvedeno
Number of pages
8
Pages from-to
193-200
Publisher name
CreateSpace Independent Publishing Platform
Place of publication
Praha, Czechia
Event location
Martinské hole, Malá Fatra, Slovakia
Event date
Sep 23, 2017
Type of event by nationality
CST - Celostátní akce
UT code for WoS article
—