Genres in the Prague Discourse Treebank
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F14%3A10289405" target="_blank" >RIV/00216208:11320/14:10289405 - isvavai.cz</a>
Result on the web
<a href="http://www.lrec-conf.org/proceedings/lrec2014/index.html" target="_blank" >http://www.lrec-conf.org/proceedings/lrec2014/index.html</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Genres in the Prague Discourse Treebank
Original language description
We present the project of classification of Prague Discourse Treebank documents (Czech journalistic texts) for their genres. Our main interest lies in opening the possibility to observe how text coherence is realized in different types (in the genre sense) of language data and, in the future, in exploring the ways of using genres as a feature for multi-sentence-level language technologies. In the paper, we first describe the motivation and the concept of the genre annotation, and briefly introduce the Prague Discourse Treebank. Then, we elaborate on the process of manual annotation of genres in the treebank, from the annotators' manual work to post-annotation checks and to the inter-annotator agreement measurements. The annotated genres are subsequently analyzed together with discourse relations (already annotated in the treebank) - we present distributions of the annotated genres and results of studying distinctions of distributions of discourse relations across the individual genres.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
The result was created during the realization of more than one project. More information is available in the Projects tab.
Continuities
P - Research and development project financed from public funds (with a link to CEP)
S - Specific research at universities
Others
Publication year
2014
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Name of the proceedings
Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014)
ISBN
978-2-9517408-8-4
ISSN
—
e-ISSN
—
Number of pages
7
Pages from-to
1320-1326
Publisher name
European Language Resources Association
Place of publication
Reykjavík, Iceland
Event location
Reykjavík, Iceland
Event date
May 26, 2014
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—