Towards a Slovene Dependency Treebank
The result's identifiers
Result code in IS VaVaI
RIV/00216208:11320/06:10077908 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F06%3A10077908)
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Towards a Slovene Dependency Treebank
Original language description
The paper presents the initial release of the Slovene Dependency Treebank, currently containing 2000 sentences or 30,000 words. Our approach to annotation is based on the Prague Dependency Treebank, which serves as an excellent model due to the similarity of the languages, the existence of a detailed annotation guide and an annotation editor. The initial treebank contains a portion of the MULTEXT-East parallel word-level annotated corpus, namely the first part of the Slovene translation of Orwell's "1984". This corpus was first parsed automatically, to arrive at the initial analytic level dependency trees. These were then hand-corrected using the tree editor TrEd; simultaneously, the Czech annotation manual was modified for Slovene. The current version is available in XML/TEI, as well as derived formats, and has been used in a comparative evaluation using the MALT parser, and as one of the languages present in the CoNLL-X shared task on dependency parsing. The paper also discusses furth
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
AI - Linguistics
OECD FORD branch
—
Result continuities
Project
1ET101120503: Integration of language resources for information extraction from natural texts
Continuities
P - Research and development project financed from public funds (with a link to CEP)
Others
Publication year
2006
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006)
ISBN
2-9517408-2-4
ISSN
—
e-ISSN
—
Number of pages
4
Pages from-to
—
Publisher name
ELRA
Place of publication
Genova, Italy
Event location
Genova, Italy
Event date
May 22, 2006
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—