All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Towards a Slovene Dependency Treebank

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F06%3A10077908" target="_blank" >RIV/00216208:11320/06:10077908 - isvavai.cz</a>

  • Result on the web

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    angličtina

  • Original language name

    Towards a Slovene Dependency Treebank

  • Original language description

    The paper presents the initial release of the Slovene Dependency Treebank, currently containing 2000 sentences or 30.000 words. Our approach to annotation is based on the Prague Dependency Treebank, which serves as an excellent model due to the similarity of the languages, the existence of a detailed annotation guide and an annotation editor. The initial treebank contains a portion of the MULTEXT-East parallel word-level annotated corpus, namely the first part of the Slovene twas first parsed automatically, to arrive at the initial analytic level dependency trees. These were then hand corrected using the treeranslation of Orwell's "1984". This corpus editor TrEd; simultaneously, the Czech annotation manual was modified for Slovene. The current versionis available in XML/TEI, as well as derived formats, and has been used in a comparative evaluation using the MALT parser, and as one of the languages present in the CoNLL-X shared task on dependency parsing. The paper also discusses furth

  • Czech name

  • Czech description

Classification

  • Type

    D - Article in proceedings

  • CEP classification

    AI - Linguistics

  • OECD FORD branch

Result continuities

  • Project

    <a href="/en/project/1ET101120503" target="_blank" >1ET101120503: Integration of language resources for information extraction from natural texts</a><br>

  • Continuities

    P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Others

  • Publication year

    2006

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Article name in the collection

    Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006)

  • ISBN

    2-9517408-2-4

  • ISSN

  • e-ISSN

  • Number of pages

    4

  • Pages from-to

  • Publisher name

    ELRA

  • Place of publication

    Genova, Italy

  • Event location

    Genova, Italy

  • Event date

    May 22, 2006

  • Type of event by nationality

    WRD - Celosvětová akce

  • UT code for WoS article