All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

EuReCo: Not Building and Yet Using Federated Comparable Corpora for Cross-Linguistic Research

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A90244%2F24%3A10495719" target="_blank" >RIV/00216208:90244/24:10495719 - isvavai.cz</a>

  • Result on the web

    <a href="https://aclanthology.org/2024.bucc-1.10.pdf" target="_blank" >https://aclanthology.org/2024.bucc-1.10.pdf</a>

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    angličtina

  • Original language name

    EuReCo: Not Building and Yet Using Federated Comparable Corpora for Cross-Linguistic Research

  • Original language description

    This paper gives an overview of recent developments concerning the European Reference Corpus EuReCo, an open long-term initiative aimed at providing and using virtual and dynamically definable comparable corpora based on existing national, reference or other large corpora. Given the problems and shortcomings of other types of multilingual corpora - such as the shining-through effects in parallel corpora or the limitation to web material only in web-based comparable corpora - EuReCo constitutes a unique linguistic resource that offers new perspectives for fine-grained cross-linguistic research. The approach advocated here puts forward new solutions to notorious IPR and licensing issues, as well as to challenges of interoperability. It also addresses methodological questions concerning comparability and representativeness. While the focus of this paper is on EuReCo&apos;s implementation-based approach to ensuring interoperability in a feasible and maintainable way, it also presents preliminary results of pilot comparative studies on light verb constructions in German, Romanian, Hungarian, Polish and Bulgarian, and reports on recent extensions and plans.

  • Czech name

  • Czech description

Classification

  • Type

    D - Article in proceedings

  • CEP classification

  • OECD FORD branch

    60203 - Linguistics

Result continuities

  • Project

  • Continuities

Others

  • Publication year

    2024

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Article name in the collection

    Proceedings of the 17th Workshop on Building and Using Comparable Corpora

  • ISBN

    978-2-493-81431-9

  • ISSN

  • e-ISSN

  • Number of pages

    10

  • Pages from-to

    94-103

  • Publisher name

    ELRA

  • Place of publication

    Torino

  • Event location

    Torino

  • Event date

    May 20, 2024

  • Type of event by nationality

    WRD - Celosvětová akce

  • UT code for WoS article