Linguistic Injustice in Multilingual Technologies: The TenTen Corpus Family as a Case Study*

The result's identifiers

  • Result code in IS VaVaI

    RIV/00216208:11320/23:MYFXI8YM - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3AMYFXI8YM)

  • Result on the web

    https://www.taylorfrancis.com/chapters/edit/10.4324/9781003393696-12/linguistic-injustice-multilingual-technologies-david-bordonaba-plou-laila-jreis-navarro

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    English

  • Original language name

    Linguistic Injustice in Multilingual Technologies: The TenTen Corpus Family as a Case Study*

  • Original language description

    "The aim of this work is twofold. First, to distinguish a phenomenon that produces a new type of linguistic injustice, “the paradox of Anglocentric multilingualism.” This paradox arises when a multilingual philosophy is pursued in constructing complex systems of analysis in the digital environment; however, these systems imply advantages in the study of English over other languages. The injustice derives from a poor level of precision in the output of a technology when analyzing non-English languages. Second, to contend that multilingual DH should deal with the deficiencies of tools’ performance, in addition to those of the language resources, because this disadvantage makes it difficult for any cross-linguistic study to provide reliable empirical data in dis(proving) linguistic intuitions. To illustrate some of the potential problems derived from the paradox, this work will detail the difficulties we have faced in a cross-linguistic study on color terms, when using the Arabic corpus arTenTen and the Spanish corpus esTenTen in Sketch Engine. We will study the different performances of the tool in Arabic and Spanish, compared to English, to signal the weaknesses of this tool in a multilingual arena, enabling its improvement and enriching the critical and inclusive framework of multilingual DH."

  • Czech name

  • Czech description

Classification

  • Type

    C - Chapter in a specialist book

  • CEP classification

  • OECD FORD branch

    10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)

Result continuities

  • Project

  • Continuities

Others

  • Publication year

    2023

  • Confidentiality

    S - Complete and truthful data on the project are not subject to protection under special legal regulations

Data specific for result type

  • Book/collection name

    "Multilingual Digital Humanities"

  • ISBN

    978-1-00-339369-6

  • Number of pages of the result

    16

  • Pages from-to

    1-16

  • Number of pages of the book

    264

  • Publisher name

    Routledge

  • Place of publication

  • UT code for WoS chapter