All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Typological Challenges for the Application of Multilingual Language Models in the Digital Humanities

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3ANG2I82PF" target="_blank" >RIV/00216208:11320/23:NG2I82PF - isvavai.cz</a>

  • Result on the web

    <a href="https://www.taylorfrancis.com/chapters/edit/10.4324/9781003393696-13/typological-challenges-application-multilingual-language-models-digital-humanities-marcell-richard-fekete-johannes-bjerva-lisa-beinborn" target="_blank" >https://www.taylorfrancis.com/chapters/edit/10.4324/9781003393696-13/typological-challenges-application-multilingual-language-models-digital-humanities-marcell-richard-fekete-johannes-bjerva-lisa-beinborn</a>

  • DOI - Digital Object Identifier

    <a href="http://dx.doi.org/10.4324/9781003393696-13" target="_blank" >10.4324/9781003393696-13</a>

Alternative languages

  • Result language

    angličtina

  • Original language name

    Typological Challenges for the Application of Multilingual Language Models in the Digital Humanities

  • Original language description

    "This chapter explores the challenges faced by multilingual digital humanities (DH) researchers when using natural language processing (NLP) techniques in their work. While recent developments in neural language models have improved the usefulness of NLP tools for DH experts working with complex datasets, there is a bias towards languages in the Global North, particularly English. This bias arises from data imbalance, unequal script, and vocabulary coverage, and differences in linguistic typology, leading to poor performance for most of the world’s languages."

  • Czech name

  • Czech description

Classification

  • Type

    C - Chapter in a specialist book

  • CEP classification

  • OECD FORD branch

    10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Result continuities

  • Project

  • Continuities

Others

  • Publication year

    2023

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Book/collection name

    "Multilingual Digital Humanities"

  • ISBN

    978-1-00-339369-6

  • Number of pages of the result

    20

  • Pages from-to

    1-20

  • Number of pages of the book

    254

  • Publisher name

    Taylor & Francis

  • Place of publication

  • UT code for WoS chapter