Typological Challenges for the Application of Multilingual Language Models in the Digital Humanities
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3ANG2I82PF" target="_blank" >RIV/00216208:11320/23:NG2I82PF - isvavai.cz</a>
Result on the web
<a href="https://www.taylorfrancis.com/chapters/edit/10.4324/9781003393696-13/typological-challenges-application-multilingual-language-models-digital-humanities-marcell-richard-fekete-johannes-bjerva-lisa-beinborn" target="_blank" >https://www.taylorfrancis.com/chapters/edit/10.4324/9781003393696-13/typological-challenges-application-multilingual-language-models-digital-humanities-marcell-richard-fekete-johannes-bjerva-lisa-beinborn</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.4324/9781003393696-13" target="_blank" >10.4324/9781003393696-13</a>
Alternative languages
Result language
angličtina
Original language name
Typological Challenges for the Application of Multilingual Language Models in the Digital Humanities
Original language description
"This chapter explores the challenges faced by multilingual digital humanities (DH) researchers when using natural language processing (NLP) techniques in their work. While recent developments in neural language models have improved the usefulness of NLP tools for DH experts working with complex datasets, there is a bias towards languages in the Global North, particularly English. This bias arises from data imbalance, unequal script, and vocabulary coverage, and differences in linguistic typology, leading to poor performance for most of the world’s languages."
Czech name
—
Czech description
—
Classification
Type
C - Chapter in a specialist book
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
—
Others
Publication year
2023
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Book/collection name
"Multilingual Digital Humanities"
ISBN
978-1-00-339369-6
Number of pages of the result
20
Pages from-to
1-20
Number of pages of the book
254
Publisher name
Taylor & Francis
Place of publication
—
UT code for WoS chapter
—