Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F25%3AMQJSCA3P" target="_blank" >RIV/00216208:11320/25:MQJSCA3P - isvavai.cz</a>
Výsledek na webu
<a href="http://arxiv.org/abs/2410.16069" target="_blank" >http://arxiv.org/abs/2410.16069</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.48550/arXiv.2410.16069" target="_blank" >10.48550/arXiv.2410.16069</a>

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context
Popis výsledku v původním jazyce
Human processing of idioms relies on understanding the contextual sentences in which idioms occur, as well as language-intrinsic features such as frequency and speaker-intrinsic factors like familiarity. While LLMs have shown high performance on idiomaticity detection tasks, this success may be attributed to reasoning shortcuts in existing datasets. To this end, we construct a novel, controlled contrastive dataset designed to test whether LLMs can effectively use context to disambiguate idiomatic meaning. Additionally, we explore how collocational frequency and sentence probability influence model performance. Our findings reveal that LLMs often fail to resolve idiomaticity when it is required to attend to the surrounding context, and that models perform better on sentences that have higher likelihood. The collocational frequency of expressions also impacts performance. We make our code and dataset publicly available.
Název v anglickém jazyce
Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context
Popis výsledku anglicky
Human processing of idioms relies on understanding the contextual sentences in which idioms occur, as well as language-intrinsic features such as frequency and speaker-intrinsic factors like familiarity. While LLMs have shown high performance on idiomaticity detection tasks, this success may be attributed to reasoning shortcuts in existing datasets. To this end, we construct a novel, controlled contrastive dataset designed to test whether LLMs can effectively use context to disambiguate idiomatic meaning. Additionally, we explore how collocational frequency and sentence probability influence model performance. Our findings reveal that LLMs often fail to resolve idiomaticity when it is required to attend to the surrounding context, and that models perform better on sentences that have higher likelihood. The collocational frequency of expressions also impacts performance. We make our code and dataset publicly available.

Klasifikace

Druh
O - Ostatní výsledky
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Návaznosti výsledku

Projekt
—
Návaznosti
—

Ostatní

Rok uplatnění
2024
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Podobné výsledky(10)

Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context Assessing BERT’s sensitivity to idiomaticity Kvazifrazémy v mluveném korpusu (ORAL)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Podobné výsledky(10)