Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization

Result description

—

Keywords

Result identifiers

Result code in IS VaVaI
RIV/00216208:11320/23:DLF75ZZZ - isvavai.cz
Result on the web
http://arxiv.org/abs/2311.09344
DOI - Digital Object Identifier
—

Alternative languages

Result language
Swedish
Title in the original language
Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization
Result description in the original language
"Parameter-efficient fine-tuning (PEFT) using labeled task data can significantly improve the performance of large language models (LLMs) on the downstream task. However, there are 7000 languages in the world and many of these languages lack labeled data for real-world language generation tasks. In this paper, we propose to improve zero-shot cross-lingual transfer by composing language or task specialized parameters. Our method composes language and task PEFT modules via element-wise arithmetic operations to leverage unlabeled data and English labeled data. We extend our approach to cases where labeled data from more languages is available and propose to arithmetically compose PEFT modules trained on languages related to the target. Empirical results on summarization demonstrate that our method is an effective strategy that obtains consistent gains using minimal training of PEFT modules."
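The abstract describes composing language and task PEFT modules through element-wise arithmetic but does not state the exact formula. The following is a minimal sketch of one such operation, an element-wise weighted sum, and should not be read as the authors' implementation. The `Module` representation, the `compose` helper, the parameter names `lora_A` and `lora_B`, and the 0.5 coefficients are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Sequence

# Hypothetical representation: parameter name -> flat list of weights.
Module = Dict[str, List[float]]


def compose(modules: Sequence[Module], coeffs: Sequence[float]) -> Module:
    """Return the element-wise weighted sum of PEFT modules.

    All modules must share the same parameter names and lengths.
    """
    if not modules or len(modules) != len(coeffs):
        raise ValueError("need one coefficient per module")
    names = set(modules[0])
    if any(set(m) != names for m in modules):
        raise ValueError("modules must have identical parameter names")

    out: Module = {}
    for name in modules[0]:
        vectors = [m[name] for m in modules]
        length = len(vectors[0])
        if any(len(v) != length for v in vectors):
            raise ValueError(f"length mismatch for parameter {name!r}")
        # Combine position by position across all modules.
        out[name] = [
            sum(c * v[i] for c, v in zip(coeffs, vectors))
            for i in range(length)
        ]
    return out


# Toy values (assumed): a task module trained on English labeled data and a
# language module trained on unlabeled target-language text.
task_en = {"lora_A": [0.5, -1.0], "lora_B": [2.0, 0.0]}
lang_target = {"lora_A": [1.0, 1.0], "lora_B": [0.0, 4.0]}

# Equal-weight combination; the coefficients are illustrative only.
zero_shot = compose([task_en, lang_target], [0.5, 0.5])
```

Negative coefficients in the same helper would express subtraction of one module from another, and a longer list of modules would cover the case the abstract mentions where modules from several related languages are combined.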
Title in English
Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization
Result description in English
"Parameter-efficient fine-tuning (PEFT) using labeled task data can significantly improve the performance of large language models (LLMs) on the downstream task. However, there are 7000 languages in the world and many of these languages lack labeled data for real-world language generation tasks. In this paper, we propose to improve zero-shot cross-lingual transfer by composing language or task specialized parameters. Our method composes language and task PEFT modules via element-wise arithmetic operations to leverage unlabeled data and English labeled data. We extend our approach to cases where labeled data from more languages is available and propose to arithmetically compose PEFT modules trained on languages related to the target. Empirical results on summarization demonstrate that our method is an effective strategy that obtains consistent gains using minimal training of PEFT modules."

Classification

Type
O - Other results
CEP field
—
OECD FORD field
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)

Result linkages

Project
—
Linkages
—

Other

Year of application
2023
Data confidentiality code
S - Complete and truthful project data are not subject to protection under special legal regulations
