Multilingual Clinical NER: Translation or Cross-lingual Transfer?

The result's identifiers

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3ASR9KPR9Y" target="_blank" >RIV/00216208:11320/23:SR9KPR9Y - isvavai.cz</a>
Result on the web
<a href="https://www.scopus.com/inward/record.uri?eid=2-s2.0-85175474593&partnerID=40&md5=676e67f8858c45aab9d9d558d1dec0bf" target="_blank" >https://www.scopus.com/inward/record.uri?eid=2-s2.0-85175474593&partnerID=40&md5=676e67f8858c45aab9d9d558d1dec0bf</a>
DOI - Digital Object Identifier
—

Alternative languages

Result language
angličtina
Original language name
Multilingual Clinical NER: Translation or Cross-lingual Transfer?
Original language description
"Natural language tasks like Named Entity Recognition (NER) in the clinical domain on non-English texts can be very time-consuming and expensive due to the lack of annotated data. Cross-lingual transfer (CLT) is a way to circumvent this issue thanks to the ability of multilingual large language models to be fine-tuned on a specific task in one language and to provide high accuracy for the same task in another language. However, other methods leveraging translation models can be used to perform NER without annotated data in the target language, by either translating the training set or test set. This paper compares cross-lingual transfer with these two alternative methods, to perform clinical NER in French and in German without any training data in those languages. To this end, we release MedNERF a medical NER test set extracted from French drug prescriptions and annotated with the same guidelines as an English dataset. Through extensive experiments on this dataset and on a German medical dataset (Frei and Kramer, 2021), we show that translation-based methods can achieve similar performance to CLT but require more care in their design. And while they can take advantage of monolingual clinical language models, those do not guarantee better results than large general-purpose multilingual models, whether with cross-lingual transfer or translation. © 2023 Association for Computational Linguistics."
Czech name
—
Czech description
—

Classification

Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Result continuities

Project
—
Continuities
—

Others

Publication year
2023
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

Article name in the collection
"Proc. Annu. Meet. Assoc. Comput Linguist."
ISBN
978-195942988-3
ISSN
0736-587X
e-ISSN
—
Number of pages
23
Pages from-to
289-311
Publisher name
Association for Computational Linguistics (ACL)
Place of publication
—
Event location
Melaka, Malaysia
Event date
Jan 1, 2023
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—

Similar results(10)

Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark Czech Dataset for Cross-lingual Subjectivity Classification Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark

What are you looking for?

Quick search

Smart search

Multilingual Clinical NER: Translation or Cross-lingual Transfer?

The result's identifiers

Alternative languages

Classification

Result continuities

Others

Data specific for result type

Similar results(10)

What are you looking for?

Quick search

Smart search

Result description

The result's identifiers

The result's identifiers

Alternative languages

Alternative languages

Classification

Classification

Result continuities

Result continuities

Others

Others

Data specific for result type

Data specific for result type

Similar results(10)