A Systematic Review on Semantic Role Labeling for Information Extraction in Low-Resource Data
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F25%3AUENRIWTI" target="_blank" >RIV/00216208:11320/25:UENRIWTI - isvavai.cz</a>
Výsledek na webu
<a href="https://www.scopus.com/inward/record.uri?eid=2-s2.0-85191327866&doi=10.1109%2fACCESS.2024.3392370&partnerID=40&md5=8dbcb733c50fe5f1cf12cc1fb99694ac" target="_blank" >https://www.scopus.com/inward/record.uri?eid=2-s2.0-85191327866&doi=10.1109%2fACCESS.2024.3392370&partnerID=40&md5=8dbcb733c50fe5f1cf12cc1fb99694ac</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/ACCESS.2024.3392370" target="_blank" >10.1109/ACCESS.2024.3392370</a>
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
A Systematic Review on Semantic Role Labeling for Information Extraction in Low-Resource Data
Popis výsledku v původním jazyce
Challenges in the big data phenomenon arise due to the existence of unstructured text data, which is very large, comes from various sources, has various formats, and contains much noise. The complexity of unstructured text data makes it difficult to extract useful information. Therefore, a process is needed to transform it into structured data to be processed further. The information Extraction (IE) process helps to extract relationships, entities, semantic roles, and events from unstructured text data by converting them into structured output. One of IE's tasks is Semantic Role Labeling (SRL), which has a crucial function in identifying semantic roles in a sentence so that it can enrich the understanding of the text. However, much of SRL development focuses on high-resource data, especially in English. The limited development of SRL in specific low-resource languages or domains is a complex challenge. This research aims to conduct a systematic study on the development of SRL for low-resource data, both in low-resource language or domain-specific contexts. The review process was carried out systematically using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) model, and 54 quality papers were obtained from the filtering process (from 2018 to 2023). We review several essential points, including (1) datasets that are often used for SRL tasks and their labeling strategies for low-resource data, (2) methods that have currently been developed for SRL tasks and learning scenarios when dealing with low-resource data, (4) evaluation metrics, (5) application of SRL tasks. This review is complemented by a discussion of issues and potential solutions for developing SRL on low-resource data to help researchers develop SRL more effectively in dealing with the challenges faced with low-resource data. © 2013 IEEE.
Název v anglickém jazyce
A Systematic Review on Semantic Role Labeling for Information Extraction in Low-Resource Data
Popis výsledku anglicky
Challenges in the big data phenomenon arise due to the existence of unstructured text data, which is very large, comes from various sources, has various formats, and contains much noise. The complexity of unstructured text data makes it difficult to extract useful information. Therefore, a process is needed to transform it into structured data to be processed further. The information Extraction (IE) process helps to extract relationships, entities, semantic roles, and events from unstructured text data by converting them into structured output. One of IE's tasks is Semantic Role Labeling (SRL), which has a crucial function in identifying semantic roles in a sentence so that it can enrich the understanding of the text. However, much of SRL development focuses on high-resource data, especially in English. The limited development of SRL in specific low-resource languages or domains is a complex challenge. This research aims to conduct a systematic study on the development of SRL for low-resource data, both in low-resource language or domain-specific contexts. The review process was carried out systematically using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) model, and 54 quality papers were obtained from the filtering process (from 2018 to 2023). We review several essential points, including (1) datasets that are often used for SRL tasks and their labeling strategies for low-resource data, (2) methods that have currently been developed for SRL tasks and learning scenarios when dealing with low-resource data, (4) evaluation metrics, (5) application of SRL tasks. This review is complemented by a discussion of issues and potential solutions for developing SRL on low-resource data to help researchers develop SRL more effectively in dealing with the challenges faced with low-resource data. © 2013 IEEE.
Klasifikace
Druh
J<sub>SC</sub> - Článek v periodiku v databázi SCOPUS
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Návaznosti výsledku
Projekt
—
Návaznosti
—
Ostatní
Rok uplatnění
2024
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název periodika
IEEE Access
ISSN
2169-3536
e-ISSN
—
Svazek periodika
12
Číslo periodika v rámci svazku
2024
Stát vydavatele periodika
US - Spojené státy americké
Počet stran výsledku
30
Strana od-do
57917-57946
Kód UT WoS článku
—
EID výsledku v databázi Scopus
2-s2.0-85191327866