TermoUD — a language-independent terminology extraction tool
Result identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3AEM5VFV6W" target="_blank" >RIV/00216208:11320/23:EM5VFV6W - isvavai.cz</a>
Result on the web
<a href="https://www.scopus.com/inward/record.uri?eid=2-s2.0-85159861042&partnerID=40&md5=67a7598d0b6ba31dbdfc96088966fdc3" target="_blank" >https://www.scopus.com/inward/record.uri?eid=2-s2.0-85159861042&partnerID=40&md5=67a7598d0b6ba31dbdfc96088966fdc3</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Title in original language
TermoUD — a language-independent terminology extraction tool
Result description in original language
"The paper addresses TermoUD — a language-independent terminology extraction tool. Its previous version, i.e. TermoPL (Marciniak et al., 2016; Rychlik et al., 2022), uses language dependent shallow grammar which selects candidate terms. The goal behind the development of TermoUD is to make the procedure as universal as possible, while taking care of the linguistic correctness of selected phrases. The tool is suitable for languages for which the Universal Dependencies (UD) parser exists. We describe a method of candidate term extraction based on UD POS tags and UD relations. The candidate ranking is performed by the C-value metric (contexts counting is adapted to the UD formalism), which doesn’t need any additional language resources. The performance of the tool has been tested on texts in English, French, Dutch, and Slovenian. The results are evaluated on the manually annotated datasets: ACTER, RD-TEC 2.0, GENIA and RSDO5, and compared to those obtained by other tools. © 2023 Association for Computational Linguistics."
Title in English
TermoUD — a language-independent terminology extraction tool
Result description in English
"The paper addresses TermoUD — a language-independent terminology extraction tool. Its previous version, i.e. TermoPL (Marciniak et al., 2016; Rychlik et al., 2022), uses language dependent shallow grammar which selects candidate terms. The goal behind the development of TermoUD is to make the procedure as universal as possible, while taking care of the linguistic correctness of selected phrases. The tool is suitable for languages for which the Universal Dependencies (UD) parser exists. We describe a method of candidate term extraction based on UD POS tags and UD relations. The candidate ranking is performed by the C-value metric (contexts counting is adapted to the UD formalism), which doesn’t need any additional language resources. The performance of the tool has been tested on texts in English, French, Dutch, and Slovenian. The results are evaluated on the manually annotated datasets: ACTER, RD-TEC 2.0, GENIA and RSDO5, and compared to those obtained by other tools. © 2023 Association for Computational Linguistics."
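The description above names the C-value metric as the ranking step. As a rough illustration only, the following sketch computes the classic C-value (Frantzi et al.) over a toy table of candidate frequencies. It is not TermoUD's implementation: the UD-adapted context counting mentioned in the description is not reproduced, and the function names (`is_nested`, `c_values`) and the example frequencies are hypothetical. Note that in this classic form single-word candidates score zero, because log2(1) = 0.

```python
import math


def is_nested(short, long):
    """True if `short` occurs as a contiguous token run inside the longer `long`."""
    n, m = len(short), len(long)
    return n < m and any(long[i:i + n] == short for i in range(m - n + 1))


def c_values(freq):
    """Return the classic C-value for every candidate term.

    freq maps a candidate (tuple of tokens) to its total corpus frequency,
    counting occurrences nested inside longer candidates as well.
    """
    scores = {}
    for cand, f in freq.items():
        # Frequencies of all longer candidates that contain this one.
        containers = [freq[other] for other in freq if is_nested(cand, other)]
        if containers:
            # Discount by the mean frequency of the containing candidates.
            f = f - sum(containers) / len(containers)
        scores[cand] = math.log2(len(cand)) * f
    return scores


# Hypothetical toy frequencies, for illustration only.
freq = {
    ("terminology", "extraction"): 5,
    ("terminology", "extraction", "tool"): 3,
    ("automatic", "terminology", "extraction"): 1,
}
scores = c_values(freq)
for term, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(" ".join(term), round(score, 3))
```

In this toy input the two-word candidate is penalised because many of its occurrences come from inside the two longer candidates, which is the behaviour the metric is designed to capture.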
Classification
Type
D - Article in proceedings
CEP field
—
OECD FORD field
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result linkages
Project
—
Linkages
—
Other
Year of implementation
2023
Data confidentiality code
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific to the result type
Article name in the proceedings
"EACL - Conf. Eur. Chapter Assoc. Comput. Linguist., Proc. Syst. Demonstr."
ISBN
978-195942945-6
ISSN
—
e-ISSN
—
Number of pages
9
Pages from-to
178-186
Publisher name
Association for Computational Linguistics (ACL)
Place of publication
—
Event location
Dubrovnik
Event date
1 January 2023
Event type by nationality
WRD - Worldwide event
UT WoS code of the article
—