Lexical data mining-based approach for the self-enrichment of LMF standardized dictionaries: Case of the syntactico-semantic knowledge
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F21%3A10441625" target="_blank" >RIV/00216208:11320/21:10441625 - isvavai.cz</a>
Result on the web
<a href="https://verso.is.cuni.cz/pub/verso.fpl?fname=obd_publikace_handle&handle=Itcfb.rlbg" target="_blank" >https://verso.is.cuni.cz/pub/verso.fpl?fname=obd_publikace_handle&handle=Itcfb.rlbg</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1002/cpe.6312" target="_blank" >10.1002/cpe.6312</a>
Alternative languages
Result language
angličtina
Original language name
Lexical data mining-based approach for the self-enrichment of LMF standardized dictionaries: Case of the syntactico-semantic knowledge
Original language description
The LMF ISO standard provides a large cover of lexical knowledge using a fine structure. However, like most of the electronic dictionaries, the available normalized LMF dictionaries comprise only basic morpho-syntactic and semantic knowledge, such as the meanings of lexical entries through the definitions and the associated examples, and sometimes the indication of the synonyms and antonyms. Other sophisticated knowledge, such as the syntactic behaviors, semantic classes and syntactico-semantic links, which are scarce, requires a high expertise and its adding to dictionaries is expensive. In fact in this paper, we propose an approach of lexical data mining of the widely available textual content associated with the meanings, notably in the normalized LMF dictionaries, in order to perform the self-enrichment of these dictionaries. First, we contribute to the enrichment of the syntactic behaviors by linking them to the suitable meanings. Second, we focus on the enrichment of the meanings of LMF lexical entries with semantic classes based on the Gaston Gross semantic classification. Finally, we establish the syntactico-semantic links based on the results of the syntactic and semantic enrichment processes. The proposed approach has been consolidated by an experimentation carried out on an available normalized LMF dictionary for Arabic language.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
—
Others
Publication year
2021
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Concurrency Computation Practice and Experience
ISSN
1532-0626
e-ISSN
1532-0634
Volume of the periodical
33
Issue of the periodical within the volume
17
Country of publishing house
GB - UNITED KINGDOM
Number of pages
32
Pages from-to
e6312
UT code for WoS article
000640935900001
EID of the result in the Scopus database
2-s2.0-85104407123