A New Approach for Semi-automatic Building and Extending a Multilingual Terminology Thesaurus
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F19%3A00109355" target="_blank" >RIV/00216224:14330/19:00109355 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1142/S0218213019500088" target="_blank" >http://dx.doi.org/10.1142/S0218213019500088</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1142/S0218213019500088" target="_blank" >10.1142/S0218213019500088</a>
Alternative languages
Result language
angličtina
Original language name
A New Approach for Semi-automatic Building and Extending a Multilingual Terminology Thesaurus
Original language description
This paper describes a new system for semi-automatically building, extending and managing a terminological thesaurus---a multilingual terminology dictionary enriched with relationships between the terms themselves to form a thesaurus. The system allows to radically enhance the workflow of current terminology expert groups, where most of the editing decisions still come from introspection. The presented system supplements the lexicographic process with natural language processing techniques, which are seamlessly integrated to the thesaurus editing environment. The system's methodology and the resulting thesaurus are closely connected to new domain corpora in the six languages involved. They are used for term usage examples as well as for the automatic extraction of new candidate terms. The terminological thesaurus is now accessible via a web-based application, which a) presents rich detailed information on each term, b) visualizes term relations, and c) displays real-life usage examples of the term in the domain-related documents and in the context-based similar terms. Furthermore, the specialized corpora are used to detect candidate translations of terms from the central language (Czech) to the other languages (English, French, German, Russian and Slovak) as well as to detect broader Czech terms, which help to place new terms in the actual thesaurus hierarchy. This project has been realized as a terminological thesaurus of land surveying, but the presented tools and methodology are reusable for other terminology domains.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
S - Specificky vyzkum na vysokych skolach
Others
Publication year
2019
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
International Journal on Artificial Intelligence Tools
ISSN
0218-2130
e-ISSN
1793-6349
Volume of the periodical
28
Issue of the periodical within the volume
2
Country of publishing house
US - UNITED STATES
Number of pages
21
Pages from-to
1-21
UT code for WoS article
000463577400004
EID of the result in the Scopus database
2-s2.0-85063990309