Dictionary Express : First Phases Rapid dictionary-making method for European, Asian and other languages
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14210%2F24%3A00137476" target="_blank" >RIV/00216224:14210/24:00137476 - isvavai.cz</a>
Výsledek na webu
<a href="https://asialex2024.org/conference-program/" target="_blank" >https://asialex2024.org/conference-program/</a>
DOI - Digital Object Identifier
—
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
Dictionary Express : First Phases Rapid dictionary-making method for European, Asian and other languages
Popis výsledku v původním jazyce
Dictionary Express (DE) is a new methodology combining automatic tools for lexicography and manual checking (annotation) of words, their forms, usage etc. The main goal of the project is to accelerate dictionary making faster and less demanding by separating the process into simple tasks, as opposed to the traditional dictionaries made entry-by-entry. This means the non-automatic work can be done by a small team of native speakers who are not professional linguists, supervised by a smaller team of developers and lexicographers. The data is acquired from big corpora of current web language usage, which helps the dictionary to be more accurate and up to date with the current language trends. In the past, several "rapid dictionaries" have been created using this method. The time needed to complete a DE project depends on the quality of the tagging of the corpus and the amount of the weekly workload. A DE project for Czech is now in the making, and apart from creating a new Czech dictionary, it focuses on analysing the rapid dictionary-making process and the input/output data. In this paper, we present the main annotation tasks of the DE methodology, the data preparation, and some interesting phenomena that occurred during the first phases of the Czech Dictionary Express.
Název v anglickém jazyce
Dictionary Express : First Phases Rapid dictionary-making method for European, Asian and other languages
Popis výsledku anglicky
Dictionary Express (DE) is a new methodology combining automatic tools for lexicography and manual checking (annotation) of words, their forms, usage etc. The main goal of the project is to accelerate dictionary making faster and less demanding by separating the process into simple tasks, as opposed to the traditional dictionaries made entry-by-entry. This means the non-automatic work can be done by a small team of native speakers who are not professional linguists, supervised by a smaller team of developers and lexicographers. The data is acquired from big corpora of current web language usage, which helps the dictionary to be more accurate and up to date with the current language trends. In the past, several "rapid dictionaries" have been created using this method. The time needed to complete a DE project depends on the quality of the tagging of the corpus and the amount of the weekly workload. A DE project for Czech is now in the making, and apart from creating a new Czech dictionary, it focuses on analysing the rapid dictionary-making process and the input/output data. In this paper, we present the main annotation tasks of the DE methodology, the data preparation, and some interesting phenomena that occurred during the first phases of the Czech Dictionary Express.
Klasifikace
Druh
D - Stať ve sborníku
CEP obor
—
OECD FORD obor
60203 - Linguistics
Návaznosti výsledku
Projekt
—
Návaznosti
S - Specificky vyzkum na vysokych skolach
Ostatní
Rok uplatnění
2024
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název statě ve sborníku
AsiaLex 2024 Proceedings : Asian Lexicography - Merging cutting-edge and established approaches
ISBN
9784990177126
ISSN
2197-4292
e-ISSN
2197-4306
Počet stran výsledku
6
Strana od-do
84-89
Název nakladatele
Toyo University
Místo vydání
Tokyo
Místo konání akce
Tokyo
Datum konání akce
12. 9. 2024
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—