Dictionary Express : First Phases Rapid dictionary-making method for European, Asian and other languages
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14210%2F24%3A00137476" target="_blank" >RIV/00216224:14210/24:00137476 - isvavai.cz</a>
Result on the web
<a href="https://asialex2024.org/conference-program/" target="_blank" >https://asialex2024.org/conference-program/</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Dictionary Express : First Phases Rapid dictionary-making method for European, Asian and other languages
Original language description
Dictionary Express (DE) is a new methodology that combines automatic lexicographic tools with manual checking (annotation) of words, their forms, usage, etc. The main goal of the project is to make dictionary making faster and less demanding by separating the process into simple tasks, in contrast to traditional dictionaries, which are compiled entry by entry. This means the non-automatic work can be done by a small team of native speakers who are not professional linguists, supervised by a smaller team of developers and lexicographers. The data is acquired from large corpora of current web language usage, which helps the dictionary be more accurate and up to date with current language trends. Several "rapid dictionaries" have already been created using this method. The time needed to complete a DE project depends on the quality of the corpus tagging and on the weekly workload. A DE project for Czech is now under way; apart from creating a new Czech dictionary, it focuses on analysing the rapid dictionary-making process and the input/output data. In this paper, we present the main annotation tasks of the DE methodology, the data preparation, and some interesting phenomena that occurred during the first phases of the Czech Dictionary Express.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
60203 - Linguistics
Result continuities
Project
—
Continuities
S - Specific university research
Others
Publication year
2024
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
AsiaLex 2024 Proceedings : Asian Lexicography - Merging cutting-edge and established approaches
ISBN
9784990177126
ISSN
2197-4292
e-ISSN
2197-4306
Number of pages
6
Pages from-to
84-89
Publisher name
Toyo University
Place of publication
Tokyo
Event location
Tokyo
Event date
Sep 12, 2024
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—