Building of a Vocabulary for the Automatic Voice-Dictation System
The result's identifiers
Result code in IS VaVaI
RIV/46747885:24310/03:00000064 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24310%2F03%3A00000064)
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Building of a Vocabulary for the Automatic Voice-Dictation System
Original language description
The article describes the creation of a large (800K) vocabulary for a Czech voice-dictation system. Such a lexicon differs from one intended for text processing because its look-up entry is the phonetic rather than the textual form of a word. It should contain most words and word forms occurring in standard (non-colloquial) spoken language; it should include separate entries for identical written forms that are pronounced differently; and, on the other hand, it can omit spelling variants of the same word. The main goal of the study is to propose and compare strategies for compiling such a large-scale Czech vocabulary, keeping it as small as possible while ensuring that it covers as much as possible of text that is not known in advance.
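The trade-off between vocabulary size and coverage of unseen text mentioned in the description can be illustrated with a minimal sketch. It is not the authors' method: it assumes a plain frequency cut-off as the selection strategy, and the function names (`build_vocabulary`, `coverage`) and the toy sentences are hypothetical, chosen only for illustration.

```python
from collections import Counter


def build_vocabulary(training_tokens, max_size):
    """Keep the max_size most frequent word forms seen in the training text.

    Assumption: a simple frequency cut-off; the record does not say which
    strategies the article actually compares.
    """
    counts = Counter(training_tokens)
    return {word for word, _ in counts.most_common(max_size)}


def coverage(vocabulary, test_tokens):
    """Return the fraction of running tokens in test_tokens found in vocabulary."""
    if not test_tokens:
        return 0.0
    hits = sum(1 for token in test_tokens if token in vocabulary)
    return hits / len(test_tokens)


# Toy data (hypothetical): a training text and a held-out text
# that stands in for text "not known in advance".
training = "pes běží a pes spí a kočka spí".split()
held_out = "pes spí a myš běží".split()

for size in (3, 5):
    vocab = build_vocabulary(training, size)
    print(size, coverage(vocab, held_out))
```

Growing the vocabulary can only keep or raise coverage of the held-out text, while words that never occur in the training data (here `myš`) stay uncovered at any size. A lexicon for dictation would additionally key entries by pronunciation rather than spelling, which this sketch leaves out.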
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JD - Use of computers, robotics and its application
OECD FORD branch
—
Result continuities
Project
—
Continuities
Z - Research plan (with a link to CEZ)
Others
Publication year
2003
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Text, Speech and Dialogue - 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings
ISBN
3-540-20024-X
ISSN
—
e-ISSN
—
Number of pages
8
Pages from-to
301
Publisher name
Springer-Verlag
Place of publication
Heidelberg
Event location
České Budějovice
Event date
Sep 3, 2003
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—