Building of a Vocabulary for the Automatic Voice-Dictation System

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24310%2F03%3A00000064" target="_blank" >RIV/46747885:24310/03:00000064 - isvavai.cz</a>
Výsledek na webu
—
DOI - Digital Object Identifier
—

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Building of a Vocabulary for the Automatic Voice-Dictation System
Popis výsledku v původním jazyce
The article describes a process of creation of a large (800K) vocabulary for a voice-dictation system in Czech language. Such a lexicon has special features different from a lexicon intended for text processing because its look-up entry is a phonetic rather than textual form of a word. It should contain most words and word-forms occurring in standard (non-colloquial) spoken language, it should include separate entries for identical written forms if they are pronounced in different way and, on the otherhand, it can omit different spelling variations of the same word. The main goal of this study is to propose and compare various strategies of compiling such a large-scale Czech vocabulary with the aim to keep it as small as possible and at the same timeto ensure that it covers the maximum of some in advance unknown text.
Název v anglickém jazyce
Building of a Vocabulary for the Automatic Voice-Dictation System
Popis výsledku anglicky
The article describes a process of creation of a large (800K) vocabulary for a voice-dictation system in Czech language. Such a lexicon has special features different from a lexicon intended for text processing because its look-up entry is a phonetic rather than textual form of a word. It should contain most words and word-forms occurring in standard (non-colloquial) spoken language, it should include separate entries for identical written forms if they are pronounced in different way and, on the otherhand, it can omit different spelling variations of the same word. The main goal of this study is to propose and compare various strategies of compiling such a large-scale Czech vocabulary with the aim to keep it as small as possible and at the same timeto ensure that it covers the maximum of some in advance unknown text.

Klasifikace

Druh
D - Stať ve sborníku
CEP obor
JD - Využití počítačů, robotika a její aplikace
OECD FORD obor
—

Návaznosti výsledku

Projekt
—
Návaznosti
Z - Vyzkumny zamer (s odkazem do CEZ)

Ostatní

Rok uplatnění
2003
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název statě ve sborníku
Text, Speech and Dialogue - 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings
ISBN
3-540-20024-X
ISSN
—
e-ISSN
—
Počet stran výsledku
8
Strana od-do
301
Název nakladatele
Springer-Verlag
Místo vydání
Heidelberg
Místo konání akce
České Budějovice
Datum konání akce
3. 9. 2003
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—

Podobné výsledky(10)

Looking for sense in nonsense: Valency of negative forms of nouns and adjectives in the NomVallex lexicon Tvorba slovníku a jazykového modelu pro rozpoznávání řeči s velmi velkým slovníkem v češtině Aplikace počítačové lingvistiky při tvorbě on-line kurzu češtiny mluvtecesky.net

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Building of a Vocabulary for the Automatic Voice-Dictation System

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)