Large vocabulary ASR for spontaneous Czech in the MALACH project
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F03%3A00000157" target="_blank" >RIV/49777513:23520/03:00000157 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Large vocabulary ASR for spontaneous Czech in the MALACH project
Original language description
This paper describes LVCSR research into the automatic transcription of spontaneous Czech speech in the MALACH project. This project attempts to provide improved access to the large multilingual spoken archives collected by the Survivors of the Shoah Visual History Foundation by advancing the state of the art in automated speech recognition. We describe a baseline ASR system and discuss the problems in language modeling that arise from the nature of Czech as a highly inflectional language that also exhibits diglossia between its written and spontaneous forms.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JD - Use of computers, robotics and its application
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/LN00A063" target="_blank" >LN00A063: Centre of Computational Linguistics</a>
Continuities
P - Research and development project financed from public funds (with a link to CEP)<br>Z - Research plan (with a link to CEZ)
Others
Publication year
2003
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
EUROSPEECH 2003 PROCEEDINGS
ISBN
—
ISSN
—
e-ISSN
—
Number of pages
4
Pages from-to
1821-1824
Publisher name
ISCA
Place of publication
Geneva
Event location
Geneva
Event date
Sep 1, 2003
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—