Small and Large Vocabulary Speech Recognition of MP3 Data under Real-Word Conditions: Experimental Study
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F12%3A00200338" target="_blank" >RIV/68407700:21230/12:00200338 - isvavai.cz</a>
Výsledek na webu
<a href="http://link.springer.com/chapter/10.1007/978-3-642-35755-8_29#" target="_blank" >http://link.springer.com/chapter/10.1007/978-3-642-35755-8_29#</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-642-35755-8_29" target="_blank" >10.1007/978-3-642-35755-8_29</a>
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
Small and Large Vocabulary Speech Recognition of MP3 Data under Real-Word Conditions: Experimental Study
Popis výsledku v původním jazyce
This paper presents the study of speech recognition accuracy both for small and large vocabulary task with respect to different levels of MP3 compression of processed data. The motivation behind the work was to evaluate the usage of ASR system for off-line automatic transcription of recordings collected from standard present MP3 devices under different levels of background noise and channel distortion. Although MP3 may not be an optimal compression algorithm, the performed experiments have prooved thatit does not distort speech signal significantly for higher compression rates. Realized experiments showed also that the accuracy of speech recognition (both small- and large-vocabulary) decreased very slowly for the bit-rate of 24 kbps and higher. However, slightly different setup of speech feature computation is necessary for MP3 speech data, mainly PLP features give significantly better results in comparison to MFCC.
Název v anglickém jazyce
Small and Large Vocabulary Speech Recognition of MP3 Data under Real-Word Conditions: Experimental Study
Popis výsledku anglicky
This paper presents the study of speech recognition accuracy both for small and large vocabulary task with respect to different levels of MP3 compression of processed data. The motivation behind the work was to evaluate the usage of ASR system for off-line automatic transcription of recordings collected from standard present MP3 devices under different levels of background noise and channel distortion. Although MP3 may not be an optimal compression algorithm, the performed experiments have prooved thatit does not distort speech signal significantly for higher compression rates. Realized experiments showed also that the accuracy of speech recognition (both small- and large-vocabulary) decreased very slowly for the bit-rate of 24 kbps and higher. However, slightly different setup of speech feature computation is necessary for MP3 speech data, mainly PLP features give significantly better results in comparison to MFCC.
Klasifikace
Druh
J<sub>x</sub> - Nezařazeno - Článek v odborném periodiku (Jimp, Jsc a Jost)
CEP obor
JA - Elektronika a optoelektronika, elektrotechnika
OECD FORD obor
—
Návaznosti výsledku
Projekt
<a href="/cs/project/GA102%2F08%2F0707" target="_blank" >GA102/08/0707: Rozpoznávání mluvené řeči v reálných podmínkách</a><br>
Návaznosti
S - Specificky vyzkum na vysokych skolach
Ostatní
Rok uplatnění
2012
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název periodika
Communications in Computer and Information Science
ISSN
1865-0929
e-ISSN
—
Svazek periodika
314
Číslo periodika v rámci svazku
—
Stát vydavatele periodika
DE - Spolková republika Německo
Počet stran výsledku
11
Strana od-do
409-419
Kód UT WoS článku
000315973800029
EID výsledku v databázi Scopus
—