Small and Large Vocabulary Speech Recognition of MP3 Data under Real-Word Conditions: Experimental Study
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F12%3A00200338" target="_blank" >RIV/68407700:21230/12:00200338 - isvavai.cz</a>
Result on the web
<a href="http://link.springer.com/chapter/10.1007/978-3-642-35755-8_29#" target="_blank" >http://link.springer.com/chapter/10.1007/978-3-642-35755-8_29#</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-642-35755-8_29" target="_blank" >10.1007/978-3-642-35755-8_29</a>
Alternative languages
Result language
angličtina
Original language name
Small and Large Vocabulary Speech Recognition of MP3 Data under Real-Word Conditions: Experimental Study
Original language description
This paper presents the study of speech recognition accuracy both for small and large vocabulary task with respect to different levels of MP3 compression of processed data. The motivation behind the work was to evaluate the usage of ASR system for off-line automatic transcription of recordings collected from standard present MP3 devices under different levels of background noise and channel distortion. Although MP3 may not be an optimal compression algorithm, the performed experiments have prooved thatit does not distort speech signal significantly for higher compression rates. Realized experiments showed also that the accuracy of speech recognition (both small- and large-vocabulary) decreased very slowly for the bit-rate of 24 kbps and higher. However, slightly different setup of speech feature computation is necessary for MP3 speech data, mainly PLP features give significantly better results in comparison to MFCC.
Czech name
—
Czech description
—
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
JA - Electronics and optoelectronics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/GA102%2F08%2F0707" target="_blank" >GA102/08/0707: Speech Recognition under Real-World Conditions</a><br>
Continuities
S - Specificky vyzkum na vysokych skolach
Others
Publication year
2012
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Communications in Computer and Information Science
ISSN
1865-0929
e-ISSN
—
Volume of the periodical
314
Issue of the periodical within the volume
—
Country of publishing house
DE - GERMANY
Number of pages
11
Pages from-to
409-419
UT code for WoS article
000315973800029
EID of the result in the Scopus database
—