Accuracy of MP3 Speech Recognition Under Real-World Conditions. Experimental Study
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F11%3A00181583" target="_blank" >RIV/68407700:21230/11:00181583 - isvavai.cz</a>
Result on the web
<a href="http://www.sigmap.icete.org" target="_blank" >http://www.sigmap.icete.org</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Accuracy of MP3 Speech Recognition Under Real-World Conditions. Experimental Study
Original language description
This paper presents the study of speech recognition accuracy with respect to different levels of MP3 compression. Special attention is focused on the processing of speech signals with different quality, i.e. with different level of background noise and channel distortion. The work was motivated by possible usage of ASR for offline automatic transcription of audio recordings collected by standard wide-spread MP3 devices. The realized experiments have proved that although MP3 format does not distort speech significantly especially for high or moderate bit rates and high quality of source data. The accuracy of connected digits ASR decreased very slowly up to the bit rate 24 kbps. For the best case of PLP parameterization in close-talk channel just 3% decrease of recognition accuracy was observed while the size of the compressed file was approximately 10% of the original size. All results were slightly worse under presence of additive background noise and channel distortion.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JA - Electronics and optoelectronics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/GA102%2F08%2F0707" target="_blank" >GA102/08/0707: Speech Recognition under Real-World Conditions</a><br>
Continuities
Z - Vyzkumny zamer (s odkazem do CEZ)
Others
Publication year
2011
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of SIGMAP 2011 - International Conference on Signal Processing and Multimedia Applications.
ISBN
978-989-8425-72-0
ISSN
—
e-ISSN
—
Number of pages
6
Pages from-to
5-10
Publisher name
University of Seville
Place of publication
Sevilla
Event location
Sevilla
Event date
Jul 18, 2011
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—