Advanced Acoustic Modelling Techniques in MP3 Speech Recognition
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F15%3A00230829" target="_blank" >RIV/68407700:21230/15:00230829 - isvavai.cz</a>
Result on the web
<a href="http://asmp.eurasipjournals.com/content/2015/1/20" target="_blank" >http://asmp.eurasipjournals.com/content/2015/1/20</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1186/s13636-015-0064-7" target="_blank" >10.1186/s13636-015-0064-7</a>
Alternative languages
Result language
English
Original language name
Advanced Acoustic Modelling Techniques in MP3 Speech Recognition
Original language description
The automatic recognition of MP3 compressed speech presents a challenge to current systems due to the lossy nature of the compression, which causes irreversible degradation of the speech wave. This article evaluates the performance of a recognition system optimized for MP3 compressed speech with current state-of-the-art acoustic modelling techniques and one specific front-end compensation method. The article concentrates on acoustic model adaptation, discriminative training and additional dithering as a prominent means of compensating for the described distortion in the task of phoneme and large vocabulary continuous speech recognition (LVCSR). The experiments presented on the phoneme task show a dramatic increase of the recognition error for unvoiced speech units as a direct result of compression. The application of acoustic model adaptation has proved to yield the highest relative contribution while the gain of discriminative training diminished with decreasing bit-rate. The applicati
Czech name
—
Czech description
—
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
JA - Electronics and optoelectronics
OECD FORD branch
—
Result continuities
Project
—
Continuities
S - Specific university research
Others
Publication year
2015
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Name of the periodical
EURASIP Journal on Audio Speech and Music Processing
ISSN
1687-4722
e-ISSN
—
Volume of the periodical
2015:20
Issue of the periodical within the volume
July
Country of publishing house
CH - SWITZERLAND
Number of pages
7
Pages from-to
—
UT code for WoS article
000358610600001
EID of the result in the Scopus database
—