Compensation of Nonlinear Distortions in Speech for Automatic Recognition
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F15%3A00003411" target="_blank" >RIV/46747885:24220/15:00003411 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1109/TSP.2015.7296378" target="_blank" >http://dx.doi.org/10.1109/TSP.2015.7296378</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/TSP.2015.7296378" target="_blank" >10.1109/TSP.2015.7296378</a>
Alternative languages
Result language
angličtina
Original language name
Compensation of Nonlinear Distortions in Speech for Automatic Recognition
Original language description
This paper addresses improvement of automatic transcription of speech distorted already during recording or by consequent processing. We focus on distortions that cannot be represented by most often used models, that is, as an additive noise or a linear convolutive channel distortion. We consider a) signals distorted through overgained microphone preamplifier and b) recordings exhibiting unnatural spectral sparseness, caused by application of excessive denoising or low-bit-rate compression. We demonstrate that these distortions deteriorate ASR accuracy significantly. To compensate, we propose to employ a combination of two general robust speech recognition techniques: a front-end feature normalization method and a channel/speaker adaptation technique. We present a significant improvement of transcription accuracy in the case of lectures distorted during recording, compressed broadcast data and utterances recorded with an inappropriately applied denoising..
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JC - Computer hardware and software
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/TA01011142" target="_blank" >TA01011142: Automatic transcription and indexation of lectures</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2015
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
38th International Conference on Telecommunications and Signal Processing, TSP 2015
ISBN
978-1-4799-8498-5
ISSN
—
e-ISSN
—
Number of pages
5
Pages from-to
419-423
Publisher name
Institute of Electrical and Electronics Engineers Inc.
Place of publication
Praha, Česká Republika
Event location
Praha
Event date
—
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—