Compensation of Nonlinear Distortions in Speech for Automatic Recognition
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F14%3A%230002972" target="_blank" >RIV/46747885:24220/14:#0002972 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Compensation of Nonlinear Distortions in Speech for Automatic Recognition
Original language description
This paper addresses improvement of automatic transcription of speech distorted already during recording or by consequent processing. We focus on distortions that cannot be represented by most often used models, that is, as an additive noise or a linearconvolutive channel distortion. We consider a) signals distorted through overgained microphone preamplifier and b) recordings exhibiting unnatural spectral sparseness, caused by application of excessive denoising or low-bit-rate compression. We demonstrate that these distortions deteriorate ASR accuracy significantly. To compensate, we propose to employ a combination of two general robust speech recognition techniques: a front-end feature normalization method and a channel/speaker adaptation technique.We present a significant improvement of transcription accuracy in the case of lectures distorted during recording, compressed broadcast data and utterances recorded with an inappropriately applied denoising.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JC - Computer hardware and software
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/TA01011142" target="_blank" >TA01011142: Automatic transcription and indexation of lectures</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2014
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proc. of Telecommunications and Signal Processing (TSP) conference
ISBN
978-80-214-4983-1
ISSN
—
e-ISSN
—
Number of pages
5
Pages from-to
419-423
Publisher name
IEEE
Place of publication
Berlín, Německo
Event location
Berlín, Německo
Event date
Jan 1, 2014
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—