Analysis and automatic recognition of compressed speech
Result identifiers
Result code in IS VaVaI
RIV/68407700:21230/15:00230135 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F15%3A00230135)
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Title in original language
Analysis and automatic recognition of compressed speech
Result description in original language
The deployment of automatic speech recognition (ASR) systems in real-life settings is often complicated by diverse acoustic conditions. This diversity makes it necessary to build robust systems that perform reliably regardless of the conditions. MP3 compression represents one such condition: lossy encoding degrades the quality of the extracted features and therefore the recognition. Various authors have studied optimized settings for MP3 recognition and proposed different solutions. This work presents an analysis of an optimized setup focused on the feature extraction and acoustic modeling blocks. It summarizes the effects of methods proposed by the author and by other authors, all tested to determine the potential contribution of each method separately as well as in combination. The main goal of the optimization was to find the proper segmentation, determine the importance of fea
Title in English
Analysis and automatic recognition of compressed speech
Result description in English
The deployment of automatic speech recognition (ASR) systems in real-life settings is often complicated by diverse acoustic conditions. This diversity makes it necessary to build robust systems that perform reliably regardless of the conditions. MP3 compression represents one such condition: lossy encoding degrades the quality of the extracted features and therefore the recognition. Various authors have studied optimized settings for MP3 recognition and proposed different solutions. This work presents an analysis of an optimized setup focused on the feature extraction and acoustic modeling blocks. It summarizes the effects of methods proposed by the author and by other authors, all tested to determine the potential contribution of each method separately as well as in combination. The main goal of the optimization was to find the proper segmentation, determine the importance of fea
Classification
Type
C - Chapter in a scholarly book
CEP field
IN - Informatics
OECD FORD field
—
Result linkages
Project
—
Linkages
S - Specific research at universities
Other
Year of implementation
2015
Data confidentiality code
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific to the result type
Title of the book or collection
Tackling the Complexity in Speech
ISBN
978-80-7308-558-2
Number of pages of the result
17
Pages from-to
205-221
Number of pages of the book
230
Publisher name
Filozofická fakulta Univerzity Karlovy v Praze
Place of publication
Praha
UT WoS code of the chapter
—