Phone Speech Detection and Recognition in the Task of Historical Radio Broadcast Transcription
Result identifiers
Result code in IS VaVaI
RIV/46747885:24220/15:#0003421 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F15%3A%230003421)
Result on the web
http://dx.doi.org/10.1109/TSP.2015.7296399
DOI - Digital Object Identifier
10.1109/TSP.2015.7296399 (http://dx.doi.org/10.1109/TSP.2015.7296399)
Alternative languages
Result language
English
Title in the original language
Phone Speech Detection and Recognition in the Task of Historical Radio Broadcast Transcription
Result description in the original language
This paper deals with methods and strategies for improving a system for the automatic transcription of the historical Czech Radio audio archive. The main goal of this work was to improve the recognition of audio signals containing phone speech, for which the recognition rate was relatively low because phone signals are frequency-limited. A phone signal detector based on Gaussian mixture models (GMM) was developed and integrated into our speech transcription system. Several different acoustic models were experimentally tested to enhance phone speech signal recognition. We demonstrate that phone speech recognition improves significantly if acoustic models based on hidden Markov models (HMM) are trained directly on phone speech signals. Other plausible strategies, which are described in this paper, did not produce the required improvement. The resulting accuracy of phone speech signal recognition increased from 47.32% to 68.30%.
Title in English
Phone Speech Detection and Recognition in the Task of Historical Radio Broadcast Transcription
Result description in English
This paper deals with methods and strategies for improving a system for the automatic transcription of the historical Czech Radio audio archive. The main goal of this work was to improve the recognition of audio signals containing phone speech, for which the recognition rate was relatively low because phone signals are frequency-limited. A phone signal detector based on Gaussian mixture models (GMM) was developed and integrated into our speech transcription system. Several different acoustic models were experimentally tested to enhance phone speech signal recognition. We demonstrate that phone speech recognition improves significantly if acoustic models based on hidden Markov models (HMM) are trained directly on phone speech signals. Other plausible strategies, which are described in this paper, did not produce the required improvement. The resulting accuracy of phone speech signal recognition increased from 47.32% to 68.30%.
Classification
Type
D - Article in proceedings
CEP field
JC - Computer hardware and software
OECD FORD field
—
Result linkages
Project
—
Linkages
I - Institutional support for the long-term conceptual development of a research organisation
Other
Year of implementation
2015
Data confidentiality code
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific to the result type
Title of the article in the proceedings
38th International Conference on Telecommunications and Signal Processing, TSP 2015
ISBN
978-1-4799-8498-5
ISSN
—
e-ISSN
—
Number of pages
4
Pages from-to
433-436
Publisher name
Institute of Electrical and Electronics Engineers Inc.
Place of publication
Prague, Czech Republic
Event location
Prague
Event date
1. 1. 2015
Event type by nationality
WRD - Worldwide event
UT WoS article code
—