Phone Speech Detection and Recognition in the Task of Historical Radio Broadcast Transcription
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F15%3A%230003421" target="_blank" >RIV/46747885:24220/15:#0003421 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1109/TSP.2015.7296399" target="_blank" >http://dx.doi.org/10.1109/TSP.2015.7296399</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/TSP.2015.7296399" target="_blank" >10.1109/TSP.2015.7296399</a>
Alternative languages
Result language
English
Original language name
Phone Speech Detection and Recognition in the Task of Historical Radio Broadcast Transcription
Original language description
This paper deals with methods and strategies for improving a system for the automatic transcription of the historical Czech Radio audio archive. The main goal of this work was to improve the recognition of audio signals containing phone speech, where the recognition rate was relatively low because phone signals are frequency-limited. A phone signal detector based on GMM was developed and integrated into our speech transcription system. Several different acoustic models were experimentally tested to enhance phone speech signal recognition. We demonstrate that phone speech recognition improves significantly if HMM-based acoustic models are trained directly on phone speech signals. The other plausible strategies described in this paper did not produce the required improvement. The resulting accuracy of phone speech signal recognition increased from 47.32% to 68.30%.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JC - Computer hardware and software
OECD FORD branch
—
Result continuities
Project
—
Continuities
I - Institutional support for the long-term conceptual development of a research organisation
Others
Publication year
2015
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
38th International Conference on Telecommunications and Signal Processing, TSP 2015
ISBN
978-1-4799-8498-5
ISSN
—
e-ISSN
—
Number of pages
4
Pages from-to
433-436
Publisher name
Institute of Electrical and Electronics Engineers Inc.
Place of publication
Praha, Czech Republic
Event location
Praha
Event date
Jan 1, 2015
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—