Influence of different phoneme mappings on the recognition accuracy of electrolaryngeal speech
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F12%3A43916819" target="_blank" >RIV/49777513:23520/12:43916819 - isvavai.cz</a>
Výsledek na webu
—
DOI - Digital Object Identifier
—
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
Influence of different phoneme mappings on the recognition accuracy of electrolaryngeal speech
Popis výsledku v původním jazyce
This paper presents the initial steps towards building speech recognition system that is able to efficiently process electrolaryngeal substitute speech produced by laryngectomees. Speakers after total laryngectomy are characterized by restricted aero-acoustic properties in comparison with normal speakers and their speech is therefore far less intelligible. We suggested and tested several approaches to acoustic modeling within the ASR system that would be able to cope with this lower intelligibility. Comparative experiments were also performed on the healthy speakers. We tried several mappings that unify unvoiced phonemes with their voiced counterparts in the acoustic modeling process both on monophone and triphone level. Systems using zerogram and trigram language models were evaluated and compared in order to increase the credibility of the results.
Název v anglickém jazyce
Influence of different phoneme mappings on the recognition accuracy of electrolaryngeal speech
Popis výsledku anglicky
This paper presents the initial steps towards building speech recognition system that is able to efficiently process electrolaryngeal substitute speech produced by laryngectomees. Speakers after total laryngectomy are characterized by restricted aero-acoustic properties in comparison with normal speakers and their speech is therefore far less intelligible. We suggested and tested several approaches to acoustic modeling within the ASR system that would be able to cope with this lower intelligibility. Comparative experiments were also performed on the healthy speakers. We tried several mappings that unify unvoiced phonemes with their voiced counterparts in the acoustic modeling process both on monophone and triphone level. Systems using zerogram and trigram language models were evaluated and compared in order to increase the credibility of the results.
Klasifikace
Druh
D - Stať ve sborníku
CEP obor
JD - Využití počítačů, robotika a její aplikace
OECD FORD obor
—
Návaznosti výsledku
Projekt
<a href="/cs/project/ED1.1.00%2F02.0090" target="_blank" >ED1.1.00/02.0090: NTIS - Nové technologie pro informační společnost</a><br>
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach
Ostatní
Rok uplatnění
2012
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název statě ve sborníku
Proceedings of the International Conference on Signal Processing and Multimedia Applications and Multimedia Applications and Wireless Information Networks and Systems
ISBN
978-989-8565-25-9
ISSN
—
e-ISSN
—
Počet stran výsledku
4
Strana od-do
204-207
Název nakladatele
SciTePress
Místo vydání
[S.l.]
Místo konání akce
Řím, Itálie
Datum konání akce
24. 7. 2012
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—