HMM-based Speech Synthesis: First Experiments for the Czech Language
Result identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F10%3A00504556" target="_blank" >RIV/49777513:23520/10:00504556 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Title in the original language
HMM-based Speech Synthesis: First Experiments for the Czech Language
Result description in the original language
In this paper, first experiments on statistical parametric HMM-based speech synthesis for the Czech language are described. For speech representation, two different analysis/synthesis methods were employed: traditional Mel cepstral analysis and the high-quality analysis/synthesis method STRAIGHT. Taking into account the prosodic and linguistic characteristics of the Czech language, a basic set of contextual factors was proposed. Our experiments showed that disregarding the syllabic structure of speech has an insignificant influence on the resulting speech quality. The effect of the amount of training data was also studied. Results indicate that our experimental HMM-based TTS system can produce speech of a quality similar to that of a unit selection-based TTS system trained with a larger amount of speech data. Furthermore, some simple experiments with the adaptation of trained HMMs were performed. In this manner, new voices could be obtained with a significantly smaller amount of speech data.
Title in English
HMM-based Speech Synthesis: First Experiments for the Czech Language
Result description in English
In this paper, first experiments on statistical parametric HMM-based speech synthesis for the Czech language are described. For speech representation, two different analysis/synthesis methods were employed: traditional Mel cepstral analysis and the high-quality analysis/synthesis method STRAIGHT. Taking into account the prosodic and linguistic characteristics of the Czech language, a basic set of contextual factors was proposed. Our experiments showed that disregarding the syllabic structure of speech has an insignificant influence on the resulting speech quality. The effect of the amount of training data was also studied. Results indicate that our experimental HMM-based TTS system can produce speech of a quality similar to that of a unit selection-based TTS system trained with a larger amount of speech data. Furthermore, some simple experiments with the adaptation of trained HMMs were performed. In this manner, new voices could be obtained with a significantly smaller amount of speech data.
Classification
Type
D - Article in proceedings
CEP field
JC - Computer hardware and software
OECD FORD field
—
Result linkages
Project
<a href="/cs/project/2C06020" target="_blank" >2C06020: Elimination of Language Barriers for Handicapped Viewers of Czech Television</a><br>
Linkages
P - Research and development project financed from public funds (with a link to CEP)
Others
Year of publication
2010
Data confidentiality code
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific to the result type
Article name in the proceedings
Speech Processing
ISBN
978-80-86269-21-4
ISSN
—
e-ISSN
—
Number of pages
8
Pages from-to
—
Publisher name
Institute of Photonics and Electronics Academy of Sciences of the Czech Republic, Prague
Place of publication
Prague
Event location
Prague
Event date
1. 1. 2010
Event type by nationality
EUR - European event
UT WoS article code
—