The partly preserved natural phases in the concatenative speech synthesis based on the harmonic/noise approach
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F03%3A00000206" target="_blank" >RIV/49777513:23520/03:00000206 - isvavai.cz</a>
Výsledek na webu
—
DOI - Digital Object Identifier
—
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
The partly preserved natural phases in the concatenative speech synthesis based on the harmonic/noise approach
Popis výsledku v původním jazyce
This paper describes our advances in the development of the Czech TTS system achieved mainly in the field of speech signal generation. We found the approaches for speech representation based on sinusoidal coding [1] or harmonic plus noise modeling [2] respectively very promising for our goal. It is mainly due to high compression possibility of the spectral representation of the speech. The major inconvenience is the necessity of natural phase components to reach quality naturally sounding synthesis. Since there is no known method for suitable phase representation, the methods for its substitution must be searched. In our experiments, we observed the phase coherence to be more important (from the view of naturalness) then the necessity of the strict usage of the original phase component in all instants (frames). We proceed from this experience and here we propose our method where only the one phase vector is needed for each voiced segment (continuous sequence of voiced frames) in ever
Název v anglickém jazyce
The partly preserved natural phases in the concatenative speech synthesis based on the harmonic/noise approach
Popis výsledku anglicky
This paper describes our advances in the development of the Czech TTS system achieved mainly in the field of speech signal generation. We found the approaches for speech representation based on sinusoidal coding [1] or harmonic plus noise modeling [2] respectively very promising for our goal. It is mainly due to high compression possibility of the spectral representation of the speech. The major inconvenience is the necessity of natural phase components to reach quality naturally sounding synthesis. Since there is no known method for suitable phase representation, the methods for its substitution must be searched. In our experiments, we observed the phase coherence to be more important (from the view of naturalness) then the necessity of the strict usage of the original phase component in all instants (frames). We proceed from this experience and here we propose our method where only the one phase vector is needed for each voiced segment (continuous sequence of voiced frames) in ever
Klasifikace
Druh
J<sub>x</sub> - Nezařazeno - Článek v odborném periodiku (Jimp, Jsc a Jost)
CEP obor
JD - Využití počítačů, robotika a její aplikace
OECD FORD obor
—
Návaznosti výsledku
Projekt
<a href="/cs/project/GA102%2F02%2F0124" target="_blank" >GA102/02/0124: Hlasové technologie v podpoře informační společnosti</a><br>
Návaznosti
Z - Vyzkumny zamer (s odkazem do CEZ)
Ostatní
Rok uplatnění
2003
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název periodika
WSEAS Transactions on Computers
ISSN
1109-2750
e-ISSN
—
Svazek periodika
2
Číslo periodika v rámci svazku
Červenec
Stát vydavatele periodika
GR - Řecká republika
Počet stran výsledku
6
Strana od-do
714-719
Kód UT WoS článku
—
EID výsledku v databázi Scopus
—