The phase substitutions in Czech harmonic concatenative speech synthesis
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F03%3A00000252" target="_blank" >RIV/49777513:23520/03:00000252 - isvavai.cz</a>
Alternative codes found
RIV/49777513:23520/03:00000054
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
The phase substitutions in Czech harmonic concatenative speech synthesis
Original language description
This paper describes the issues of the usage of various phase component types in the development of the Czech TTS system based on harmonic sinusoidal signal representation. We have found the approaches for speech representation based on sinusoidal coding[1] or harmonic plus noise modelling [2] very promising. It is mainly due to possibility of high compression of the spectral representation and possibility to 'smooth' the transitions on the spectral level. The major inconvenience is the necessity to use natural phase components to reach quality synthesis with preserved naturalness. Trying to interpolate the phase components across the concatenations causes the discontinuities in generated signal. We found that the discontinuities substantially degradethe fluency of synthesized speech. We propose the method of substituting the phase components by one locally constant phase component to guarantee the local phase coherence.
Czech name
Substituce fáze v české harmonické konkatenační syntéze řeči.
Czech description
Tento článek popisuje hlediska užití různých typů složek fáze při vývoji českého TTS systému založeného na reprezentaci harmonickým sinusoidálním signálem. Nalezli jsme velmi slibný přístup pro reprezentaci řeči založený na sinusoidálním kodování [1] nebo harmonickém plus šumovém modelování.
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
JD - Use of computers, robotics and its application
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/GA102%2F02%2F0124" target="_blank" >GA102/02/0124: Voice technologies for support of information society</a><br>
Continuities
Z - Vyzkumny zamer (s odkazem do CEZ)
Others
Publication year
2003
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Lecture Notes in Artificial Intelligence
ISSN
0302-9743
e-ISSN
—
Volume of the periodical
—
Issue of the periodical within the volume
—
Country of publishing house
DE - GERMANY
Number of pages
8
Pages from-to
333
UT code for WoS article
—
EID of the result in the Scopus database
—