Czech time-domain TTS system with sample-by-sample harmonically pitch-normalized speech segment database
The result's identifiers
Result code in IS VaVaI
RIV/49777513:23520/02:00076269 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F02%3A00076269)
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Czech time-domain TTS system with sample-by-sample harmonically pitch-normalized speech segment database
Original language description
A monotonously recorded speech corpus is required to achieve high segmental quality in TTS systems. We record our own speech corpora with professional speakers, but it is usually not easy for the speaker to satisfy the monotonicity requirement. An easier and cheaper way to obtain a speech corpus for a TTS system would be to use publicly available speech recordings or commercially available speech corpora, but these cannot be expected to be recorded monotonously. This paper describes our effort to cope with this problem. We try techniques similar to "spectral reharmonization". The off-line algorithm is applied pitch-synchronously to every segment in the speech segment database. We use the FFT algorithm to obtain a set of harmonic parameters for every sub-segment delimited by the time instants of neighboring pitch-marks. The described pitch-normalization algorithm is performed on the voiced parts of each segment only.
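The pitch-synchronous step described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that each sub-segment between two neighboring pitch-marks spans exactly one pitch period, that a plain DFT stands in for the FFT, and that normalization means resynthesizing every period at one fixed target length. The function names (`harmonic_parameters`, `resynthesize`, `normalize_pitch`) and the `target_len` parameter are hypothetical, and the pitch-marks passed in are assumed to cover a voiced region only.

```python
import cmath
import math


def harmonic_parameters(period):
    """Return (amplitude, phase) per harmonic of one pitch period via a plain DFT.

    Assumption: the samples between two neighboring pitch-marks form exactly
    one period, so DFT bin k corresponds to the k-th harmonic of the local F0.
    """
    n = len(period)
    params = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(period))
        # DC and (for even n) the Nyquist bin occur once; other bins are mirrored.
        scale = 1.0 if k == 0 or (n % 2 == 0 and k == n // 2) else 2.0
        params.append((scale * abs(coeff) / n, cmath.phase(coeff)))
    return params


def resynthesize(params, target_len):
    """Rebuild one period of target_len samples from the harmonic parameters."""
    # Keep only harmonics strictly below the Nyquist limit at the new length.
    max_k = min(len(params) - 1, (target_len - 1) // 2)
    return [
        sum(params[k][0] * math.cos(2 * math.pi * k * i / target_len + params[k][1])
            for k in range(max_k + 1))
        for i in range(target_len)
    ]


def normalize_pitch(samples, pitch_marks, target_len):
    """Resynthesize every pitch period of a voiced region at a common length."""
    out = []
    for start, end in zip(pitch_marks, pitch_marks[1:]):
        out.extend(resynthesize(harmonic_parameters(samples[start:end]), target_len))
    return out
```

For example, `normalize_pitch(samples, [0, 8, 18], 9)` would turn two periods of 8 and 10 samples into two periods of 9 samples each, keeping the harmonic amplitudes and phases of every period and changing only the fundamental.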
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JD - Use of computers, robotics and its application
OECD FORD branch
—
Result continuities
Project
GA102/02/0124: Voice technologies for support of information society
Continuities
Z - Research plan (with a link to CEZ)
Others
Publication year
2002
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Czech time-domain TTS system with sample-by-sample harmonically pitch-normalized speech segment database
ISBN
8086269094
ISSN
—
e-ISSN
—
Number of pages
3
Pages from-to
44
Publisher name
Academy of Sciences of Czech Republic
Place of publication
Prague
Event location
Prague
Event date
Jan 1, 2002
Type of event by nationality
CST - Nationwide event
UT code for WoS article
—