On Minimizing the Size of Speech Unit Database in Concatenative Speech Synthesis

The result's identifiers

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F06%3A00000536" target="_blank" >RIV/49777513:23520/06:00000536 - isvavai.cz</a>
Alternative codes found
RIV/49777513:23520/06:00000105
Result on the web
—
DOI - Digital Object Identifier
—

Alternative languages

Result language
angličtina
Original language name
On Minimizing the Size of Speech Unit Database in Concatenative Speech Synthesis
Original language description
In this paper, minimization of speech unit database is researched in order to have a compact speech unit database yielding a "good enough" synthetic speech usable also for low-resource devices. We focused mainly on HMM-based speech unit database preparation, a process which prepares a set of context-dependent phones (triphones) by means of HMM modelling, CART-based clustering, and HMM-based segmentation in a fully automatic way. Three experiments are described in the paper: the first one concerns the size of the source speech corpus, the second one deals with the triphone clustering process, and the last one concerns the modelling of the cross-word dependencies. The final minimised system exploits techniques used in all three experiments. The size of the resulting speech unit database decreased from 28.1 to 1.6 MB. The resulting synthetic speech was then judged by means of CCR listening tests and evaluated as "slightly worse" than speech generated by the baseline system.
Czech name
Minimalizace velikosti databáze řečových jednotek v úloze konkatenační syntézy řeči
Czech description
V článku se zkoumají možnosti minimalizace databáze řečových jednotek za účelem získání kompaktní databáze řečových jednotek, která bude poskytovat syntetickou řeč "rozumné kvality" také pro zařízení s menšími systémovými zdroji. Zaměřili jsme se zejménana přípravu databáze řečových jednotek s využitím HMM, plně automatický proces, který připravuje soubor kontextově závislých fonů (trifonů) pomocí modelování HMM, shlukování založeného na CART a segmentace s využitím HMM V článku jsou popsány tři experimenty: první experiment se týká velikosti zdrojového řečového korpusu, druhý experiment se zabývá procesem shlukování trifo

Classification

Type
D - Article in proceedings
CEP classification
JD - Use of computers, robotics and its application
OECD FORD branch
—

Result continuities

Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Others

Publication year
2006
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

Article name in the collection
Speech Processing
ISBN
80-86269-15-9
ISSN
—
e-ISSN
—
Number of pages
7
Pages from-to
70-76
Publisher name
Institute of Radio Engineering and Electronics AS CR
Place of publication
Prague
Event location
Praha
Event date
Jan 1, 2006
Type of event by nationality
EUR - Evropská akce
UT code for WoS article
—

Similar results(10)

Using Hidden Markov Models for speech synthesis Speech synthesis using HMM-based acoustic unit inventory Statistical approach to the automatic synthesis of czech speech

What are you looking for?

Quick search

Smart search

On Minimizing the Size of Speech Unit Database in Concatenative Speech Synthesis

The result's identifiers

Alternative languages

Classification

Result continuities

Others

Data specific for result type

Similar results(10)

What are you looking for?

Quick search

Smart search

Result description

The result's identifiers

The result's identifiers

Alternative languages

Alternative languages

Classification

Classification

Result continuities

Result continuities

Others

Others

Data specific for result type

Data specific for result type

Similar results(10)