On Building Phonetically and Prosodically Rich Speech Corpus for Text-to-Speech Synthesis

The result's identifiers

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F06%3A00005351" target="_blank" >RIV/00216208:11320/06:00005351 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—

Alternative languages

Result language
angličtina
Original language name
On Building Phonetically and Prosodically Rich Speech Corpus for Text-to-Speech Synthesis
Original language description
This paper proposes a way of preparing and recording a speech corpus for unit selection text-to-speech speech synthesis driven by symbolic prosody. The research is focused on a phonetically and prosodically rich sentence selection algorithm. Symbolic description on a deep prosody level is used to enrich the phonetic representation of sentences (by respecting the prosodeme types phones appear in). The resulting algorithm then selects sentences with respect to both phonetic and prosodic criteria. To coversupra-sentential prosody phenomena, paragraphs were selected at random and recorded as well. The new speech corpus can be utilised in unit selection speech synthesis and also for training a data-driven prosodic parser.
Czech name
Vytváření foneticky a prozodicky bohatých řečových korpusů v úloze syntézy řeči z textu
Czech description
Článek navrhuje metodu přípravy a pořízení řečového korpusu pro úlohu syntézy řeči z textu s dynamickým výběrem jednotek řízenou pomocí symbolické prozodie. Soustředí se na algoritmus výběru foneticky a prozodicky bohatých vět. Foneticky přepsané věty jsou obohaceny o symbolický popis na hrubé prozodické úrovni s respektováním typu prozodému, ve kterém se fony objevují. Výsledný algoritmus pak vybírá věty s ohledem na fonetická i prozodická kritéria. Abychom též pokryli i supravětné prozodické jevy, náhodně jsme vybrali odstavce a nahráli je. Nový řečový korpus se může využít k syntéze řeči s dynamickým výběrem jednotek a také k trénování datově orientovaného prozodického parseru.

Classification

Type
D - Article in proceedings
CEP classification
AI - Linguistics
OECD FORD branch
—

Result continuities

Project
<a href="/en/project/LC536" target="_blank" >LC536: Integrated center for natural language processing</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Others

Publication year
2006
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

Article name in the collection
Proceedings of the second IASTED international conference on Computational intelligence
ISBN
—
ISSN
—
e-ISSN
—
Number of pages
6
Pages from-to
442-447
Publisher name
ACTA Press
Place of publication
Anaheim, USA
Event location
Anaheim, USA
Event date
Jan 1, 2006
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—

Similar results(10)

On building phonetically and prosodically rich speech corpus for text-to-speech synthesis On Building Phonetically and Prosodically Rich Speech Corpus for Text-to-Speech Synthesis Recording and Annotation of Speech Corpus for Czech Unit Selection Speech Synthesis

What are you looking for?

Quick search

Smart search

On Building Phonetically and Prosodically Rich Speech Corpus for Text-to-Speech Synthesis

The result's identifiers

Alternative languages

Classification

Result continuities

Others

Data specific for result type

Similar results(10)

What are you looking for?

Quick search

Smart search

Result description

The result's identifiers

The result's identifiers

Alternative languages

Alternative languages

Classification

Classification

Result continuities

Result continuities

Others

Others

Data specific for result type

Data specific for result type

Similar results(10)