Unit-Selection Speech Synthesis Adjustments for Audiobook-Based Voices
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F16%3A43929881" target="_blank" >RIV/49777513:23520/16:43929881 - isvavai.cz</a>
Result on the web
<a href="http://link.springer.com/chapter/10.1007/978-3-319-45510-5_38" target="_blank" >http://link.springer.com/chapter/10.1007/978-3-319-45510-5_38</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-319-45510-5_38" target="_blank" >10.1007/978-3-319-45510-5_38</a>
Alternative languages
Result language
English
Original language name
Unit-Selection Speech Synthesis Adjustments for Audiobook-Based Voices
Original language description
This paper presents easy-to-use modifications to the unit-selection speech-synthesis algorithm for voices built from audiobooks. Audiobooks are a very good source of large, high-quality audio data for speech synthesis; however, they usually do not meet the basic requirements for standard unit-selection synthesis: "neutral" speech properties with no expressive or spontaneous expressions, stable prosodic patterns, careful pronunciation, and a consistent voice style during recording. Nevertheless, if these conditions are taken into consideration, a few modifications can be made to the general unit-selection algorithm to make it more robust for synthesis from such audiobook data. A listening test shows that these adjustments increased perceived speech quality and acceptability compared with a baseline TTS system. The modifications presented here also make it possible to exploit the variability of the audio data to control the pitch and tempo of synthesized speech.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JD - Use of computers, robotics and its application
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/TA01011264" target="_blank" >TA01011264: Elimination of the language barriers faced by the handicapped watchers of the Czech Television II</a>
Continuities
S - Specific research at universities
Others
Publication year
2016
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Text, Speech, and Dialogue: 19th International Conference, TSD 2016, Brno, Czech Republic, September 12-16, 2016, Proceedings
ISBN
978-3-319-45509-9
ISSN
0302-9743
e-ISSN
—
Number of pages
8
Pages from-to
335-342
Publisher name
Springer
Place of publication
Heidelberg
Event location
Brno, Czech Republic
Event date
Sep 12, 2016
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
000389707400038