All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Unit-Selection Speech Synthesis Adjustments for Audiobook-Based Voices

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F16%3A43929881" target="_blank" >RIV/49777513:23520/16:43929881 - isvavai.cz</a>

  • Result on the web

    <a href="http://link.springer.com/chapter/10.1007/978-3-319-45510-5_38" target="_blank" >http://link.springer.com/chapter/10.1007/978-3-319-45510-5_38</a>

  • DOI - Digital Object Identifier

    <a href="http://dx.doi.org/10.1007/978-3-319-45510-5_38" target="_blank" >10.1007/978-3-319-45510-5_38</a>

Alternative languages

  • Result language

    angličtina

  • Original language name

    Unit-Selection Speech Synthesis Adjustments for Audiobook-Based Voices

  • Original language description

    This paper presents easy-to-use modifications to unit-selection speech-synthesis algorithm with voices built from audiobooks. Audiobooks are a very good source of large and high quality audio data for speech synthesis; however, they usually do not meet basic requirements for standard unit-selection synthesis: "neutral" speech properties with no expressive or spontaneous expressions, stable prosodic patterns, careful pronunciation, and consistent voice style during recording. However, if these conditions are taken into consideration, few modifications can be made to adjust the general unit-selection algorithm to make it more robust for synthesis from such audiobook data. Listening test shows that these adjustments increased perceived speech quality and acceptability against a baseline TTS system. Modifications presented here can also allow to exploit audio data variability to control pitch and tempo of synthesized speech.

  • Czech name

  • Czech description

Classification

  • Type

    D - Article in proceedings

  • CEP classification

    JD - Use of computers, robotics and its application

  • OECD FORD branch

Result continuities

  • Project

    <a href="/en/project/TA01011264" target="_blank" >TA01011264: Elimination of the language barriers faced by the handicapped watchers of the Czech Television II</a><br>

  • Continuities

    S - Specificky vyzkum na vysokych skolach

Others

  • Publication year

    2016

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Article name in the collection

    Text, Speech, and Dialogue 19th International Conference, TSD 2016, Brno , Czech Republic, September 12-16, 2016, Proceedings

  • ISBN

    978-3-319-45509-9

  • ISSN

    0302-9743

  • e-ISSN

  • Number of pages

    8

  • Pages from-to

    335-342

  • Publisher name

    Springer

  • Place of publication

    Heidelberg

  • Event location

    Brno, Česká republika

  • Event date

    Sep 12, 2016

  • Type of event by nationality

    WRD - Celosvětová akce

  • UT code for WoS article

    000389707400038