Evaluation of TTS Personification by GMM-Based Speaker Gender and Age Classifier
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F16%3A43929884" target="_blank" >RIV/49777513:23520/16:43929884 - isvavai.cz</a>
Result on the web
<a href="http://link.springer.com/chapter/10.1007/978-3-319-45510-5_35" target="_blank" >http://link.springer.com/chapter/10.1007/978-3-319-45510-5_35</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-319-45510-5_35" target="_blank" >10.1007/978-3-319-45510-5_35</a>
Alternative languages
Result language
angličtina
Original language name
Evaluation of TTS Personification by GMM-Based Speaker Gender and Age Classifier
Original language description
This paper describes an experiment using the Gaussian mixture models (GMM)-based speaker gender and age classification for automatic evaluation of the achieved success in text-to-speech (TTS) system personification. The proposed two-level GMM classifier detects four age categories (child, young, adult, senior) as well as it discriminates gender for adult voices. This classifier is applied for gender/age estimation of the synthetic speech in Czech and Slovak languages produced by different TTS systems with several voices, using different speech inventories and speech modelling methods. The obtained results confirm the hypothesis that this type of classifier can be utilized as an alternative approach instead of the conventional listening test in the area of speech evaluation.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JD - Use of computers, robotics and its application
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/GA16-04420S" target="_blank" >GA16-04420S: Combining phonetic and corpus-based approaches to remedy disruptive effects in synthetic speech</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2016
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Text, Speech, and Dialogue 19th International Conference, TSD 2016, Brno , Czech Republic, September 12-16, 2016, Proceedings
ISBN
978-3-319-45509-9
ISSN
0302-9743
e-ISSN
—
Number of pages
9
Pages from-to
305-313
Publisher name
Springer
Place of publication
Heidelberg
Event location
Brno, Česká republika
Event date
Sep 12, 2016
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
000389707400035