Automatic Evaluation of Synthetic Speech Quality by a System Based on Statistical Analysis
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F18%3A43952591" target="_blank" >RIV/49777513:23520/18:43952591 - isvavai.cz</a>
Result on the web
<a href="https://link.springer.com/chapter/10.1007%2F978-3-030-00794-2_34" target="_blank" >https://link.springer.com/chapter/10.1007%2F978-3-030-00794-2_34</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-030-00794-2_34" target="_blank" >10.1007/978-3-030-00794-2_34</a>
Alternative languages
Result language
angličtina
Original language name
Automatic Evaluation of Synthetic Speech Quality by a System Based on Statistical Analysis
Original language description
The paper describes a system for automatic evaluation of speech quality based on statistical analysis of differences in spectral properties, prosodic parameters, and time structuring within the speech signal. The proposed system was successfully tested in evaluation of sentences originating from male and female voices and produced by a speech synthesizer using the unit selection method with two different approaches to prosody manipulation. The experiments show necessity of all three types of speech features for obtaining correct, sharp, and stable results. A detailed analysis shows great influence of the number of statistical parameters on correctness and precision of the evaluated results. Larger size of the processed speech material has a positive impact on stability of the evaluation process. Final comparison documents basic correlation with the results obtained by the standard listening test.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
20205 - Automation and control systems
Result continuities
Project
<a href="/en/project/GA16-04420S" target="_blank" >GA16-04420S: Combining phonetic and corpus-based approaches to remedy disruptive effects in synthetic speech</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2018
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Text, Speech, and Dialogue 21st International Conference, TSD 2018, Brno, Czech Republic, September 11-14, 2018, Proceedings
ISBN
978-3-030-00793-5
ISSN
0302-9743
e-ISSN
1611-3349
Number of pages
9
Pages from-to
315-323
Publisher name
Springer Nature Switzerland AG
Place of publication
Cham
Event location
Brno, Czech Republic
Event date
Sep 11, 2018
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—