Concatenation Artifact Detection Trained from Listeners Evaluations
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F13%3A43919442" target="_blank" >RIV/49777513:23520/13:43919442 - isvavai.cz</a>
Result on the web
<a href="http://link.springer.com/chapter/10.1007%2F978-3-642-40585-3_22" target="_blank" >http://link.springer.com/chapter/10.1007%2F978-3-642-40585-3_22</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-642-40585-3_22" target="_blank" >10.1007/978-3-642-40585-3_22</a>
Alternative languages
Result language
angličtina
Original language name
Concatenation Artifact Detection Trained from Listeners Evaluations
Original language description
Unit selection is known for its ability to produce high-quality synthetic speech. In contrast with HMM-based synthesis, it produces more natural speech but it may suffer from sudden quality drops at concatenation points. The danger of quality deterioration can be reduced (but, unfortunately, not eliminated) by using very large speech corpora. In this paper, our first experiment with automatic artifact detection is presented. Firstly, a brief description of artifacts is given. Then, a listening test experiment, in which listeners evaluated speech synthesis artifacts, is described. The data gathered during the listening test were then used to train an SVM classifer. Finally, results of the SVM-based artifact detection in synthetic speech are discussed.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JD - Use of computers, robotics and its application
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/TA01011264" target="_blank" >TA01011264: Elimination of the language barriers faced by the handicapped watchers of the Czech Television II</a><br>
Continuities
S - Specificky vyzkum na vysokych skolach
Others
Publication year
2013
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Text, Speech, and Dialogue 16th International Conference, TSD 2013, Pilsen, Czech Republic, September 1-5, 2013. Proceedings
ISBN
978-3-642-40584-6
ISSN
0302-9743
e-ISSN
—
Number of pages
8
Pages from-to
169-176
Publisher name
Springer
Place of publication
Heidelberg
Event location
Plzeň
Event date
Sep 1, 2013
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—