Experiment with GMM-Based Artefact Localization in Czech Synthetic Speech
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F15%3A43926579" target="_blank" >RIV/49777513:23520/15:43926579 - isvavai.cz</a>
Result on the web
<a href="http://link.springer.com/chapter/10.1007%2F978-3-319-24033-6_3" target="_blank" >http://link.springer.com/chapter/10.1007%2F978-3-319-24033-6_3</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-319-24033-6_3" target="_blank" >10.1007/978-3-319-24033-6_3</a>
Alternative languages
Result language
angličtina
Original language name
Experiment with GMM-Based Artefact Localization in Czech Synthetic Speech
Original language description
The paper describes an experiment with using the statistical approach based on the Gaussian mixture models (GMM) for localization of artefacts in the synthetic speech produced by the Czech text-to-speech system employing the unit selection principle. In addition, the paper analyzes influence of different number of used GMM mixtures, and the influence of setting of the frame shift during the spectral feature analysis on the resulting artefact position accuracy. Obtained results of performed experiments confirm proper function of the chosen concept and the presented artefact position localizer can be used as an alternative to the standardly applied manual localization method.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
20205 - Automation and control systems
Result continuities
Project
<a href="/en/project/TA01030476" target="_blank" >TA01030476: Intelligent technologies for improving air traffic security</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2015
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Text, Speech, and Dialogue, 18th International Conference, TSD 2015, Pilsen, Czech Republic, September 14-17, 2015. Proceedings
ISBN
978-3-319-24032-9
ISSN
0302-9743
e-ISSN
—
Number of pages
9
Pages from-to
23-31
Publisher name
Springer
Place of publication
Berlin
Event location
Plzeň, Czech Republic
Event date
Sep 14, 2015
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
000365947800003