Automatic Text-Independent Artifact Detection, Localization, and Classification in the Synthetic Speech

The result's identifiers

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F17%3A43932708" target="_blank" >RIV/49777513:23520/17:43932708 - isvavai.cz</a>
Result on the web
<a href="https://www.radioeng.cz/fulltexts/2017/17_04_1151_1160.pdf" target="_blank" >https://www.radioeng.cz/fulltexts/2017/17_04_1151_1160.pdf</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.13164/re.2017.1151" target="_blank" >10.13164/re.2017.1151</a>

Alternative languages

Result language
angličtina
Original language name
Automatic Text-Independent Artifact Detection, Localization, and Classification in the Synthetic Speech
Original language description
The paper describes an experiment with statistical approaches to automatic detection, localization, and classification of the basic types of artifacts in the synthetic speech produced by the Czech text-to-speech system using the unit selection method. The aim of the first experiment is to detect the artifacts by the analysis of variances (ANOVA) and hypothesis testing. The second experiment is focused on localization of the detected artifacts by the Gaussian mixture models (GMM). Finally, the developed open-set artifact classifier is described. The influence of the length of the feature vector and its composition on the resulting artifact detection accuracy is also analyzed together with other factors affecting the stability of the artifact detection process. Further investigations have shown a relatively great influence of the number of mixtures and the type of a covariance matrix on the output artifact classification error rate as well as on the computational complexity. Results of the performed experiments confirm the functionality of the proposed artifact detector based on the ANOVA and hypothesis tests, and the GMM-based artifact localizer and classifier. The described statistical approaches represent the alternatives to the standard listening tests and the manual labeling of the artifacts.
Czech name
—
Czech description
—

Classification

Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
20205 - Automation and control systems

Result continuities

Project
<a href="/en/project/GA16-04420S" target="_blank" >GA16-04420S: Combining phonetic and corpus-based approaches to remedy disruptive effects in synthetic speech</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Others

Publication year
2017
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

Name of the periodical
Radioengineering
ISSN
1210-2512
e-ISSN
—
Volume of the periodical
26
Issue of the periodical within the volume
4
Country of publishing house
CZ - CZECH REPUBLIC
Number of pages
10
Pages from-to
1151-1160
UT code for WoS article
000423270000032
EID of the result in the Scopus database
2-s2.0-85038078461

Similar results(10)

Evaluation of TTS Personification by GMM-Based Speaker Gender and Age Classifier Evaluation of Synthetic Speech by GMM-Based Continuous Detection of Emotional States GMM-Based Speaker Gender and Age Classification After Voice Conversion

What are you looking for?

Quick search

Smart search

Automatic Text-Independent Artifact Detection, Localization, and Classification in the Synthetic Speech

The result's identifiers

Alternative languages

Classification

Result continuities

Others

Data specific for result type

Similar results(10)

What are you looking for?

Quick search

Smart search

Result description

The result's identifiers

The result's identifiers

Alternative languages

Alternative languages

Classification

Classification

Result continuities

Result continuities

Others

Others

Data specific for result type

Data specific for result type

Similar results(10)