Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26230%2F17%3APU126451" target="_blank" >RIV/00216305:26230/17:PU126451 - isvavai.cz</a>
Result on the web
<a href="http://www.sciencedirect.com/science/article/pii/S0885230816303199" target="_blank" >http://www.sciencedirect.com/science/article/pii/S0885230816303199</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1016/j.csl.2017.04.005" target="_blank" >10.1016/j.csl.2017.04.005</a>
Alternative languages
Result language
angličtina
Original language name
Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models
Original language description
Inspired by the success of Deep Neural Networks (DNN) in text-independent speaker recognition, we have recently demonstrated that similar ideas can also be applied to the text-dependent speaker verification task. In this paper, we describe new advances with our state-of-the-art i-vector based approach to text-dependent speaker verification, which also makes use of different DNN techniques. In order to collect sufficient statistics for i-vector extraction, different frame alignment models are compared such as GMMs, phonemic HMMs or DNNs trained for senone classification. We also experiment with DNN based bottleneck features and their combinations with standard MFCC features. We experiment with few different DNN configurations and investigate the importance of training DNNs on 16 kHz speech. The results are reported on RSR2015 dataset, where training material is available for all possible enrollment and test phrases. Additionally, we report results also on more challenging RedDots dataset, where the system is built in truly phrase-independent way.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2017
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
COMPUTER SPEECH AND LANGUAGE
ISSN
0885-2308
e-ISSN
1095-8363
Volume of the periodical
2017
Issue of the periodical within the volume
46
Country of publishing house
GB - UNITED KINGDOM
Number of pages
19
Pages from-to
53-71
UT code for WoS article
000407609600003
EID of the result in the Scopus database
2-s2.0-85019904410