Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26230%2F19%3APU134181" target="_blank" >RIV/00216305:26230/19:PU134181 - isvavai.cz</a>
Result on the web
<a href="https://www.isca-speech.org/archive/Interspeech_2019/pdfs/1757.pdf" target="_blank" >https://www.isca-speech.org/archive/Interspeech_2019/pdfs/1757.pdf</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.21437/Interspeech.2019-1757" target="_blank" >10.21437/Interspeech.2019-1757</a>
Alternative languages
Result language
angličtina
Original language name
Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition
Original language description
In this work, we continue in our research on i-vector extractor for speaker verification (SV) and we optimize its architecture for fast and effective discriminative training. We were motivated by computational and memory requirements caused by the large number of parameters of the original generative ivector model. Our aim is to preserve the power of the original generative model, and at the same time focus the model towards extraction of speaker-related information. We show that it is possible to represent a standard generative i-vector extractor by a model with significantly less parameters and obtain similar performance on SV tasks. We can further refine this compact model by discriminative training and obtain i-vectors that lead to better performance on various SV benchmarks representing different acoustic domains.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2019
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of Interspeech
ISBN
—
ISSN
1990-9772
e-ISSN
—
Number of pages
5
Pages from-to
4330-4334
Publisher name
International Speech Communication Association
Place of publication
Graz
Event location
INTERSPEECH 2019
Event date
Sep 15, 2019
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
000831796404095