Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26230%2F16%3APU122425" target="_blank" >RIV/00216305:26230/16:PU122425 - isvavai.cz</a>
Result on the web
<a href="https://www.researchgate.net/publication/307889224_Analysis_of_Speaker_Recognition_Systems_in_Realistic_Scenarios_of_the_SITW_2016_Challenge" target="_blank" >https://www.researchgate.net/publication/307889224_Analysis_of_Speaker_Recognition_Systems_in_Realistic_Scenarios_of_the_SITW_2016_Challenge</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.21437/Interspeech.2016-981" target="_blank" >10.21437/Interspeech.2016-981</a>
Alternative languages
Result language
English
Original language name
Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge
Original language description
In this paper, we summarize our efforts for the Speakers In The Wild (SITW) challenge, and we present our findings with this new dataset for speaker recognition. Apart from the standard comparison of different SRE systems, we analyze the use of diarization for dealing with audio segments containing multiple speakers, since diarization is a necessary system component in part of the newly introduced enrollment and test protocols. Our state-of-the-art systems used in this work utilize both cepstral and DNN-based bottleneck features and are based on i-vectors followed by a Probabilistic Linear Discriminant Analysis (PLDA) classifier and logistic regression calibration/fusion. We present both narrow-band (8 kHz) and wide-band (16 kHz) systems together with their fusions.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/VI20152020025" target="_blank" >VI20152020025: Information mining in speech acquired by distant microphones - DRAPÁK</a>
Continuities
P - Research and development project financed from public funds (with a link to CEP)
Others
Publication year
2016
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Proceedings of Interspeech 2016
ISBN
978-1-5108-3313-5
ISSN
—
e-ISSN
—
Number of pages
5
Pages from-to
828-832
Publisher name
International Speech Communication Association
Place of publication
San Francisco
Event location
San Francisco
Event date
Sep 8, 2016
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
000409394400173