SdSV Challenge 2020: Large-Scale Evaluation of Short-duration Speaker Verification
The result's identifiers
Result code in IS VaVaI
RIV/00216305:26230/20:PU138637 (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26230%2F20%3APU138637)
Result on the web
https://www.isca-speech.org/archive/Interspeech_2020/pdfs/1485.pdf
DOI - Digital Object Identifier
10.21437/Interspeech.2020-1485 (http://dx.doi.org/10.21437/Interspeech.2020-1485)
Alternative languages
Result language
English
Original language name
SdSV Challenge 2020: Large-Scale Evaluation of Short-duration Speaker Verification
Original language description
Modern approaches to speaker verification represent speech utterances as fixed-length embeddings. With these approaches, we implicitly assume that speaker characteristics are independent of the spoken content. Such an assumption generally holds when sufficiently long utterances are given. In this context, speaker embeddings, like i-vector and x-vector, have been shown to be extremely effective. For speech utterances of short duration (on the order of a few seconds), speaker embeddings have shown significant dependency on the phonetic content. In this regard, the SdSV Challenge 2020 was organized with a broad focus on systematic benchmarking and analysis of varying degrees of phonetic variability in short-duration speaker verification (SdSV). In addition to text-dependent and text-independent tasks, the challenge features an unusual and difficult task of cross-lingual speaker verification (English vs. Persian). This paper describes the dataset and tasks, the evaluation rules and protocols, the performance metric, baseline systems, and challenge results. We also present insights gained from the evaluation and future research directions.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
LQ1602: IT4Innovations excellence in science
Continuities
P - Research and development project financed from public funds (with a link to CEP)
S - Specific university research
Others
Publication year
2020
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
ISBN
—
ISSN
1990-9772
e-ISSN
—
Number of pages
5
Pages from-to
731-735
Publisher name
International Speech Communication Association
Place of publication
Shanghai
Event location
Shanghai
Event date
Oct 25, 2020
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
000833594100152