UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F19%3A43956400" target="_blank" >RIV/49777513:23520/19:43956400 - isvavai.cz</a>
Result on the web
<a href="https://www.isca-speech.org/archive/Interspeech_2019/abstracts/1385.html" target="_blank" >https://www.isca-speech.org/archive/Interspeech_2019/abstracts/1385.html</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.21437/Interspeech.2019-1385" target="_blank" >10.21437/Interspeech.2019-1385</a>
Alternative languages
Result language
angličtina
Original language name
UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge
Original language description
In this paper, we present our system developed by the team from the New Technologies for the Information Society (NTIS) research center of the University of West Bohemia in Pilsen, for the Second DIHARD Speech Diarization Challenge. The base of our system follows the currently-standard approach of segmentation, i/x-vector extraction, clustering, and resegmentation. The hyperparameters for each of the subsystems were selected according to the domain classifier trained on the development set of DIHARD II. We compared our system with results from the Kaldi diarization (with i/x-vectors) and combined these systems. At the time of writing of this abstract, our best submission achieved a DER of 23.47% and a JER of 48.99% on the evaluation set (in Track 1 using reference SAD).
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
20205 - Automation and control systems
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2019
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 20th Annual Conference of the International Speech Communication Association (Interspeech 2019)
ISBN
978-1-5108-9683-3
ISSN
2308-457X
e-ISSN
—
Number of pages
5
Pages from-to
993-997
Publisher name
Curran Associates, Inc.
Place of publication
Red Hook, NY
Event location
Graz, Austria
Event date
Sep 15, 2019
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—