Diarization Based on Identification with X-Vectors
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F20%3A43959814" target="_blank" >RIV/49777513:23520/20:43959814 - isvavai.cz</a>
Result on the web
<a href="https://link.springer.com/chapter/10.1007/978-3-030-60276-5_64" target="_blank" >https://link.springer.com/chapter/10.1007/978-3-030-60276-5_64</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-030-60276-5_64" target="_blank" >10.1007/978-3-030-60276-5_64</a>
Alternative languages
Result language
angličtina
Original language name
Diarization Based on Identification with X-Vectors
Original language description
In this paper, we describe a diarization of mono channel telephone recordings from The Language Consulting Center providing the Czech language consultancy service. In our proposed approach to a diarization, we use information about the known identity of one speaker (the language counsellor) acquired from the text transcription at the beginning of the conversation. In the state-of-the-art diarization based on the x-vectors clustering, we replace the clustering step by the identification of each segment of the recording against the counsellor’s identity x-vector and the general x-vector model that represents the client. Our proposed diarization without resegmentation step can be used as an online approach. Because of the uniqueness of our data, we compare our results with the Kaldi diarization as the baseline system.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
20205 - Automation and control systems
Result continuities
Project
<a href="/en/project/DG16P02B009" target="_blank" >DG16P02B009: Access to a Lingustically Structured Database of Enquiries from the Language Consulting Centre</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2020
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Speech and Computer, 22nd International Conference, SPECOM 2019, St. Petersburg, Russia, October 7-9,2020, Proceedings
ISBN
978-3-030-60275-8
ISSN
0302-9743
e-ISSN
1611-3349
Number of pages
12
Pages from-to
667-678
Publisher name
Springer
Place of publication
Cham
Event location
St. Petersburg, Russia
Event date
Oct 7, 2020
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—