Diarization of the Language Consulting Center Telephone Calls
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F19%3A43956395" target="_blank" >RIV/49777513:23520/19:43956395 - isvavai.cz</a>
Result on the web
<a href="https://link.springer.com/chapter/10.1007%2F978-3-030-26061-3_56" target="_blank" >https://link.springer.com/chapter/10.1007%2F978-3-030-26061-3_56</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-030-26061-3_56" target="_blank" >10.1007/978-3-030-26061-3_56</a>
Alternative languages
Result language
angličtina
Original language name
Diarization of the Language Consulting Center Telephone Calls
Original language description
In this paper, we describe a diarization of the archive data from the project “Access to a Linguistically Structured Database of Enquiries from the Language Consulting Center”. This project is attempting to provide improved access to the large archives of the Czech language of mainly telephone conversations collected continuously by The Language Consulting Center. One part of this archives contains mono recordings, where the data of the client and the language counsellor are mixed in one channel. In our proposed approach to a diarization, we used the information about the identity of the language counsellor acquired from the text transcription on the beginning of the conversation. For the initial stage of the diarization, our system based on clustering the x-vectors was adopted. The resegmentation step is used for refining the boundaries of speaker changes by the pre-trained Gaussian mixture model of the counsellor. Because of the uniqueness of our data, we compared our results with the Kaldi diarization as the baseline system.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
20205 - Automation and control systems
Result continuities
Project
<a href="/en/project/DG16P02B009" target="_blank" >DG16P02B009: Access to a Lingustically Structured Database of Enquiries from the Language Consulting Centre</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2019
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Speech and Computer, 21st International Conference, SPECOM 2019, Istanbul, turkey, August 20-25,2019, Proceedings
ISBN
978-3-030-26060-6
ISSN
0302-9743
e-ISSN
1611-3349
Number of pages
10
Pages from-to
549-558
Publisher name
Springer
Place of publication
Cham
Event location
Instanbul, Turkey
Event date
Aug 20, 2019
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—