Diarization of the Language Consulting Center Telephone Calls

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F19%3A43956395" target="_blank" >RIV/49777513:23520/19:43956395 - isvavai.cz</a>
Výsledek na webu
<a href="https://link.springer.com/chapter/10.1007%2F978-3-030-26061-3_56" target="_blank" >https://link.springer.com/chapter/10.1007%2F978-3-030-26061-3_56</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-030-26061-3_56" target="_blank" >10.1007/978-3-030-26061-3_56</a>

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Diarization of the Language Consulting Center Telephone Calls
Popis výsledku v původním jazyce
In this paper, we describe a diarization of the archive data from the project “Access to a Linguistically Structured Database of Enquiries from the Language Consulting Center”. This project is attempting to provide improved access to the large archives of the Czech language of mainly telephone conversations collected continuously by The Language Consulting Center. One part of this archives contains mono recordings, where the data of the client and the language counsellor are mixed in one channel. In our proposed approach to a diarization, we used the information about the identity of the language counsellor acquired from the text transcription on the beginning of the conversation. For the initial stage of the diarization, our system based on clustering the x-vectors was adopted. The resegmentation step is used for refining the boundaries of speaker changes by the pre-trained Gaussian mixture model of the counsellor. Because of the uniqueness of our data, we compared our results with the Kaldi diarization as the baseline system.
Název v anglickém jazyce
Diarization of the Language Consulting Center Telephone Calls
Popis výsledku anglicky
In this paper, we describe a diarization of the archive data from the project “Access to a Linguistically Structured Database of Enquiries from the Language Consulting Center”. This project is attempting to provide improved access to the large archives of the Czech language of mainly telephone conversations collected continuously by The Language Consulting Center. One part of this archives contains mono recordings, where the data of the client and the language counsellor are mixed in one channel. In our proposed approach to a diarization, we used the information about the identity of the language counsellor acquired from the text transcription on the beginning of the conversation. For the initial stage of the diarization, our system based on clustering the x-vectors was adopted. The resegmentation step is used for refining the boundaries of speaker changes by the pre-trained Gaussian mixture model of the counsellor. Because of the uniqueness of our data, we compared our results with the Kaldi diarization as the baseline system.

Klasifikace

Druh
D - Stať ve sborníku
CEP obor
—
OECD FORD obor
20205 - Automation and control systems

Návaznosti výsledku

Projekt
<a href="/cs/project/DG16P02B009" target="_blank" >DG16P02B009: Zpřístupnění dotazů jazykové poradny v lingvisticky strukturované databázi</a><br>
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Ostatní

Rok uplatnění
2019
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název statě ve sborníku
Speech and Computer, 21st International Conference, SPECOM 2019, Istanbul, turkey, August 20-25,2019, Proceedings
ISBN
978-3-030-26060-6
ISSN
0302-9743
e-ISSN
1611-3349
Počet stran výsledku
10
Strana od-do
549-558
Název nakladatele
Springer
Místo vydání
Cham
Místo konání akce
Instanbul, Turkey
Datum konání akce
20. 8. 2019
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—

Podobné výsledky(10)

Diarization Based on Identification with X-Vectors Applying EEND Diarization to Telephone Recordings from a Call Center First Insight into the Processing of the Language Consulting Center Data

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Diarization of the Language Consulting Center Telephone Calls

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)