Convolutional Neural Network in the Task of Speaker Change Detection
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F16%3A43929716" target="_blank" >RIV/49777513:23520/16:43929716 - isvavai.cz</a>
Result on the web
<a href="http://link.springer.com/chapter/10.1007/978-3-319-43958-7_22" target="_blank" >http://link.springer.com/chapter/10.1007/978-3-319-43958-7_22</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-319-43958-7_22" target="_blank" >10.1007/978-3-319-43958-7_22</a>
Alternative languages
Result language
angličtina
Original language name
Convolutional Neural Network in the Task of Speaker Change Detection
Original language description
This paper presents an approach to detect speaker changes in telephone conversations. The speaker change problem is presented as a classification problem. We use a Convolutional Neural Network to analyze short audio segments. The Network plays a role of a regressor. It outputs higher values for segments that are more likely to contain a speaker change. Upon thresholding the regressed value the decision about the segment is made. The experiment shows that the Convolutional Neural Network outperforms a baseline system based on the Bayesian Information Criterion. It behaves very well on previously unseen data produced by previously unheard speakers.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JD - Use of computers, robotics and its application
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/GBP103%2F12%2FG084" target="_blank" >GBP103/12/G084: Center for Large Scale Multi-modal Data Interpretation</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2016
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Speech and Computer 18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings
ISBN
978-3-319-43957-0
ISSN
0302-9743
e-ISSN
—
Number of pages
8
Pages from-to
191-198
Publisher name
Springer
Place of publication
Heidelberg
Event location
Budapesť, Maďarsko
Event date
Aug 23, 2016
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
000389335600022