Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F20%3A00007107" target="_blank" >RIV/46747885:24220/20:00007107 - isvavai.cz</a>
Result on the web
<a href="https://arxiv.org/abs/1910.11824" target="_blank" >https://arxiv.org/abs/1910.11824</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/ICASSP40776.2020.9054693" target="_blank" >10.1109/ICASSP40776.2020.9054693</a>

Result language
angličtina
Original language name
Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors
Original language description
We propose a novel algorithm for adaptive blind audio source extraction. The proposed method is based on independent vector analysis and utilizes the auxiliary function optimization to achieve high convergence speed. The algorithm is partially supervised by a pilot signal related to the source of interest (SOI), which ensures that the method correctly extracts the utterance of the desired speaker. The pilot is based on the identification of a dominant speaker in the mixture using x-vectors. The properties of the x-vectors computed in the presence of cross-talk are experimentally analyzed. The proposed approach is verified in a scenario with a moving SOI, static interfering speaker and environmental noise.
Czech name
—
Czech description
—

Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach

Publication year
2020
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Article name in the collection
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISBN
978-1-5090-6631-5
ISSN
1520-6149
e-ISSN
—
Number of pages
5
Pages from-to
676-680
Publisher name
IEEE
Place of publication
Barcelona
Event location
Barcelona
Event date
Jan 1, 2020
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
000615970400135

Similar results(10)