Blind extraction of moving audio source in a challenging environment supported by speaker identification via X-vectors

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F21%3A00008779" target="_blank" >RIV/46747885:24220/21:00008779 - isvavai.cz</a>
Výsledek na webu
<a href="https://asap.ite.tul.cz/wp-content/uploads/sites/3/2021/03/ICASSP2021___BSS_embeddings.pdf" target="_blank" >https://asap.ite.tul.cz/wp-content/uploads/sites/3/2021/03/ICASSP2021___BSS_embeddings.pdf</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/ICASSP39728.2021.9414331" target="_blank" >10.1109/ICASSP39728.2021.9414331</a>

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Blind extraction of moving audio source in a challenging environment supported by speaker identification via X-vectors
Popis výsledku v původním jazyce
We propose a novel approach for semi-supervised extraction of a moving audio source of interest (SOI) applicable in reverberant and noisy environments. The blind part of the method is based on independent vector extraction (IVE) and uses the recently proposed constant separating vector (CSV) mixing model. This model allows for changes of mixing parameters within the processed interval of the mixture, which potentially leads to higher accuracy of SOI estimation. The supervised part of the method concerns a pilot signal, which is related to the SOI and ensures the convergence of the blind method towards the SOI. The pilot is based on robust detection of frames where SOI is dominant via speaker embeddings called X-vectors. Robustness of the detection is achieved through augmentation of the data for the supervised training of the X-vectors. The pilot-supported extraction yields significantly better performance compared to its unsupervised counterpart identifying SOI solely using the initialization.
Název v anglickém jazyce
Blind extraction of moving audio source in a challenging environment supported by speaker identification via X-vectors
Popis výsledku anglicky
We propose a novel approach for semi-supervised extraction of a moving audio source of interest (SOI) applicable in reverberant and noisy environments. The blind part of the method is based on independent vector extraction (IVE) and uses the recently proposed constant separating vector (CSV) mixing model. This model allows for changes of mixing parameters within the processed interval of the mixture, which potentially leads to higher accuracy of SOI estimation. The supervised part of the method concerns a pilot signal, which is related to the SOI and ensures the convergence of the blind method towards the SOI. The pilot is based on robust detection of frames where SOI is dominant via speaker embeddings called X-vectors. Robustness of the detection is achieved through augmentation of the data for the supervised training of the X-vectors. The pilot-supported extraction yields significantly better performance compared to its unsupervised counterpart identifying SOI solely using the initialization.

Klasifikace

Druh
D - Stať ve sborníku
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Návaznosti výsledku

Projekt
Výsledek vznikl pri realizaci vícero projektů. Více informací v záložce Projekty.
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach

Ostatní

Rok uplatnění
2021
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název statě ve sborníku
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISBN
—
ISSN
1520-6149
e-ISSN
—
Počet stran výsledku
5
Strana od-do
226-230
Název nakladatele
IEEE
Místo vydání
USA
Místo konání akce
Toronto, Canada
Datum konání akce
1. 1. 2021
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
000704288400046

Podobné výsledky(10)

Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors Blind Extraction of Target Speech Source: Three ways of Guidance Exploiting Supervised Speaker Embeddings Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Blind extraction of moving audio source in a challenging environment supported by speaker identification via X-vectors

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)