All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Audio-Video Speaker Diarization for Unsupervised Speaker and Face Model Creation

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F14%3A00223246" target="_blank" >RIV/68407700:21230/14:00223246 - isvavai.cz</a>

  • Alternative codes found

    RIV/49777513:23520/14:43922925

  • Result on the web

    <a href="http://dx.doi.org/10.1007/978-3-319-10816-2_56" target="_blank" >http://dx.doi.org/10.1007/978-3-319-10816-2_56</a>

  • DOI - Digital Object Identifier

    <a href="http://dx.doi.org/10.1007/978-3-319-10816-2_56" target="_blank" >10.1007/978-3-319-10816-2_56</a>

Alternative languages

  • Result language

    angličtina

  • Original language name

    Audio-Video Speaker Diarization for Unsupervised Speaker and Face Model Creation

  • Original language description

    Our goal is to create speaker models in audio domain and face models in video do main from a set of videos in an unsupervised manner. Such models can be used later for speaker identification in audio domain (answering the question "Who was speaking and when") and/or fo r face recognition ("Who was seen and when") for given videos that contain speaking persons. T he proposed system is based on an audio-video diarization system that tries to resolve the dis advantages of the individual modalities. Experiments on broadcasts of Czech parliament meeting s show that the proposed combination of individual audio and video diarization systems yields an improvement of the diarization error rate (DER).

  • Czech name

  • Czech description

Classification

  • Type

    D - Article in proceedings

  • CEP classification

    JD - Use of computers, robotics and its application

  • OECD FORD branch

Result continuities

  • Project

    <a href="/en/project/GBP103%2F12%2FG084" target="_blank" >GBP103/12/G084: Center for Large Scale Multi-modal Data Interpretation</a><br>

  • Continuities

    P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Others

  • Publication year

    2014

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Article name in the collection

    Text, Speech, and Dialogue. 17th International Conference, TSD 2014

  • ISBN

    978-3-319-10815-5

  • ISSN

    0302-9743

  • e-ISSN

  • Number of pages

    8

  • Pages from-to

    465-472

  • Publisher name

    Springer

  • Place of publication

    Heidelberg

  • Event location

    Brno

  • Event date

    Sep 8, 2014

  • Type of event by nationality

    EUR - Evropská akce

  • UT code for WoS article