Indexing of audiovisual archives using automatic speech and image recognition methods

Result identifiers

Result code in IS VaVaI
RIV/49777513:23520/20:43958857 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F20%3A43958857)
Result on the web
https://www.nacr.cz/wp-content/uploads/2019/12/KnihaAUDS_e-kniha_DEF.pdf
DOI - Digital Object Identifier
—

Alternative languages

Result language
German
Title in the original language
Die Indexierung von audiovisuellen Archiven unter Benutzung von Methoden der automatischen Sprach- und Bilderkennung
Description of the result in the original language
Die überwiegende Mehrheit der Suchsysteme in Sprachdaten ist als Serienkombination eines Sprachdekoders mit einem Informationssuchsystem in Textdaten konzipiert. Diese Konfiguration ermöglicht es relativ leicht, die schon existierenden Systeme für automatische Spracherkennung (engl. automatic speech recognition – ASR) und für Informationssuche (IR) zu benutzen. Auf der anderen Seite wird bei solch einer Kombination von zwei im Prinzip unabhängigen Modulen die Tatsache ignoriert, dass die durchsuchten Textdokumente mit Erkennungsfehlern behaftet sind, aber auch, dass ein möglicher „reicherer“ Output des Sprachdekoders, z. B. in Form von Gittern, nicht benutzt wird. Es ist also das Ziel dieses Beitrages, vor allem Methoden vorzustellen, die auf einer engeren Verknüpfung der ASR- und IR-Module beruhen.
Title in English
Indexing of audiovisual archives using automatic speech and image recognition methods
Description of the result in English
The vast majority of spoken-data retrieval systems are designed as a serial combination of a speech recognizer and a text retrieval system. This configuration makes it relatively easy to reuse existing automatic speech recognition (ASR) and information retrieval (IR) systems. However, combining two essentially independent modules in this way usually ignores the fact that the searched text documents contain recognition errors, and it leaves unused the potentially "richer" output of the recognizer, e.g. in the form of lattices. The aim of this contribution is therefore chiefly to present methods based on a tighter coupling of the ASR and IR modules.

Classification

Type
C - Chapter in a scholarly book
CEP field
—
OECD FORD field
20205 - Automation and control systems

Result continuities

Project
LM2018101: Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy (Digital Research Infrastructure for Language Technologies, Arts and Humanities)
Continuities
P - Publicly funded research and development project (with a link to CEP)

Other

Publication year
2020
Data confidentiality code
S - Complete and truthful data on the project are not subject to protection under special legal regulations

Data specific to the result type

Book or proceedings title
Archivierung von Unterlagen aus Digitalen Systemen
ISBN
978-80-7469-096-9
Number of pages of the result
9
Pages from-to
120-128
Number of pages in the book
254
Publisher name
Národní archiv
Place of publication
Praha
UT WoS code of the chapter
—

Similar results (10)

The IWSLT 2021 BUT Speech Translation Systems
Techniky automatického rozpoznávání řeči a vyhledávání informací pro zlepšení přístupu k videoarchivům obsahujícím kulturní dědictví
SW pro výběr a optimalizaci textového korpusu
