System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F11%3A43898200" target="_blank" >RIV/49777513:23520/11:43898200 - isvavai.cz</a>
Výsledek na webu
<a href="http://dx.doi.org/10.1186/1687-4722-2011-10" target="_blank" >http://dx.doi.org/10.1186/1687-4722-2011-10</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1186/1687-4722-2011-10" target="_blank" >10.1186/1687-4722-2011-10</a>
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive
Popis výsledku v původním jazyce
The main objective of the work presented in this paper was to develop a complete system that would accomplish the original visions of the MALACH project. Those goals were to employ automatic speech recognition and information retrieval techniques to provide improved access to the large video archive containing recorded testimonies of the Holocaust survivors. The system has been so far developed for the Czech part of the archive only. It takes advantage of the state-of-the-art speech recognition system tailored to the challenging properties of the recordings in the archive (elderly speakers, spontaneous speech and emotionally loaded content) and its close coupling with the actual search engine.The design of the algorithm adopting the spoken term detection approach is focused on the speed of the retrieval. The resulting system is able to search through the 1,000 h of video constituting the Czech portion of the archive and find query word occurrences in the matter of seconds.
Název v anglickém jazyce
System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive
Popis výsledku anglicky
The main objective of the work presented in this paper was to develop a complete system that would accomplish the original visions of the MALACH project. Those goals were to employ automatic speech recognition and information retrieval techniques to provide improved access to the large video archive containing recorded testimonies of the Holocaust survivors. The system has been so far developed for the Czech part of the archive only. It takes advantage of the state-of-the-art speech recognition system tailored to the challenging properties of the recordings in the archive (elderly speakers, spontaneous speech and emotionally loaded content) and its close coupling with the actual search engine.The design of the algorithm adopting the spoken term detection approach is focused on the speed of the retrieval. The resulting system is able to search through the 1,000 h of video constituting the Czech portion of the archive and find query word occurrences in the matter of seconds.
Klasifikace
Druh
J<sub>x</sub> - Nezařazeno - Článek v odborném periodiku (Jimp, Jsc a Jost)
CEP obor
JD - Využití počítačů, robotika a její aplikace
OECD FORD obor
—
Návaznosti výsledku
Projekt
<a href="/cs/project/1QS101470516" target="_blank" >1QS101470516: Automatické vyhledávání klíčových slov v proudu zvukových dat</a><br>
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach
Ostatní
Rok uplatnění
2011
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název periodika
EURASIP Journal on Audio, Speech and Music Processing
ISSN
1687-4714
e-ISSN
—
Svazek periodika
2011
Číslo periodika v rámci svazku
10
Stát vydavatele periodika
US - Spojené státy americké
Počet stran výsledku
19
Strana od-do
1-19
Kód UT WoS článku
000299122700001
EID výsledku v databázi Scopus
—