Text-to-Speech Alignment for Imperfect Transcriptions

Result identifiers

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F13%3A%230002594" target="_blank" >RIV/46747885:24220/13:#0002594 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1007/978-3-642-40585-3_6" target="_blank" >http://dx.doi.org/10.1007/978-3-642-40585-3_6</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-642-40585-3_6" target="_blank" >10.1007/978-3-642-40585-3_6</a>

Alternative languages

Result language
English
Title in original language
Text-to-Speech Alignment for Imperfect Transcriptions
Result description in original language
In this paper we propose a method for text-to-speech alignment intended for imperfect (text) transcriptions. We designed an ASR-based (automatic speech recognition) tool complemented with a special post-processing layer that finds anchor points in the transcription and then aligns the data between these anchor points. As the system does not depend on the commonly used keyword spotter, it is not as vulnerable to noisy recordings as some other approaches. We also present other features of the system (e.g. preserving the document structure and processing numbers) that allow us to use it in many other specific tasks. The performance is evaluated on a challenging set of recordings containing spontaneous speech with many hesitations, repetitions, etc., as well as on noisy recordings.
Title in English
Text-to-Speech Alignment for Imperfect Transcriptions
Result description in English
In this paper we propose a method for text-to-speech alignment intended for imperfect (text) transcriptions. We designed an ASR-based (automatic speech recognition) tool complemented with a special post-processing layer that finds anchor points in the transcription and then aligns the data between these anchor points. As the system does not depend on the commonly used keyword spotter, it is not as vulnerable to noisy recordings as some other approaches. We also present other features of the system (e.g. preserving the document structure and processing numbers) that allow us to use it in many other specific tasks. The performance is evaluated on a challenging set of recordings containing spontaneous speech with many hesitations, repetitions, etc., as well as on noisy recordings.
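The anchor-point idea in the description can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the recognizer output is already available as (word, start time in ms) pairs, uses Python's standard `difflib.SequenceMatcher` as a stand-in for the post-processing layer, and spreads unmatched transcript words evenly between neighbouring anchors. The names `find_anchors`, `align` and the `min_run` threshold are hypothetical.

```python
from difflib import SequenceMatcher


def find_anchors(transcript, recognized_words, min_run=3):
    """Return (transcript_index, recognized_index, length) for each anchor.

    An anchor here is a run of at least `min_run` identical consecutive
    words in both sequences. The threshold is an illustrative assumption.
    """
    matcher = SequenceMatcher(a=transcript, b=recognized_words, autojunk=False)
    return [
        (m.a, m.b, m.size)
        for m in matcher.get_matching_blocks()
        if m.size >= min_run
    ]


def align(transcript, recognized, min_run=3):
    """Assign a start time to each transcript word.

    `recognized` is a list of (word, start_ms) pairs. Words inside an anchor
    take the recognizer's time directly. Words between two anchors are
    placed by linear interpolation. Words before the first anchor or after
    the last one stay None.
    """
    recognized_words = [word for word, _ in recognized]
    anchors = find_anchors(transcript, recognized_words, min_run)
    times = [None] * len(transcript)

    # Copy recognizer timestamps for every word covered by an anchor.
    for t_start, r_start, size in anchors:
        for k in range(size):
            times[t_start + k] = recognized[r_start + k][1]

    # Interpolate the transcript words lying between consecutive anchors.
    for (t1, _, size1), (t2, _, _) in zip(anchors, anchors[1:]):
        left, right = t1 + size1 - 1, t2
        span = times[right] - times[left]
        for i in range(left + 1, right):
            times[i] = times[left] + span * (i - left) / (right - left)
    return times


# The transcript contains a word ("new") that was not recognized, and the
# recording contains a hesitation ("uh") and a repetition ("we we").
transcript = "in this paper we propose a new method for speech alignment".split()
recognized = [
    ("in", 0), ("this", 300), ("paper", 600), ("uh", 1000), ("we", 1400),
    ("we", 1800), ("propose", 2000), ("a", 2400), ("method", 2600),
    ("for", 3000), ("speech", 3200), ("alignment", 3600),
]
word_times = align(transcript, recognized)
```

In this toy input the hesitation and the repetition fall between anchors and are skipped, while the unrecognized transcript word receives a time interpolated from its neighbours.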

Classification

Type
D - Article in proceedings
CEP field
JC - Computer hardware and software
OECD FORD field
—

Result continuities

Project
<a href="/cs/project/TA01011204" target="_blank" >TA01011204: Živé archivy</a><br>
Continuities
P - Research and development project financed from public funds (with a link to CEP)

Others

Publication year
2013
Data confidentiality code
S - Complete and true data on the project are not subject to protection under special legal regulations

Data specific to the result type

Title of the article in the proceedings
Proc. of 16th International Conference TSD 2013
ISBN
9783642405846
ISSN
0302-9743
e-ISSN
—
Number of pages
8
Pages from-to
536-543
Publisher name
Springer-Verlag Berlin Heidelberg
Place of publication
Germany, Berlin
Event location
Czech Republic
Event date
1. 1. 2013
Event type by nationality
WRD - Worldwide event
UT WoS article code
—

Similar results (10)

ALIGN - software for supporting semi-automatic alignment of recordings with existing transcripts
ParCzech4Speech 1.0
Automatic speech segmentation based on alignment with a text-to-speech system.
