Parliament Archives Used for Automatic Training of Multi-lingual Automatic Speech Recognition Systems

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F17%3A00004815" target="_blank" >RIV/46747885:24220/17:00004815 - isvavai.cz</a>
Výsledek na webu
<a href="http://dx.doi.org/10.1007/978-3-319-64206-2_20" target="_blank" >http://dx.doi.org/10.1007/978-3-319-64206-2_20</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-319-64206-2_20" target="_blank" >10.1007/978-3-319-64206-2_20</a>

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Parliament Archives Used for Automatic Training of Multi-lingual Automatic Speech Recognition Systems
Popis výsledku v původním jazyce
In the paper we present a fully automated process capable of creating speech databases needed for training acoustic models for speech recognition systems. We show that archives of national parliaments are perfect sources of speech and text data suited for a lightly supervised training scheme, which does not require human intervention. We describe the process and its procedures in details and demonstrate its usage on three Slavic languages (Polish, Russian and Bulgarian). Practical evaluation is done on a broadcast news task and yields better results than those obtained on some established speech databases.
Název v anglickém jazyce
Parliament Archives Used for Automatic Training of Multi-lingual Automatic Speech Recognition Systems
Popis výsledku anglicky
In the paper we present a fully automated process capable of creating speech databases needed for training acoustic models for speech recognition systems. We show that archives of national parliaments are perfect sources of speech and text data suited for a lightly supervised training scheme, which does not require human intervention. We describe the process and its procedures in details and demonstrate its usage on three Slavic languages (Polish, Russian and Bulgarian). Practical evaluation is done on a broadcast news task and yields better results than those obtained on some established speech databases.

Klasifikace

Druh
D - Stať ve sborníku
CEP obor
—
OECD FORD obor
20204 - Robotics and automatic control

Návaznosti výsledku

Projekt
<a href="/cs/project/TA04010199" target="_blank" >TA04010199: MULTILINMEDIA - Multilinguální platforma pro monitoring a analýzu multimédií</a><br>
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace

Ostatní

Rok uplatnění
2017
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název statě ve sborníku
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISBN
9783319642055
ISSN
0302-9743
e-ISSN
—
Počet stran výsledku
9
Strana od-do
174-182
Název nakladatele
Springer Verlag
Místo vydání
Německo
Místo konání akce
Praha, Česká Republika
Datum konání akce
1. 1. 2017
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—

Podobné výsledky(10)

Automatic Speech Recognition Benchmark for Air-Traffic Communications ALIGN - software pro podporu poloautomatického zarovnání nahrávek s existujícími přepisy Česká audiovizuální syntéza řeči

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Parliament Archives Used for Automatic Training of Multi-lingual Automatic Speech Recognition Systems

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)