Free on-line speech recogniser based on Kaldi ASR toolkit producing word posterior lattices

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F14%3A10289403" target="_blank" >RIV/00216208:11320/14:10289403 - isvavai.cz</a>
Výsledek na webu
<a href="http://www.aclweb.org/anthology/W14-4315" target="_blank" >http://www.aclweb.org/anthology/W14-4315</a>
DOI - Digital Object Identifier
—

Jazyk výsledku
angličtina
Název v původním jazyce
Free on-line speech recogniser based on Kaldi ASR toolkit producing word posterior lattices
Popis výsledku v původním jazyce
This paper presents an extension of the Kaldi automatic speech recognition toolkit to support on-line recognition. The resulting recogniser supports acoustic models trained using state-of-the-art acoustic modelling techniques. As the recogniser producesword posterior lattices, it is particularly useful in statistical dialogue systems, which try to exploit uncertainty in the recognizer's output. Our experiments show that the on- line recogniser performs significantly better in terms of latency when compared to a cloud-based recogniser.
Název v anglickém jazyce
Free on-line speech recogniser based on Kaldi ASR toolkit producing word posterior lattices
Popis výsledku anglicky
This paper presents an extension of the Kaldi automatic speech recognition toolkit to support on-line recognition. The resulting recogniser supports acoustic models trained using state-of-the-art acoustic modelling techniques. As the recogniser producesword posterior lattices, it is particularly useful in statistical dialogue systems, which try to exploit uncertainty in the recognizer's output. Our experiments show that the on- line recogniser performs significantly better in terms of latency when compared to a cloud-based recogniser.

Projekt
<a href="/cs/project/LK11221" target="_blank" >LK11221: Vývoj metod pro návrh statistických mluvených dialogových systémů</a><br>
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Rok uplatnění
2014
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Název statě ve sborníku
Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue
ISBN
978-1-941643-21-1
ISSN
—
e-ISSN
—
Počet stran výsledku
5
Strana od-do
108-112
Název nakladatele
Association for Computational Linguistics
Místo vydání
Stroudsburg, PA, USA
Místo konání akce
Philadelphia, PA, USA
Datum konání akce
18. 6. 2014
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—

Podobné výsledky(10)