Developing Text and Speech Databases for Speech Recognition of Vietnamese

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F13%3A%230002804" target="_blank" >RIV/46747885:24220/13:#0002804 - isvavai.cz</a>
Výsledek na webu
<a href="http://dx.doi.org/10.1109/IDAACS.2013.6662662" target="_blank" >http://dx.doi.org/10.1109/IDAACS.2013.6662662</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/IDAACS.2013.6662662" target="_blank" >10.1109/IDAACS.2013.6662662</a>

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Developing Text and Speech Databases for Speech Recognition of Vietnamese
Popis výsledku v původním jazyce
This paper describes our study on developing the text and speech databases for automatic speech recognition of Vietnamese using an available source of linguistic data: the Internet. First, a two-stage procedure is applied to extract a general text corpuswhich can be used for researches on Vietnamese language such as speech recognition, audio-visual speech recognition, and natural language processing... We also collect another specific text corpus in the field of news and literature using the resource from some main websites of Vietnamese. The total text corpus containing 8,681,869 sentences with more than 124 million syllables is then used to build and test the language model for the speech recognizer. Besides, the collecting of speech corpora for experiments on continuous speech recognition and audio-visual speech recognition of Vietnamese are also described.
Název v anglickém jazyce
Developing Text and Speech Databases for Speech Recognition of Vietnamese
Popis výsledku anglicky
This paper describes our study on developing the text and speech databases for automatic speech recognition of Vietnamese using an available source of linguistic data: the Internet. First, a two-stage procedure is applied to extract a general text corpuswhich can be used for researches on Vietnamese language such as speech recognition, audio-visual speech recognition, and natural language processing... We also collect another specific text corpus in the field of news and literature using the resource from some main websites of Vietnamese. The total text corpus containing 8,681,869 sentences with more than 124 million syllables is then used to build and test the language model for the speech recognizer. Besides, the collecting of speech corpora for experiments on continuous speech recognition and audio-visual speech recognition of Vietnamese are also described.

Klasifikace

Druh
D - Stať ve sborníku
CEP obor
JC - Počítačový hardware a software
OECD FORD obor
—

Návaznosti výsledku

Projekt
—
Návaznosti
S - Specificky vyzkum na vysokych skolach

Ostatní

Rok uplatnění
2013
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název statě ve sborníku
Proceedings of the 2013 IEEE 7th International Conference on Intelligent Data Acquisition and Advanced Computing Systems, IDAACS 2013
ISBN
9781479914265
ISSN
—
e-ISSN
—
Počet stran výsledku
4
Strana od-do
163-166
Název nakladatele
—
Místo vydání
—
Místo konání akce
Německo
Datum konání akce
1. 1. 2013
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—

Podobné výsledky(10)

Czech audio-visual speech corpus of a car driver for in-vehicle audio-visual speech recognition Návrh ruského audiovizuálního řečového korpusu pro bimodální rozpoznávání řeči HAVRUS Corpus: High-Speed Recordings of Audo-Visual Russian Speech

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Developing Text and Speech Databases for Speech Recognition of Vietnamese

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)