Developing Text and Speech Databases for Speech Recognition of Vietnamese
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F13%3A%230002804" target="_blank" >RIV/46747885:24220/13:#0002804 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1109/IDAACS.2013.6662662" target="_blank" >http://dx.doi.org/10.1109/IDAACS.2013.6662662</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/IDAACS.2013.6662662" target="_blank" >10.1109/IDAACS.2013.6662662</a>
Alternative languages
Result language
angličtina
Original language name
Developing Text and Speech Databases for Speech Recognition of Vietnamese
Original language description
This paper describes our study on developing the text and speech databases for automatic speech recognition of Vietnamese using an available source of linguistic data: the Internet. First, a two-stage procedure is applied to extract a general text corpuswhich can be used for researches on Vietnamese language such as speech recognition, audio-visual speech recognition, and natural language processing... We also collect another specific text corpus in the field of news and literature using the resource from some main websites of Vietnamese. The total text corpus containing 8,681,869 sentences with more than 124 million syllables is then used to build and test the language model for the speech recognizer. Besides, the collecting of speech corpora for experiments on continuous speech recognition and audio-visual speech recognition of Vietnamese are also described.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JC - Computer hardware and software
OECD FORD branch
—
Result continuities
Project
—
Continuities
S - Specificky vyzkum na vysokych skolach
Others
Publication year
2013
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 2013 IEEE 7th International Conference on Intelligent Data Acquisition and Advanced Computing Systems, IDAACS 2013
ISBN
9781479914265
ISSN
—
e-ISSN
—
Number of pages
4
Pages from-to
163-166
Publisher name
—
Place of publication
—
Event location
Německo
Event date
Jan 1, 2013
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—