Methods for Rapid Development of Automatic Speech Recognition System for Russian

The result's identifiers

  • Result code in IS VaVaI

    RIV/46747885:24220/15:00002968 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F15%3A00002968)

  • Result on the web

    http://dx.doi.org/10.1109/ECMSM.2015.7208686

  • DOI - Digital Object Identifier

    10.1109/ECMSM.2015.7208686

Alternative languages

  • Result language

    English

  • Original language name

    Methods for Rapid Development of Automatic Speech Recognition System for Russian

  • Original language description

    In this paper we present our approach to the rapid and efficient development of an automatic speech recognition (ASR) system for Russian. We reuse tools, procedures and data previously designed and collected for two other Slavic languages, Czech and Slovak. We show how we build a large corpus of texts acquired from major publishers' web pages and convert it from Cyrillic to Latin script to simplify further processing. The corpus is used to create a representative lexicon with 218K words and 259K pronunciations, as well as a probabilistic language model. To train the acoustic model (AM), we use the GlobalPhone database of recordings and a largely automated scheme that includes bootstrapping with an existing Czech AM and several iterative steps that gradually improve both the phonetic annotations and the target Russian AM. The most recent prototype of the Russian ASR system is evaluated on the test part of the GlobalPhone database and achieves an 18.2% word error rate.

  • Czech name

  • Czech description
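
Note on the reported metric: the description above quotes a word error rate (WER) but the record does not say how it was computed. As a minimal sketch, assuming the conventional definition (word-level substitutions, deletions and insertions divided by the number of reference words), it could be computed as below. The function name `word_error_rate` and the whitespace tokenization are illustrative assumptions, not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: words are separated by whitespace; no text normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing in hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)
```

For example, with the made-up pair `word_error_rate("a b c d", "a x c")`, one substitution and one deletion against four reference words give 0.5. Real evaluations typically normalize case and punctuation first, which this sketch omits.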

Classification

  • Type

    D - Article in proceedings

  • CEP classification

    JC - Computer hardware and software

  • OECD FORD branch

Result continuities

  • Project

    TA04010199: MULTILINMEDIA - Multilingual Multimedia Monitoring and Analyzing Platform

  • Continuities

    P - Research and development project financed from public funds (with a link to CEP)

Others

  • Publication year

    2015

  • Confidentiality

    S - Complete and true data on the project are not subject to protection under special legal regulations

Data specific for result type

  • Article name in the collection

    2015 IEEE International Workshop of Electronics, Control, Measurement, Signals and their application to Mechatronics

  • ISBN

    978-1-4799-6972-2

  • ISSN

  • e-ISSN

  • Number of pages

    6

  • Pages from-to

    26-31

  • Publisher name

    IEEE

  • Place of publication

    Czech Republic

  • Event location

    Czech Republic, Liberec

  • Event date

  • Type of event by nationality

    WRD - Worldwide event

  • UT code for WoS article

    000363814500011