All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Small and Large Vocabulary Speech Recognition of MP3 Data under Real-Word Conditions: Experimental Study

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F12%3A00200338" target="_blank" >RIV/68407700:21230/12:00200338 - isvavai.cz</a>

  • Result on the web

    <a href="http://link.springer.com/chapter/10.1007/978-3-642-35755-8_29#" target="_blank" >http://link.springer.com/chapter/10.1007/978-3-642-35755-8_29#</a>

  • DOI - Digital Object Identifier

    <a href="http://dx.doi.org/10.1007/978-3-642-35755-8_29" target="_blank" >10.1007/978-3-642-35755-8_29</a>

Alternative languages

  • Result language

    angličtina

  • Original language name

    Small and Large Vocabulary Speech Recognition of MP3 Data under Real-Word Conditions: Experimental Study

  • Original language description

    This paper presents the study of speech recognition accuracy both for small and large vocabulary task with respect to different levels of MP3 compression of processed data. The motivation behind the work was to evaluate the usage of ASR system for off-line automatic transcription of recordings collected from standard present MP3 devices under different levels of background noise and channel distortion. Although MP3 may not be an optimal compression algorithm, the performed experiments have prooved thatit does not distort speech signal significantly for higher compression rates. Realized experiments showed also that the accuracy of speech recognition (both small- and large-vocabulary) decreased very slowly for the bit-rate of 24 kbps and higher. However, slightly different setup of speech feature computation is necessary for MP3 speech data, mainly PLP features give significantly better results in comparison to MFCC.

  • Czech name

  • Czech description

Classification

  • Type

    J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)

  • CEP classification

    JA - Electronics and optoelectronics

  • OECD FORD branch

Result continuities

  • Project

    <a href="/en/project/GA102%2F08%2F0707" target="_blank" >GA102/08/0707: Speech Recognition under Real-World Conditions</a><br>

  • Continuities

    S - Specificky vyzkum na vysokych skolach

Others

  • Publication year

    2012

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Name of the periodical

    Communications in Computer and Information Science

  • ISSN

    1865-0929

  • e-ISSN

  • Volume of the periodical

    314

  • Issue of the periodical within the volume

  • Country of publishing house

    DE - GERMANY

  • Number of pages

    11

  • Pages from-to

    409-419

  • UT code for WoS article

    000315973800029

  • EID of the result in the Scopus database