All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Accuracy of MP3 Speech Recognition Under Real-World Conditions. Experimental Study

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F11%3A00181583" target="_blank" >RIV/68407700:21230/11:00181583 - isvavai.cz</a>

  • Result on the web

    <a href="http://www.sigmap.icete.org" target="_blank" >http://www.sigmap.icete.org</a>

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    angličtina

  • Original language name

    Accuracy of MP3 Speech Recognition Under Real-World Conditions. Experimental Study

  • Original language description

    This paper presents the study of speech recognition accuracy with respect to different levels of MP3 compression. Special attention is focused on the processing of speech signals with different quality, i.e. with different level of background noise and channel distortion. The work was motivated by possible usage of ASR for offline automatic transcription of audio recordings collected by standard wide-spread MP3 devices. The realized experiments have proved that although MP3 format does not distort speech significantly especially for high or moderate bit rates and high quality of source data. The accuracy of connected digits ASR decreased very slowly up to the bit rate 24 kbps. For the best case of PLP parameterization in close-talk channel just 3% decrease of recognition accuracy was observed while the size of the compressed file was approximately 10% of the original size. All results were slightly worse under presence of additive background noise and channel distortion.

  • Czech name

  • Czech description

Classification

  • Type

    D - Article in proceedings

  • CEP classification

    JA - Electronics and optoelectronics

  • OECD FORD branch

Result continuities

  • Project

    <a href="/en/project/GA102%2F08%2F0707" target="_blank" >GA102/08/0707: Speech Recognition under Real-World Conditions</a><br>

  • Continuities

    Z - Vyzkumny zamer (s odkazem do CEZ)

Others

  • Publication year

    2011

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Article name in the collection

    Proceedings of SIGMAP 2011 - International Conference on Signal Processing and Multimedia Applications.

  • ISBN

    978-989-8425-72-0

  • ISSN

  • e-ISSN

  • Number of pages

    6

  • Pages from-to

    5-10

  • Publisher name

    University of Seville

  • Place of publication

    Sevilla

  • Event location

    Sevilla

  • Event date

    Jul 18, 2011

  • Type of event by nationality

    WRD - Celosvětová akce

  • UT code for WoS article