Modified Feature Extraction Methods in Robust Speech Recognition

The result's identifiers

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F07%3A03129812" target="_blank" >RIV/68407700:21230/07:03129812 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—

Alternative languages

Result language
angličtina
Original language name
Modified Feature Extraction Methods in Robust Speech Recognition
Original language description
The speech recognisers use a parametric form of the signal to get the most important features in speech for the recognition task. Mel-frequency cepstral coefficients (MFCC) and Perceptual linear prediction coefficients (PLP) belong to the most commonly used methods. There is no rule to decide which one is better to use and it depends mainly on the particular conditions. The tests on taking advantage of different parts of each parametrization process to get the best results in given conditions are presented in this paper. Robust Hidden Markov model-based (HMM) Czech digit recogniser in slightly noisy environment is used for this purpose. The experiments show, that using Bark-frequency scaling, equal loudness pre-emphasis and intensity-loudness power lawin the original MFCC method can bring improvement in white noise robustness for particular conditions. The results also uncovered that the LP-based methods tend to generate insertion errors in given environment.
Czech name
Modifikované metody extrakce příznaků pro robustní rozpoznávání řeči
Czech description
Článek prezentuje experimenty v oblasti modifikace standardních parametrizačních technik využívaných při robustním rozpoznávání řeči. Navržené modifikace kombinují jednotlivé bloky standardních parametrizací pro zvýšení robustnosti systému pracujícího vzašuměném prostředí. Pro porovnání vlivu navržených technik je využit rozpoznávač číslovek na bázi HMM kontextově nezávislých fonémů. Experimenty ukazují, že zpracování signálu na bázi lineární predikce vede v daných podmínkách k vyššímu výskytu chyb typu inzerce. Jeho nahrazení přímým výpočtem spektra metodou DCT lze také docílit zvýšené odolnosti systému vůči bílému šumu.

Classification

Type
D - Article in proceedings
CEP classification
JA - Electronics and optoelectronics
OECD FORD branch
—

Result continuities

Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
Z - Vyzkumny zamer (s odkazem do CEZ)

Others

Publication year
2007
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

Article name in the collection
Proceedings of 17th International Conference Radioelektronika 2007
ISBN
1-4244-0821-0
ISSN
—
e-ISSN
—
Number of pages
4
Pages from-to
521-524
Publisher name
Institute of Electrical and Electronic Engineers
Place of publication
Piscataway
Event location
VUT v Brně, FEKT, ÚREL
Event date
Apr 24, 2007
Type of event by nationality
EUR - Evropská akce
UT code for WoS article
—

Similar results(10)

Improving the computational complexity and word recognition rate for dysarthria speech using robust frame selection algorithm Speech reconstruction from the mel frequency cepstral coefficients Modification of the Speech Feature Extraction Module for the Improvement of the System for Automatic lectures transcription

What are you looking for?

Quick search

Smart search

Modified Feature Extraction Methods in Robust Speech Recognition

The result's identifiers

Alternative languages

Classification

Result continuities

Others

Data specific for result type

Similar results(10)

What are you looking for?

Quick search

Smart search

Result description

The result's identifiers

The result's identifiers

Alternative languages

Alternative languages

Classification

Classification

Result continuities

Result continuities

Others

Others

Data specific for result type

Data specific for result type

Similar results(10)