Empty pause detection in noisy and clean speech conditions
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26220%2F06%3APU63412" target="_blank" >RIV/00216305:26220/06:PU63412 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Empty pause detection in noisy and clean speech conditions
Original language description
Successful pause detection becomes an important part in the process of speech recognition and speech coding as well as in the biometrical field (stress detection) and human-machine interaction. Nowadays, only a few of proposed algorithms are able to reflect various noise conditions. This is considered in presented paper which results from research that has been made at the International Institute for Advanced Scientific Studies (IIASS) and proposes novel method for non-speech activity pause detection inspontaneous speech recordings made in noisy environments. The input signal is transformed into log spectral energy and is divided into specific frequency bands. Each band is smoothed and tracked by dynamically adjusted thresholds based on noise energy estimation. Thresholds are adapted taking into account the dynamic changes of the speech signal under environmental noise. The proposed method run in real time and does not require a priori knowledge of the SNR and a priori threshold value
Czech name
Detekce prázdných pauz v podmínkách zašuměné a čisté řeči
Czech description
Successful pause detection becomes an important part in the process of speech recognition and speech coding as well as in the biometrical field (stress detection) and human-machine interaction. Nowadays, only a few of proposed algorithms are able to reflect various noise conditions. This is considered in presented paper which results from research that has been made at the International Institute for Advanced Scientific Studies (IIASS) and proposes novel method for non-speech activity pause detection inspontaneous speech recordings made in noisy environments. The input signal is transformed into log spectral energy and is divided into specific frequency bands. Each band is smoothed and tracked by dynamically adjusted thresholds based on noise energy estimation. Thresholds are adapted taking into account the dynamic changes of the speech signal under environmental noise. The proposed method run in real time and does not require a priori knowledge of the SNR and a priori threshold value
Classification
Type
D - Article in proceedings
CEP classification
JA - Electronics and optoelectronics
OECD FORD branch
—
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>Z - Vyzkumny zamer (s odkazem do CEZ)
Others
Publication year
2006
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
16th Czech-German Workshop on Speech Processing
ISBN
8086269159
ISSN
—
e-ISSN
—
Number of pages
130
Pages from-to
125-254
Publisher name
Institute of Radio Engineering and Electronics AS CR
Place of publication
Praha
Event location
Praha
Event date
Sep 27, 2006
Type of event by nationality
EUR - Evropská akce
UT code for WoS article
—