Segmentation of Speech and Humming in Vocal Input
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F12%3A00196538" target="_blank">RIV/68407700:21230/12:00196538 - isvavai.cz</a>
Result on the web
<a href="http://www.radioeng.cz/fulltexts/2012/12_03_0923_0929.pdf" target="_blank">http://www.radioeng.cz/fulltexts/2012/12_03_0923_0929.pdf</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Segmentation of Speech and Humming in Vocal Input
Original language description
Non-verbal vocal interaction (NVVI) is an interaction method that uses human-produced sounds other than speech, such as humming. NVVI complements traditional speech recognition systems with continuous control. To combine the two approaches (e.g., "volume up, mmm"), the input sound signal must be segmented into speech and NVVI. This paper presents two novel methods of speech and humming segmentation. The first method classifies MFCC and RMS parameters using a neural network (MFCC method), while the second computes volume changes in the signal (IAC method). The two methods are compared using a corpus collected from 13 speakers. The results indicate that the MFCC method outperforms the IAC method in terms of accuracy, precision, and recall.
Czech name
—
Czech description
—
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
JC - Computer hardware and software
OECD FORD branch
—
Result continuities
Project
—
Continuities
Z - Research plan (with a link to CEZ)
Others
Publication year
2012
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Name of the periodical
Radioengineering
ISSN
1210-2512
e-ISSN
—
Volume of the periodical
21
Issue of the periodical within the volume
3
Country of publishing house
CZ - CZECH REPUBLIC
Number of pages
7
Pages from-to
923-929
UT code for WoS article
000309253000021
EID of the result in the Scopus database
—