Neural-Network-based Spectrum Processing for Speech Recognition and Speaker Verification
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F15%3A43926637" target="_blank" >RIV/49777513:23520/15:43926637 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1007/978-3-319-25789-1_27" target="_blank" >http://dx.doi.org/10.1007/978-3-319-25789-1_27</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-319-25789-1_27" target="_blank" >10.1007/978-3-319-25789-1_27</a>
Alternative languages
Result language
angličtina
Original language name
Neural-Network-based Spectrum Processing for Speech Recognition and Speaker Verification
Original language description
In this paper, neural networks are applied as a feature extractors for a speech recognition system and a speaker verification system. A long-temporal features with delta coefficients, mean and variance normalization are applied when a neural-network-based feature extraction is trained together with a neural-network-based voice activity detector and with a neural-network-based acoustic model for speech recognition. In speaker verification, the acoustic model is replaced with a score computation. The performance of our speech recognition system was evaluated on the British English speech corpus WSJCAM0 and the performance of our speech verification system was evaluated on our Czech speech corpus.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
20205 - Automation and control systems
Result continuities
Project
<a href="/en/project/DF12P01OVV022" target="_blank" >DF12P01OVV022: ASR- and MT-based Access to a Large Archive of Cultural Heritage (AMALACH)</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2015
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Statistical Language and Speech Processing, Third International Conference, SLSP 2015, Budapest, Hungary, November 24-26, 2015. Proceedings
ISBN
978-3-319-25788-4
ISSN
0302-9743
e-ISSN
—
Number of pages
12
Pages from-to
288-299
Publisher name
Springer
Place of publication
Heidelberg
Event location
Budapešť, Maďarsko
Event date
Nov 24, 2015
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—