Fast Approximate Spoken Term Detection from Sequence of Phonemes

The result's identifiers

Result code in IS VaVaI
RIV/00216305:26230/08:PU80187 (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26230%2F08%3APU80187)
Result on the web
—
DOI - Digital Object Identifier
—

Alternative languages

Result language
English
Original language name
Fast Approximate Spoken Term Detection from Sequence of Phonemes
Original language description
We investigate the detection of spoken terms in conversational speech using phoneme recognition, with the objective of achieving a smaller index size as well as faster search speed. Speech is processed and indexed as a one-best phoneme sequence. We propose the use of a probabilistic pronunciation model for the search term to compensate for errors in phoneme recognition. This model is derived from the pronunciation of the word and the phoneme confusion matrix. Experiments are performed on the conversational telephone speech database distributed by NIST for the 2006 spoken term detection evaluation. We achieve about 1500 times smaller index size and 14 times faster search speed compared to the system using phoneme lattices, at the cost of relatively lower detection performance.
Czech name
Fast Approximate Spoken Term Detection from Sequence of Phonemes
Czech description
We investigate the detection of spoken terms in conversational speech using phoneme recognition, with the objective of achieving a smaller index size as well as faster search speed. Speech is processed and indexed as a one-best phoneme sequence. We propose the use of a probabilistic pronunciation model for the search term to compensate for errors in phoneme recognition. This model is derived from the pronunciation of the word and the phoneme confusion matrix. Experiments are performed on the conversational telephone speech database distributed by NIST for the 2006 spoken term detection evaluation. We achieve about 1500 times smaller index size and 14 times faster search speed compared to the system using phoneme lattices, at the cost of relatively lower detection performance.
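The description above mentions scoring a search term against a one-best phoneme sequence with the help of a phoneme confusion matrix. The following is a minimal sketch of that general idea, not the authors' implementation: it assumes substitution errors only (no insertions or deletions), and the names `CONFUSION`, `FLOOR`, `log_prob`, and `detect`, as well as every probability value, are hypothetical and chosen purely for illustration.

```python
import math

# Toy confusion probabilities P(recognized | spoken).
# Hypothetical values for illustration; not taken from the paper.
CONFUSION = {
    ("k", "k"): 0.8, ("k", "g"): 0.15,
    ("ae", "ae"): 0.7, ("ae", "eh"): 0.25,
    ("t", "t"): 0.85, ("t", "d"): 0.1,
}
FLOOR = 1e-4  # assumed probability for any pair not listed above


def log_prob(spoken, recognized):
    """Log-probability that `spoken` was recognized as `recognized`."""
    return math.log(CONFUSION.get((spoken, recognized), FLOOR))


def detect(term, index, threshold):
    """Slide the term's pronunciation over the one-best phoneme index.

    Returns (position, average log-probability) for every window whose
    score is at or above `threshold`. Only substitutions are modeled.
    """
    hits = []
    n = len(term)
    for start in range(len(index) - n + 1):
        window = index[start:start + n]
        score = sum(log_prob(s, r) for s, r in zip(term, window)) / n
        if score >= threshold:
            hits.append((start, score))
    return hits


# One-best phoneme sequence (the "index") and the term's pronunciation.
index = ["dh", "ax", "k", "ae", "t", "s", "g", "eh", "t"]
term = ["k", "ae", "t"]
hits = detect(term, index, threshold=-2.0)
```

With these toy values, the exact occurrence scores higher than the window where "k" and "ae" were recognized as the confusable "g" and "eh", yet both pass the threshold. Raising the threshold trades missed detections for fewer false alarms.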

Classification

Type
D - Article in proceedings
CEP classification
JC - Computer hardware and software
OECD FORD branch
—

Result continuities

Project
—
Continuities
Z - Research plan (with a link to CEZ)

Others

Publication year
2008
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations

Data specific for result type

Article name in the collection
The 31st Annual International ACM SIGIR Conference 20-24 July 2008, Singapore
ISBN
978-90-365-2697-5
ISSN
—
e-ISSN
—
Number of pages
8
Pages from-to
—
Publisher name
Association for Computing Machinery
Place of publication
Singapore
Event location
Singapore
Event date
Jul 20, 2008
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—

Similar results(10)

Combination of Word and Phoneme Approach for Spoken Term Detection
Hybrid word-subword decoding for spoken term detection
Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer
