Přístupnostní navigace
E-application
Search Search Close
Publication detail
PINTO, J. SZŐKE, I. PRASANNA, S. HEŘMANSKÝ, H.
Original Title
Fast Approximate Spoken Term Detection from Sequence of Phonemes
Type
article in a collection out of WoS and Scopus
Language
English
Original Abstract
We investigate the detection of spoken terms in conversa-tional speech using phoneme recognition with the objectiveof achieving smaller index size as well as faster search speed.Speech is processed and indexed as a sequence of one bestphoneme sequence. We propose the use of a probabilisticpronunciation model for the search term to compensate forthe errors in the recognition of phonemes. This model is de-rived using the pronunciation of the word and the phonemeconfusion matrix. Experiments are performed on the con-versational telephone speech database distributed by NISTfor the 2006 spoken term detection. We achieve about 1500times smaller index size and 14 times faster search speedcompared to the system using phoneme lattices, at the costof relatively lower detection performance.
Keywords
Spoken term detection, probabilistic pronunciation model, phoneme recognition, confusion matrix
Authors
PINTO, J.; SZŐKE, I.; PRASANNA, S.; HEŘMANSKÝ, H.
RIV year
2008
Released
24. 7. 2008
Publisher
Association for Computing Machinery
Location
Singapore
ISBN
978-90-365-2697-5
Book
The 31st Annual International ACM SIGIR Conference 20-24 July 2008, Singapore
Pages from
28
Pages to
33
Pages count
8
BibTex
@inproceedings{BUT32585, author="Joel {Pinto} and Igor {Szőke} and S.R.M. {Prasanna} and Hynek {Heřmanský}", title="Fast Approximate Spoken Term Detection from Sequence of Phonemes", booktitle="The 31st Annual International ACM SIGIR Conference 20-24 July 2008, Singapore", year="2008", pages="28--33", publisher="Association for Computing Machinery", address="Singapore", isbn="978-90-365-2697-5" }