Publication detail
FAPŠO, M. SMRŽ, P. SCHWARZ, P. SZŐKE, I. SCHWARZ, M. ČERNOCKÝ, J. KARAFIÁT, M. BURGET, L.
Original Title
Information Retrieval from Spoken Documents
Type
conference paper
Language
English
Original Abstract
This paper describes a designed and implemented system for efficient storage, indexing and search in collections of spoken documents that takes advantage of automatic speech recognition. As the quality of current speech recognizers is not sufficient for many applications, it is necessary to index the ambiguous output of the recognition, i.e. the acyclic graphs of word hypotheses (recognition lattices). As a result, the standard methods known from text-based systems cannot be applied directly. The paper discusses an optimized indexing system, developed by our group, for efficient search in this large and complex data structure. The search engine works as a server. The meeting browser JFerret, developed within the European AMI project, is used as a client to browse search results.
Keywords
multimedia information retrieval, speech databases
Authors
FAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; SCHWARZ, M.; ČERNOCKÝ, J.; KARAFIÁT, M.; BURGET, L.
RIV year
2006
Released
29. 9. 2006
Publisher
Springer Verlag
Location
Mexico City
ISBN
3-540-32205-1
Book
Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2006)
Pages from
410
Pages to
416
Pages count
6
BibTeX
@inproceedings{BUT22168,
  author="Michal {Fapšo} and Pavel {Smrž} and Petr {Schwarz} and Igor {Szőke} and Milan {Schwarz} and Jan {Černocký} and Martin {Karafiát} and Lukáš {Burget}",
  title="Information Retrieval from Spoken Documents",
  booktitle="Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2006)",
  year="2006",
  pages="410--416",
  publisher="Springer Verlag",
  address="Mexico City",
  isbn="3-540-32205-1"
}