Přístupnostní navigace
E-application
Search Search Close
Publication detail
POVEY, D. GHOSHAL, A. BOULIANNE, G. BURGET, L. GLEMBEK, O. GOEL, N. HANNEMANN, M. MOTLÍČEK, P. QIAN, Y. SCHWARZ, P. SILOVSKÝ, J. STEMMER, G. VESELÝ, K.
Original Title
The Kaldi Speech Recognition Toolkit
Type
article in a collection out of WoS and Scopus
Language
English
Original Abstract
We described the design of Kaldi, a free and open-source speech recognition toolkit. The toolkit currently supports modelling of context-dependent phones of arbitrary context lengths, and all commonly used techniques that can be estimated using maximum likelihood. It also supports the recently proposed SGMMs. Development of Kaldi is continuing and we are working on using large language models in the FST framework, lattice generation and discriminative training.
Keywords
speech recognition, toolkit
Authors
POVEY, D.; GHOSHAL, A.; BOULIANNE, G.; BURGET, L.; GLEMBEK, O.; GOEL, N.; HANNEMANN, M.; MOTLÍČEK, P.; QIAN, Y.; SCHWARZ, P.; SILOVSKÝ, J.; STEMMER, G.; VESELÝ, K.
RIV year
2011
Released
11. 12. 2011
Publisher
IEEE Signal Processing Society
Location
Hilton Waikoloa Village Resort, Hawaii
ISBN
978-1-4673-0366-8
Book
Proceedings of ASRU 2011
Pages from
1
Pages to
4
Pages count
URL
http://www.fit.vutbr.cz/research/groups/speech/publi/2011/povey_asru2011_Kaldi%20toolkit.pdf
BibTex
@inproceedings{BUT127200, author="Daniel {Povey} and Arnab {Ghoshal} and Gilles {Boulianne} and Lukáš {Burget} and Ondřej {Glembek} and Nagendra {Goel} and Mirko {Hannemann} and Petr {Motlíček} and Yanmin {Qian} and Petr {Schwarz} and Jan {Silovský} and Georg {Stemmer} and Karel {Veselý}", title="The Kaldi Speech Recognition Toolkit", booktitle="Proceedings of ASRU 2011", year="2011", pages="1--4", publisher="IEEE Signal Processing Society", address="Hilton Waikoloa Village Resort, Hawaii", isbn="978-1-4673-0366-8", url="http://www.fit.vutbr.cz/research/groups/speech/publi/2011/povey_asru2011_Kaldi%20toolkit.pdf" }