Publication detail

The Kaldi Speech Recognition Toolkit

POVEY, D. GHOSHAL, A. BOULIANNE, G. BURGET, L. GLEMBEK, O. GOEL, N. HANNEMANN, M. MOTLÍČEK, P. QIAN, Y. SCHWARZ, P. SILOVSKÝ, J. STEMMER, G. VESELÝ, K.

Original Title

The Kaldi Speech Recognition Toolkit

Type

article in a collection out of WoS and Scopus

Language

English

Original Abstract

We described the design of Kaldi, a free and open-source speech recognition toolkit. The toolkit currently supports modelling of context-dependent phones of arbitrary context lengths, and all commonly used techniques that can be estimated using maximum likelihood. It also supports the recently proposed SGMMs. Development of Kaldi is continuing and we are working on using large language models in the FST framework, lattice generation and discriminative training.

Keywords

speech recognition, toolkit

Authors

POVEY, D.; GHOSHAL, A.; BOULIANNE, G.; BURGET, L.; GLEMBEK, O.; GOEL, N.; HANNEMANN, M.; MOTLÍČEK, P.; QIAN, Y.; SCHWARZ, P.; SILOVSKÝ, J.; STEMMER, G.; VESELÝ, K.

RIV year

2011

Released

11. 12. 2011

Publisher

IEEE Signal Processing Society

Location

Hilton Waikoloa Village Resort, Hawaii

ISBN

978-1-4673-0366-8

Book

Proceedings of ASRU 2011

Pages from

1

Pages to

4

Pages count

4

URL

BibTex

@inproceedings{BUT127200,
  author="Daniel {Povey} and Arnab {Ghoshal} and Gilles {Boulianne} and Lukáš {Burget} and Ondřej {Glembek} and Nagendra {Goel} and Mirko {Hannemann} and Petr {Motlíček} and Yanmin {Qian} and Petr {Schwarz} and Jan {Silovský} and Georg {Stemmer} and Karel {Veselý}",
  title="The Kaldi Speech Recognition Toolkit",
  booktitle="Proceedings of ASRU 2011",
  year="2011",
  pages="1--4",
  publisher="IEEE Signal Processing Society",
  address="Hilton Waikoloa Village Resort, Hawaii",
  isbn="978-1-4673-0366-8",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2011/povey_asru2011_Kaldi%20toolkit.pdf"
}