Publication detail

The Kaldi Speech Recognition Toolkit

POVEY, D. GHOSHAL, A. BOULIANNE, G. BURGET, L. GLEMBEK, O. GOEL, N. HANNEMANN, M. MOTLÍČEK, P. QIAN, Y. SCHWARZ, P. SILOVSKÝ, J. STEMMER, G. VESELÝ, K.

Original Title

The Kaldi Speech Recognition Toolkit

Type

article in a collection out of WoS and Scopus

Language

English

Original Abstract

We described the design of Kaldi, a free and open-sourcespeech recognition toolkit. The toolkit currently supports modellingof context-dependent phones of arbitrary context lengths,and all commonly used techniques that can be estimated usingmaximum likelihood. It also supports the recently proposedSGMMs. Development of Kaldi is continuing and we areworking on using large language models in the FST framework,lattice generation and discriminative training.

Keywords

speech recognition, toolkit

Authors

POVEY, D.; GHOSHAL, A.; BOULIANNE, G.; BURGET, L.; GLEMBEK, O.; GOEL, N.; HANNEMANN, M.; MOTLÍČEK, P.; QIAN, Y.; SCHWARZ, P.; SILOVSKÝ, J.; STEMMER, G.; VESELÝ, K.

RIV year

2011

Released

11. 12. 2011

Publisher

IEEE Signal Processing Society

Location

Hilton Waikoloa Village Resort, Hawaii

ISBN

978-1-4673-0366-8

Book

Proceedings of ASRU 2011

Pages from

1

Pages to

4

Pages count

4

URL

BibTex

@inproceedings{BUT127200,
  author="Daniel {Povey} and Arnab {Ghoshal} and Gilles {Boulianne} and Lukáš {Burget} and Ondřej {Glembek} and Nagendra {Goel} and Mirko {Hannemann} and Petr {Motlíček} and Yanmin {Qian} and Petr {Schwarz} and Jan {Silovský} and Georg {Stemmer} and Karel {Veselý}",
  title="The Kaldi Speech Recognition Toolkit",
  booktitle="Proceedings of ASRU 2011",
  year="2011",
  pages="1--4",
  publisher="IEEE Signal Processing Society",
  address="Hilton Waikoloa Village Resort, Hawaii",
  isbn="978-1-4673-0366-8",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2011/povey_asru2011_Kaldi%20toolkit.pdf"
}