Publication detail

Multilingual acoustic modeling for speech recognition based on Subspace Gaussian Mixture Models

BURGET, L.; SCHWARZ, P.; AGARWAL, M.; AKYAZI, P.; FENG, K.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; POVEY, D.; RASTROW, A.; ROSE, R.; THOMAS, S.

Original Title

Multilingual acoustic modeling for speech recognition based on Subspace Gaussian Mixture Models

Type

paper in proceedings outside WoS and Scopus

Language

English

Original Abstract

This paper presents a different approach to multilingual speech recognition, in which the phone sets of the languages are entirely distinct, but the model has parameters that are not tied to specific states and are shared across languages.

Keywords

Large vocabulary speech recognition, Subspace Gaussian mixture model, Multilingual acoustic modeling

Authors

BURGET, L.; SCHWARZ, P.; AGARWAL, M.; AKYAZI, P.; FENG, K.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; POVEY, D.; RASTROW, A.; ROSE, R.; THOMAS, S.

RIV year

2010

Released

1. 4. 2010

Publisher

IEEE Signal Processing Society

Location

Dallas

ISBN

978-1-4244-4296-6

Book

Proc. International Conference on Acoustics, Speech, and Signal Processing

ISSN

1520-6149

Periodical

Proc. International Conference on Acoustics, Speech, and Signal Processing

Volume

2010

Number

3

Country

United States of America

Pages from

4334

Pages to

4337

Pages count

4

URL

http://www.fit.vutbr.cz/research/groups/speech/publi/2010/burget_icassp2010_4334.pdf

BibTeX

@inproceedings{BUT37044,
  author="Lukáš {Burget} and Petr {Schwarz} and Mohit {Agarwal} and Pinar {Akyazi} and Kai {Feng} and Arnab {Ghoshal} and Ondřej {Glembek} and Nagendra {Goel} and Martin {Karafiát} and Daniel {Povey} and Ariya {Rastrow} and Richard {Rose} and Samuel {Thomas}",
  title="Multilingual acoustic modeling for speech recognition based on Subspace Gaussian Mixture Models",
  booktitle="Proc. International Conference on Acoustics, Speech, and Signal Processing",
  year="2010",
  journal="Proc. International Conference on Acoustics, Speech, and Signal Processing",
  volume="2010",
  number="3",
  pages="4334--4337",
  publisher="IEEE Signal Processing Society",
  address="Dallas",
  isbn="978-1-4244-4296-6",
  issn="1520-6149",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2010/burget_icassp2010_4334.pdf"
}