Detail publikace

Bayesian joint-sequence models for grapheme-to-phoneme conversion

HANNEMANN, M. TRMAL, J. ONDEL YANG, L. KESIRAJU, S. BURGET, L.

Originální název

Typ

článek ve sborníku ve WoS nebo Scopus

Jazyk

angličtina

Originální abstrakt

We describe a fully Bayesian approach to grapheme-to-phonemeconversion based on the joint-sequence model (JSM). Usually, standardsmoothed n-gram language models (LM, e.g. Kneser-Ney)are used with JSMs to model graphone sequences (joint graphemephonemepairs). However, we take a Bayesian approach using ahierarchical Pitman-Yor-Process LM. This provides an elegant alternativeto using smoothing techniques to avoid over-training. Noheld-out sets and complex parameter tuning is necessary, and severalconvergence problems encountered in the discounted Expectation-Maximization (as used in the smoothed JSMs) are avoided. Everystep is modeled by weighted finite state transducers and implementedwith standard operations from the OpenFST toolkit. Weevaluate our model on a standard data set (CMUdict), where it givescomparable results to the previously reported smoothed JSMs interms of phoneme-error rate while requiring a much smaller training/testing time. Most importantly, our model can be used in aBayesian framework and for (partly) un-supervised training.

Klíčová slova

Bayesian approach, joint-sequence models,weighted finite state transducers, letter-to-sound, grapheme-tophoneme conversion, hierarchical Pitman-Yor-Process

Autoři

HANNEMANN, M.; TRMAL, J.; ONDEL YANG, L.; KESIRAJU, S.; BURGET, L.

Vydáno

5. 3. 2017

Nakladatel

IEEE Signal Processing Society

Místo

New Orleans

ISBN

978-1-5090-4117-6

Kniha

Proceedings of ICASSP 2017

Strany od

2836

Strany do

2840

Strany počet

URL

https://www.fit.vut.cz/research/publication/11469/

BibTex

@inproceedings{BUT144449,
  author="Mirko {Hannemann} and Jan {Trmal} and Lucas Antoine Francois {Ondel} and Santosh {Kesiraju} and Lukáš {Burget}",
  title="Bayesian joint-sequence models for grapheme-to-phoneme conversion",
  booktitle="Proceedings of ICASSP 2017",
  year="2017",
  pages="2836--2840",
  publisher="IEEE Signal Processing Society",
  address="New Orleans",
  doi="10.1109/ICASSP.2017.7952674",
  isbn="978-1-5090-4117-6",
  url="https://www.fit.vut.cz/research/publication/11469/"
}

Dokumenty

hannemann_icassp2017_0002836.pdf

VUT

Fakulty

Vysokoškolské ústavy

Součásti

Bayesian joint-sequence models for grapheme-to-phoneme conversion