Detail publikace
Bayesian joint-sequence models for grapheme-to-phoneme conversion
HANNEMANN, M. TRMAL, J. ONDEL YANG, L. KESIRAJU, S. BURGET, L.
Originální název
Bayesian joint-sequence models for grapheme-to-phoneme conversion
Typ
článek ve sborníku ve WoS nebo Scopus
Jazyk
angličtina
Originální abstrakt
We describe a fully Bayesian approach to grapheme-to-phonemeconversion based on the joint-sequence model (JSM). Usually, standardsmoothed n-gram language models (LM, e.g. Kneser-Ney)are used with JSMs to model graphone sequences (joint graphemephonemepairs). However, we take a Bayesian approach using ahierarchical Pitman-Yor-Process LM. This provides an elegant alternativeto using smoothing techniques to avoid over-training. Noheld-out sets and complex parameter tuning is necessary, and severalconvergence problems encountered in the discounted Expectation-Maximization (as used in the smoothed JSMs) are avoided. Everystep is modeled by weighted finite state transducers and implementedwith standard operations from the OpenFST toolkit. Weevaluate our model on a standard data set (CMUdict), where it givescomparable results to the previously reported smoothed JSMs interms of phoneme-error rate while requiring a much smaller training/testing time. Most importantly, our model can be used in aBayesian framework and for (partly) un-supervised training.
Klíčová slova
Bayesian approach, joint-sequence models,weighted finite state transducers, letter-to-sound, grapheme-tophoneme conversion, hierarchical Pitman-Yor-Process
Autoři
HANNEMANN, M.; TRMAL, J.; ONDEL YANG, L.; KESIRAJU, S.; BURGET, L.
Vydáno
5. 3. 2017
Nakladatel
IEEE Signal Processing Society
Místo
New Orleans
ISBN
978-1-5090-4117-6
Kniha
Proceedings of ICASSP 2017
Strany od
2836
Strany do
2840
Strany počet
5
URL
BibTex
@inproceedings{BUT144449,
author="Mirko {Hannemann} and Jan {Trmal} and Lucas Antoine Francois {Ondel} and Santosh {Kesiraju} and Lukáš {Burget}",
title="Bayesian joint-sequence models for grapheme-to-phoneme conversion",
booktitle="Proceedings of ICASSP 2017",
year="2017",
pages="2836--2840",
publisher="IEEE Signal Processing Society",
address="New Orleans",
doi="10.1109/ICASSP.2017.7952674",
isbn="978-1-5090-4117-6",
url="https://www.fit.vut.cz/research/publication/11469/"
}
Dokumenty