Přístupnostní navigace
E-přihláška
Vyhledávání Vyhledat Zavřít
Detail publikace
HAIN, T. BURGET, L. DINES, J. GARAU, G. KARAFIÁT, M. LINCOLN, M. MCCOWAN, I. MOORE, D. WAN, V. ORDELMAN, R. RENALS, S.
Originální název
The 2005 AMI System for the Transcription of Speech in Meetings
Typ
článek ve sborníku ve WoS nebo Scopus
Jazyk
angličtina
Originální abstrakt
In this paper we describe the 2005 AMI system for the transcription of speech in meetings used for participation in the 2005 NIST RT evaluations. The system was designed for participation in the speech to text part of the evaluations, in particular for transcription of speech recorded with multiple distant microphones and independent headset microphones. System performance was tested on both conference room and lecture style meetings. Although input sources are processed using different front-ends, the recognition process is based on a unified system architecture. The system operates in multiple passes and makes use of state of the art technologies such as discriminative training, vocal tract length normalisation, heteroscedastic linear discriminant analysis,speaker adaptation with maximum likelihood linear regression and minimum word error rate decoding. In this paper we describe the system performance on the official development and test sets for the NIST RT05sevaluations. The system was jointly developed in less than 10 months by a multi-site team and was shown to achieve very competitive performance
Klíčová slova
NIST, speech recognition, AMI system
Autoři
HAIN, T.; BURGET, L.; DINES, J.; GARAU, G.; KARAFIÁT, M.; LINCOLN, M.; MCCOWAN, I.; MOORE, D.; WAN, V.; ORDELMAN, R.; RENALS, S.
Rok RIV
2005
Vydáno
13. 7. 2005
Nakladatel
University of Edinburgh
Místo
Edinburgh
ISBN
978-3-540-32549-9
Kniha
Machine Learning for Multimodal Interaction, Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers
Edice
Lecture Notes in Computer Science Volume 3869, Springer 2006
Strany od
450
Strany do
462
Strany počet
12
URL
http://www.fit.vutbr.cz/~karafiat/publi/2005/hain-nist-2005.pdf
BibTex
@inproceedings{BUT18267, author="Thomas {Hain} and Lukáš {Burget} and John {Dines} and Giulia {Garau} and Martin {Karafiát} and Mike {Lincoln} and Iain {McCowan} and Darren {Moore} and Vincent {Wan} and Roeland {Ordelman} and Steve {Renals}", title="The 2005 AMI System for the Transcription of Speech in Meetings", booktitle="Machine Learning for Multimodal Interaction, Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers", year="2005", series="Lecture Notes in Computer Science Volume 3869, Springer 2006", pages="450--462", publisher="University of Edinburgh", address="Edinburgh", isbn="978-3-540-32549-9", url="http://www.fit.vutbr.cz/~karafiat/publi/2005/hain-nist-2005.pdf" }