Publication detail
HSIAO, R. MA, J. HARTMANN, W. KARAFIÁT, M. GRÉZL, F. BURGET, L. SZŐKE, I. ČERNOCKÝ, J. WATANABE, S. CHEN, Z. MALLIDI, S. HEŘMANSKÝ, H. TSAKALIDIS, S. SCHWARTZ, R.
Original title
Robust Speech Recognition in Unknown Reverberant and Noisy Conditions
Type
conference proceedings paper indexed in WoS or Scopus
Language
English
Original abstract
In this paper, we describe our work on the ASpIRE (Automatic Speech recognition In Reverberant Environments) challenge, which aims to assess the robustness of automatic speech recognition (ASR) systems. The main characteristic of the challenge is developing a high-performance system without access to matched training and development data. While the evaluation data are recorded with far-field microphones in noisy and reverberant rooms, the training data are telephone speech and close talking. Our approach to this challenge includes speech enhancement, neural network methods and acoustic model adaptation. We show that these techniques can successfully alleviate the performance degradation due to noisy audio and data mismatch.
Keywords
ASpIRE challenge, robust speech recognition
Authors
HSIAO, R.; MA, J.; HARTMANN, W.; KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J.; WATANABE, S.; CHEN, Z.; MALLIDI, S.; HEŘMANSKÝ, H.; TSAKALIDIS, S.; SCHWARTZ, R.
RIV year
2015
Published
13 December 2015
Publisher
IEEE Signal Processing Society
Location
Scottsdale, Arizona
ISBN
978-1-4799-7291-3
Book
Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop
Pages from
533
Pages to
538
Number of pages
6
URL
https://www.fit.vut.cz/research/publication/11067/
BibTeX
@inproceedings{BUT120392,
  author="Roger {Hsiao} and Jeff {Ma} and William {Hartmann} and Martin {Karafiát} and František {Grézl} and Lukáš {Burget} and Igor {Szőke} and Jan {Černocký} and Shinji {Watanabe} and Zhuo {Chen} and Sri Harish {Mallidi} and Hynek {Heřmanský} and Stavros {Tsakalidis} and Richard {Schwartz}",
  title="Robust Speech Recognition in Unknown Reverberant and Noisy Conditions",
  booktitle="Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop",
  year="2015",
  pages="533--538",
  publisher="IEEE Signal Processing Society",
  address="Scottsdale, Arizona",
  doi="10.1109/ASRU.2015.7404841",
  isbn="978-1-4799-7291-3",
  url="https://www.fit.vut.cz/research/publication/11067/"
}