Detail publikace

Robust Speech Recognition in Unknown Reverberant and Noisy Conditions

HSIAO, R. MA, J. HARTMANN, W. KARAFIÁT, M. GRÉZL, F. BURGET, L. SZŐKE, I. ČERNOCKÝ, J. WATANABE, S. CHEN, Z. MALLIDI, S. HEŘMANSKÝ, H. TSAKALIDIS, S. SCHWARTZ, R.

Originální název

Robust Speech Recognition in Unknown Reverberant and Noisy Conditions

Typ

článek ve sborníku ve WoS nebo Scopus

Jazyk

angličtina

Originální abstrakt

In this paper, we describe our work on the ASpIRE (Automatic Speech recognition In Reverberant Environments) challenge, which aims to assess the robustness of automatic speech recognition (ASR) systems. The main characteristic of the challenge is developing a high-performance system without access to matched training and development data. While the evaluation data are recorded with far-field microphones in noisy and reverberant rooms, the training data are telephone speech and close talking. Our approach to this challenge includes speech enhancement, neural network methods and acoustic model adaptation, We show that these techniques can successfully alleviate the performance degradation due to noisy audio and data mismatch.

Klíčová slova

ASpIRE challenge, robust speech recognition

Autoři

HSIAO, R.; MA, J.; HARTMANN, W.; KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J.; WATANABE, S.; CHEN, Z.; MALLIDI, S.; HEŘMANSKÝ, H.; TSAKALIDIS, S.; SCHWARTZ, R.

Rok RIV

2015

Vydáno

13. 12. 2015

Nakladatel

IEEE Signal Processing Society

Místo

Scottsdale, Arizona

ISBN

978-1-4799-7291-3

Kniha

Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop

Strany od

533

Strany do

538

Strany počet

6

URL

BibTex

@inproceedings{BUT120392,
  author="Roger {Hsiao} and Jeff {Ma} and William {Hartmann} and Martin {Karafiát} and František {Grézl} and Lukáš {Burget} and Igor {Szőke} and Jan {Černocký} and Shinji {Watanabe} and Zhuo {Chen} and Sri Harish {Mallidi} and Hynek {Heřmanský} and Stavros {Tsakalidis} and Richard {Schwartz}",
  title="Robust Speech Recognition in Unknown Reverberant and Noisy Conditions",
  booktitle="Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop",
  year="2015",
  pages="533--538",
  publisher="IEEE Signal Processing Society",
  address="Scottsdale, Arizona",
  doi="10.1109/ASRU.2015.7404841",
  isbn="978-1-4799-7291-3",
  url="https://www.fit.vut.cz/research/publication/11067/"
}

Dokumenty