Detail publikace

BUT OpenSAT 2017 speech recognition system

KARAFIÁT, M. BASKAR, M. SZŐKE, I. MALENOVSKÝ, V. VESELÝ, K. GRÉZL, F. BURGET, L. ČERNOCKÝ, J.

Originální název

BUT OpenSAT 2017 speech recognition system

Typ

článek ve sborníku ve WoS nebo Scopus

Jazyk

angličtina

Originální abstrakt

(ASR) systems for two domains in OpenSAT evaluations: LowResourced Languages and Public Safety Communications. Thefirst was challenging due to lack of training data, therefore multilingualapproaches for BLSTM training were employed andrecently published Residual Memory Networks requiring lesstraining data were used. Combination of both approaches led tosuperior performance. The second domain was challenging dueto recording in extreme conditions: specific channel, speakerunder stress, high levels of noise. A data augmentation processwas very important to get reasonably good performance.

Klíčová slova

speech recognition, multilingual training, BLSTM, data augmentation, robustness

Autoři

KARAFIÁT, M.; BASKAR, M.; SZŐKE, I.; MALENOVSKÝ, V.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J.

Vydáno

2. 9. 2018

Nakladatel

International Speech Communication Association

Místo

Hyderabad

ISSN

1990-9772

Periodikum

Proceedings of Interspeech

Ročník

2018

Číslo

9

Stát

Francouzská republika

Strany od

2638

Strany do

2642

Strany počet

5

URL

BibTex

@inproceedings{BUT155099,
  author="Martin {Karafiát} and Murali Karthick {Baskar} and Igor {Szőke} and Vladimír {Malenovský} and Karel {Veselý} and František {Grézl} and Lukáš {Burget} and Jan {Černocký}",
  title="BUT OpenSAT 2017 speech recognition system",
  booktitle="Proceedings of Interspeech 2018",
  year="2018",
  journal="Proceedings of Interspeech",
  volume="2018",
  number="9",
  pages="2638--2642",
  publisher="International Speech Communication Association",
  address="Hyderabad",
  doi="10.21437/Interspeech.2018-2457",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/2457.html"
}

Dokumenty