Publication detail
BUT system for low resource Indian language ASR
PULUGUNDLA, B. BASKAR, M. KESIRAJU, S. EGOROVA, E. KARAFIÁT, M. BURGET, L. ČERNOCKÝ, J.
Original Title
BUT system for low resource Indian language ASR
Type
conference paper
Language
English
Original Abstract
This paper describes the BUT Jilebi teams speech recognitionsystems created for the 2018 low resource speech recognitionchallenge for Indian languages. We investigate modifications ofmultilingual time-delay neural network (TDNN) architectureswith transfer learning and compare them to bi-directionalresidual memory networks (BRMN) and bi-directional LSTM.Our best submission based on system combination achievedword error rates of 13.92% (Tamil), 14.71% (Telugu) and14.06% (Gujarati). We present the details of submitted systemsand also the post-evaluation analysis done for lexicon discoveryusing unsupervised word segmentation.
Keywords
Indian languages, low resource ASR, multilingual, LF-MMI
Authors
PULUGUNDLA, B.; BASKAR, M.; KESIRAJU, S.; EGOROVA, E.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J.
Released
2. 9. 2018
Publisher
International Speech Communication Association
Location
Hyderabad
ISBN
1990-9772
Periodical
Proceedings of Interspeech
Year of study
2018
Number
9
State
French Republic
Pages from
3182
Pages to
3186
Pages count
5
URL
BibTex
@inproceedings{BUT155101,
author="Bhargav {Pulugundla} and Murali Karthick {Baskar} and Santosh {Kesiraju} and Ekaterina {Egorova} and Martin {Karafiát} and Lukáš {Burget} and Jan {Černocký}",
title="BUT system for low resource Indian language ASR",
booktitle="Proceedings of Interspeech 2018",
year="2018",
journal="Proceedings of Interspeech",
volume="2018",
number="9",
pages="3182--3186",
publisher="International Speech Communication Association",
address="Hyderabad",
doi="10.21437/Interspeech.2018-1302",
issn="1990-9772",
url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1302.html"
}
Documents