Přístupnostní navigace
E-application
Search Search Close
Publication detail
ŽMOLÍKOVÁ, K. KARAFIÁT, M. VESELÝ, K. DELCROIX, M. WATANABE, S. BURGET, L. ČERNOCKÝ, J.
Original Title
Data selection by sequence summarizing neural network in mismatch condition training
Type
conference paper
Language
English
Original Abstract
Data augmentation is a simple and efficient technique to improve the robustness of a speech recognizer when deployed in mismatched training-test conditions. Our paper proposes a new approach for selecting data with respect to similarity of acoustic conditions. The similarity is computed based on a sequence summarizing neural network which extracts vectors containing acoustic summary (e.g. noise and reverberation characteristics) of an utterance. Several configurations of this network and different methods of selecting data using these "summary-vectors" were explored. The results are reported on a mismatched condition using AMI training set with the proposed data selection and CHiME3 test set.
Keywords
Automatic speech recognition, Data augmentation, Data selection, Mismatch training condition, Sequence summarization
Authors
ŽMOLÍKOVÁ, K.; KARAFIÁT, M.; VESELÝ, K.; DELCROIX, M.; WATANABE, S.; BURGET, L.; ČERNOCKÝ, J.
Released
8. 9. 2016
Publisher
International Speech Communication Association
Location
San Francisco
ISBN
978-1-5108-3313-5
Book
Proceedings of Interspeech 2016
Pages from
2354
Pages to
2358
Pages count
5
URL
https://www.semanticscholar.org/paper/Data-Selection-by-Sequence-Summarizing-Neural-Zmol%C3%ADkov%C3%A1-Karafi%C3%A1t/bc1832e8b8d4e5edf987e1562b578bd9aa5e18a9
BibTex
@inproceedings{BUT132600, author="Kateřina {Žmolíková} and Martin {Karafiát} and Karel {Veselý} and Marc {Delcroix} and Shinji {Watanabe} and Lukáš {Burget} and Jan {Černocký}", title="Data selection by sequence summarizing neural network in mismatch condition training", booktitle="Proceedings of Interspeech 2016", year="2016", pages="2354--2358", publisher="International Speech Communication Association", address="San Francisco", doi="10.21437/Interspeech.2016-741", isbn="978-1-5108-3313-5", url="https://www.semanticscholar.org/paper/Data-Selection-by-Sequence-Summarizing-Neural-Zmol%C3%ADkov%C3%A1-Karafi%C3%A1t/bc1832e8b8d4e5edf987e1562b578bd9aa5e18a9" }