Project detail
Project period: 01.01.2018 - 31.12.2019
Funding sources
Technology Agency of the Czech Republic (Technologická agentura ČR) - ZÉTA programme for the support of applied research
- partial funding (2018-01-01 - 2019-12-31)
About the project
The project deals with neural networks for signal processing and speech data mining.
Keywords: neural networks
Project code
TJ01000208
Original language
Czech
Investigators
Žmolíková Kateřina, Ing., Ph.D. - principal investigator
Units
Department of Computer Graphics and Multimedia (Ústav počítačové grafiky a multimédií) - beneficiary (02.05.2017 - 31.12.2019)
Phonexia - co-beneficiary (02.05.2017 - 31.12.2019)
Results
KARAFIÁT, M.; BASKAR, M.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. Analysis of Multilingual BLSTM Acoustic Model on Low and High Resource Languages. In Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018. p. 5789-5793. ISBN: 978-1-5386-4658-8.
ALAM, J.; BHATTACHARYA, G.; BRUMMER, J.; BURGET, L.; DIEZ SÁNCHEZ, M.; GLEMBEK, O.; KENNY, P.; KLČO, M.; LANDINI, F.; LOZANO DÍEZ, A.; MATĚJKA, P.; MONTEIRO, J.; MOŠNER, L.; NOVOTNÝ, O.; PLCHOT, O.; PROFANT, J.; ROHDIN, J.; SILNOVA, A.; SLAVÍČEK, J.; STAFYLAKIS, T.; ZEINALI, H. ABC NIST SRE 2018 SYSTEM DESCRIPTION. Proceedings of 2018 NIST SRE Workshop. Athens: National Institute of Standards and Technology, 2018. p. 1-10.
ROHDIN, J.; SILNOVA, A.; DIEZ SÁNCHEZ, M.; PLCHOT, O.; MATĚJKA, P.; BURGET, L. End-to-End DNN Based Speaker Recognition Inspired by i-Vector and PLDA. In Proceedings of ICASSP. Calgary: IEEE Signal Processing Society, 2018. p. 4874-4878. ISBN: 978-1-5386-4658-8.
EGOROVA, E.; BURGET, L. Out-of-Vocabulary Word Recovery Using FST-Based Subword Unit Clustering in a Hybrid ASR System. In Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018. p. 5919-5923. ISBN: 978-1-5386-4658-8.
SILNOVA, A.; MATĚJKA, P.; GLEMBEK, O.; PLCHOT, O.; NOVOTNÝ, O.; GRÉZL, F.; SCHWARZ, P.; ČERNOCKÝ, J. BUT/Phonexia Bottleneck Feature Extractor. In Proceedings of Odyssey 2018. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Les Sables d'Olonne: International Speech Communication Association, 2018. p. 283-287. ISSN: 2312-2846.
BRUMMER, J.; SILNOVA, A.; BURGET, L.; STAFYLAKIS, T. Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model. In Proceedings of Odyssey 2018. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Les Sables d'Olonne: International Speech Communication Association, 2018. p. 349-356. ISSN: 2312-2846.
SILNOVA, A.; BRUMMER, J.; GARCÍA-ROMERO, D.; SNYDER, D.; BURGET, L. Fast variational Bayes for heavy-tailed PLDA applied to i-vectors and x-vectors. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 72-76. ISSN: 1990-9772.
KARAFIÁT, M.; BASKAR, M.; SZŐKE, I.; MALENOVSKÝ, V.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. BUT OpenSAT 2017 speech recognition system. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 2638-2642. ISSN: 1990-9772.
DIEZ SÁNCHEZ, M.; LANDINI, F.; BURGET, L.; ROHDIN, J.; SILNOVA, A.; ŽMOLÍKOVÁ, K.; NOVOTNÝ, O.; VESELÝ, K.; GLEMBEK, O.; PLCHOT, O.; MOŠNER, L.; MATĚJKA, P. BUT system for DIHARD Speech Diarization Challenge 2018. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 2798-2802. ISSN: 1990-9772.
PULUGUNDLA, B.; BASKAR, M.; KESIRAJU, S.; EGOROVA, E.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. BUT system for low resource Indian language ASR. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 3182-3186. ISSN: 1990-9772.
BENEŠ, K.; KESIRAJU, S.; BURGET, L. i-vectors in language modeling: An efficient way of domain adaptation for feed-forward models. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 3383-3387. ISSN: 1990-9772.
VESELÝ, K.; PERALES, C.; SZŐKE, I.; LUQUE, J.; ČERNOCKÝ, J. Lightly supervised vs. semi-supervised training of acoustic model on Luxembourgish for low-resource automatic speech recognition. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 2883-2887. ISSN: 1990-9772.
ROHDIN, J.; SILNOVA, A.; DIEZ SÁNCHEZ, M.; PLCHOT, O.; MATĚJKA, P.; BURGET, L.; GLEMBEK, O. End-to-end DNN based text-independent speaker recognition for long and short utterances. COMPUTER SPEECH AND LANGUAGE, 2020, vol. 2020, no. 59, p. 22-35. ISSN: 0885-2308.
ŽMOLÍKOVÁ, K.; DELCROIX, M.; KINOSHITA, K.; OCHIAI, T.; NAKATANI, T.; BURGET, L.; ČERNOCKÝ, J. SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures. IEEE J-STSP, 2019, vol. 13, no. 4, p. 800-814. ISSN: 1932-4553.
MATĚJKA, P.; PLCHOT, O.; ZEINALI, H.; MOŠNER, L.; SILNOVA, A.; BURGET, L.; NOVOTNÝ, O.; GLEMBEK, O. Analysis of BUT Submission in Far-Field Scenarios of VOiCES 2019 Challenge. In Proceedings of Interspeech. Proceedings of Interspeech. Graz: International Speech Communication Association, 2019. p. 2448-2452. ISSN: 1990-9772.
DELCROIX, M.; ŽMOLÍKOVÁ, K.; OCHIAI, T.; KINOSHITA, K.; ARAKI, S.; NAKATANI, T. Compact Network for Speakerbeam Target Speaker Extraction. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019. p. 6965-6969. ISBN: 978-1-5386-4658-8.
ALAM, J.; BOULIANNE, G.; GLEMBEK, O.; LOZANO DÍEZ, A.; MATĚJKA, P.; MIZERA, P.; MONTEIRO, J.; MOŠNER, L.; NOVOTNÝ, O.; PLCHOT, O.; ROHDIN, J.; SILNOVA, A.; SLAVÍČEK, J.; STAFYLAKIS, T.; WANG, S.; ZEINALI, H. ABC NIST SRE 2019 CTS System Description. Proceedings of NIST. Sentosa, Singapore: National Institute of Standards and Technology, 2019. p. 1-6.
MATĚJKA, P.; PLCHOT, O.; GLEMBEK, O.; BURGET, L.; ROHDIN, J.; ZEINALI, H.; MOŠNER, L.; SILNOVA, A.; NOVOTNÝ, O.; DIEZ SÁNCHEZ, M.; ČERNOCKÝ, J. 13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE. COMPUTER SPEECH AND LANGUAGE, 2020, vol. 2020, no. 63, p. 1-15. ISSN: 0885-2308.
ŽMOLÍKOVÁ, K.; DELCROIX, M.; KINOSHITA, K.; HIGUCHI, T.; NAKATANI, T.; ČERNOCKÝ, J. Optimization of Speaker-aware Multichannel Speech Extraction with ASR Criterion. In Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018. p. 6702-6706. ISBN: 978-1-5386-4658-8.