Project Detail

Funding resources

Technology Agency of the Czech Republic (TA ČR) - ALFA Programme of Applied Research and Experimental Development

On the project

The goal of this project is research and development in processing audio from meetings between people (meetings, team briefings, customer negotiations, etc.), leading to a prototype of an intelligent meeting assistant that helps during a meeting (on-line), processes the meeting recording afterwards (off-line), and supports the subsequent systematic storage and sharing of meeting records.

Keywords

meeting; human-human communication; speech recognition; machine translation; data mining; summarization; weak signal detection; meeting assistant

Project identifier

TA04011311

Default language

Czech

People responsible

Černocký Jan, prof. Dr. Ing. - principal person responsible
Burget Lukáš, doc. Ing., Ph.D. - fellow researcher
Novotný Ondřej, Ing., Ph.D. - fellow researcher
Silnova Anna, M.Sc., Ph.D. - fellow researcher
Skácel Miroslav, Ing. - fellow researcher
Ščavnický Tomáš, Bc. - fellow researcher

Units

Department of Computer Graphics and Multimedia
- responsible department (10.12.2013 - not assigned)
Speech Data Mining Research Group BUT Speech@FIT
- internal (10.12.2013 - 31.12.2017)
Department of Computer Graphics and Multimedia
- beneficiary (10.12.2013 - 31.12.2017)

Results

POPKOVÁ, A.; POVOLNÝ, F.; MATĚJKA, P.; GLEMBEK, O.; GRÉZL, F.; ČERNOCKÝ, J. Investigation of Bottle-Neck Features for Emotion Recognition. In 19th International Conference, TSD 2016, Brno, Czech Republic, September 12-16, 2016, Proceedings. Lecture Notes in Computer Science, Lecture Notes in Artificial Intelligence. Brno: International Speech Communication Association, 2016. p. 426-434. ISSN: 0302-9743.

VESELÝ, K.; BURGET, L.; ČERNOCKÝ, J. Semi-supervised DNN training with word selection for ASR. In Proceedings of Interspeech 2017. Proceedings of Interspeech. Stockholm: International Speech Communication Association, 2017. p. 3687-3691. ISSN: 1990-9772.

BENEŠ, K.; BASKAR, M.; BURGET, L. Residual Memory Networks in Language Modeling: Improving the Reputation of Feed-Forward Networks. In Proceedings of Interspeech 2017. Proceedings of Interspeech. Stockholm: International Speech Communication Association, 2017. p. 284-288. ISSN: 1990-9772.

KARAFIÁT, M.; BASKAR, M.; MATĚJKA, P.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. 2016 BUT Babel system: Multilingual BLSTM acoustic model with i-vector based adaptation. In Proceedings of Interspeech 2017. Proceedings of Interspeech. Stockholm: International Speech Communication Association, 2017. p. 719-723. ISSN: 1990-9772.

GRÉZL, F.; KARAFIÁT, M. Boosting Performance on Low-resource Languages by Standard Corpora: An Analysis. In Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016. p. 629-636. ISBN: 978-1-5090-4903-5.

KARAFIÁT, M.; BASKAR, M.; MATĚJKA, P.; VESELÝ, K.; GRÉZL, F.; ČERNOCKÝ, J. Multilingual BLSTM and Speaker-Specific Vector Adaptation in 2016 BUT Babel System. In Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016. p. 637-643. ISBN: 978-1-5090-4903-5.

ŽMOLÍKOVÁ, K.; KARAFIÁT, M.; VESELÝ, K.; DELCROIX, M.; WATANABE, S.; BURGET, L.; ČERNOCKÝ, J. Data selection by sequence summarizing neural network in mismatch condition training. In Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016. p. 2354-2358. ISBN: 978-1-5108-3313-5.

ONDEL YANG, L.; BURGET, L.; ČERNOCKÝ, J. Variational Inference for Acoustic Unit Discovery. In Procedia Computer Science. Yogyakarta: Elsevier Science, 2016. p. 80-86. ISSN: 1877-0509.

EGOROVA, E.; SERRANO, J. Semi-Supervised Training of Language Model on Spanish Conversational Telephone Speech Data. In Procedia Computer Science. Yogyakarta: Elsevier Science, 2016. p. 114-120. ISSN: 1877-0509.

GRÉZL, F.; KARAFIÁT, M. Bottle-Neck Feature Extraction Structures for Multilingual Training and Porting. In Procedia Computer Science. Yogyakarta: Elsevier Science, 2016. p. 144-151. ISSN: 1877-0509.

KARAFIÁT, M.; BURGET, L.; GRÉZL, F.; VESELÝ, K.; ČERNOCKÝ, J. Multilingual Region-Dependent Transforms. In Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016). Shanghai: IEEE Signal Processing Society, 2016. p. 5430-5434. ISBN: 978-1-4799-9988-0.

VESELÝ, K.; WATANABE, S.; ŽMOLÍKOVÁ, K.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. Sequence Summarizing Neural Network for Speaker Adaptation. In Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016). Shanghai: IEEE Signal Processing Society, 2016. p. 5315-5319. ISBN: 978-1-4799-9988-0.

GRÉZL, F.; EGOROVA, E.; KARAFIÁT, M. Study of Large Data Resources for Multilingual Training and System Porting. In Procedia Computer Science. Yogyakarta: Elsevier Science, 2016. p. 15-22. ISSN: 1877-0509.

SKÁCEL, M.; SZŐKE, I. BUT QUESST 2015 System Description. In CEUR Workshop Proceedings. Wurzen: CEUR-WS.org, 2015. p. 1-3. ISSN: 1613-0073.

KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J. Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. p. 2454-2458. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772.

PEŠÁN, J.; BURGET, L.; HEŘMANSKÝ, H.; VESELÝ, K. DNN derived filters for processing of modulation spectrum of speech. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. p. 1908-1911. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772.

HEŘMANSKÝ, H.; BURGET, L.; COHEN, J.; DUPOUX, E.; FELDMAN, N.; GODFREY, J.; KHUDANPUR, S.; MACIEJEWSKI, M.; MALLIDI, S.; MENON, A.; OGAWA, T.; PEDDINTI, V.; ROSE, R.; STERN, R.; WIESNER, M.; VESELÝ, K. Towards Machines That Know When They Do Not Know: Summary of Work Done at 2014 Frederick Jelinek Memorial Workshop. In Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015. p. 5009-5013. ISBN: 978-1-4673-6997-8.

MOTLÍČEK, P.; DEY, S.; MADIKERI, S.; BURGET, L. Employment of Subspace Gaussian Mixture Models in Speaker Recognition. In Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015. p. 4445-4449. ISBN: 978-1-4673-6997-8.

Responsibility: Černocký Jan, prof. Dr. Ing.

Meeting assistant (MINT)