prof. Ing.

Hynek Heřmanský

FIT, DCGM – Researcher

+420 54114 1326
hermansky@fit.vut.cz

Send BUT message

prof. Ing. Hynek Heřmanský

Publications

  • 2022

    ŠŮSTEK, M.; SADHU, S.; HEŘMANSKÝ, H. Dealing with Unknowns in Continual Learning for End-to-end Automatic Speech Recognition. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Incheon: International Speech Communication Association, 2022. p. 1046-1050. ISSN: 1990-9772.
    Detail | WWW

  • 2019

    YANG, J.; ONDEL YANG, L.; MANOHAR, V.; HEŘMANSKÝ, H. Towards Automatic Methods to Detect Errors in Transcriptions of Speech Recordings. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019. p. 3747-3751. ISBN: 978-1-5386-4658-8.
    Detail | WWW

    ONDEL YANG, L.; LI, R.; SELL, G.; HEŘMANSKÝ, H. Deriving Spectro-temporal Properties of Hearing from Speech Data. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019. p. 411-415. ISBN: 978-1-5386-4658-8.
    Detail | WWW

  • 2015

    HSIAO, R.; MA, J.; HARTMANN, W.; KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J.; WATANABE, S.; CHEN, Z.; MALLIDI, S.; HEŘMANSKÝ, H.; TSAKALIDIS, S.; SCHWARTZ, R. Robust Speech Recognition in Unknown Reverberant and Noisy Conditions. In Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop. Scottsdale, Arizona: IEEE Signal Processing Society, 2015. p. 533-538. ISBN: 978-1-4799-7291-3.
    Detail | WWW

    MALLIDI, S.; OGAWA, T.; VESELÝ, K.; NIDADAVOLU, P.; HEŘMANSKÝ, H. Autoencoder based multi-stream combination for noise robust speech recognition. In Proceeding of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. p. 3551-3555. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772.
    Detail | WWW

    PEŠÁN, J.; BURGET, L.; HEŘMANSKÝ, H.; VESELÝ, K. DNN derived filters for processing of modulation spectrum of speech. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. p. 1908-1911. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772.
    Detail | WWW

    HEŘMANSKÝ, H.; BURGET, L.; COHEN, J.; DUPOUX, E.; FELDMAN, N.; GODFREY, J.; KHUDANPUR, S.; MACIEJEWSKI, M.; MALLIDI, S.; MENON, A.; OGAWA, T.; PEDDINTI, V.; ROSE, R.; STERN, R.; WIESNER, M.; VESELÝ, K. Towards Machines That Know When They Do Not Know: Summary of Work Done at 2014 FREDERICK JELINEK MEMORIAL WORKSHOP. In Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015. p. 5009-5013. ISBN: 978-1-4673-6997-8.
    Detail | WWW

  • 2013

    PLCHOT, O.; MATSOUKAS, S.; MATĚJKA, P.; DEHAK, N.; MA, J.; CUMANI, S.; GLEMBEK, O.; HEŘMANSKÝ, H.; MESGARANI, N.; SOUFIFAR, M.; THOMAS, S.; ZHANG, B.; ZHOU, X. Developing A Speaker Identification System For The DARPA RATS Project. Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013. p. 6768-6772. ISBN: 978-1-4799-0355-9.
    Detail | WWW

  • 2010

    KOMBRINK, S.; HANNEMANN, M.; BURGET, L.; HEŘMANSKÝ, H. Recovery of Rare Words in Lecture Speech. Proc. Text, Speech and Dialogue 2010. Lecture Notes in Computer Science. Brno: Springer Verlag, 2010. p. 330-337. ISBN: 978-3-642-15759-2. ISSN: 0302-9743.
    Detail | WWW

  • 2009

    KOMBRINK, S.; BURGET, L.; MATĚJKA, P.; KARAFIÁT, M.; HEŘMANSKÝ, H. Posterior-based Out of Vocabulary Word Detection in Telephone Speech. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. p. 80-83. ISSN: 1990-9772.
    Detail | WWW

  • 2008

    BURGET, L.; SCHWARZ, P.; MATĚJKA, P.; HANNEMANN, M.; RASTROW, A.; WHITE, C.; KHUDANPUR, S.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008. p. 1-4. ISBN: 1-4244-1484-9.
    Detail | WWW

    PINTO, J.; SZŐKE, I.; PRASANNA, S.; HEŘMANSKÝ, H. Fast Approximate Spoken Term Detection from Sequence of Phonemes. The 31st Annual International ACM SIGIR Conference 20-24 July 2008, Singapore. Singapore: Association for Computing Machinery, 2008. p. 28-33. ISBN: 978-90-365-2697-5.
    Detail

    WHITE, C.; ZWEIG, G.; BURGET, L.; SCHWARZ, P.; HEŘMANSKÝ, H. Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments. Proc. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas: IEEE Signal Processing Society, 2008. p. 1-4. ISBN: 1-4244-1484-9.
    Detail | WWW

  • 2007

    HEŘMANSKÝ, H.; BURGET, L.; SCHWARZ, P.; MATĚJKA, P.; HANNEMANN, M.; RASTROW, A.; WHITE, C.; KHUDANPUR, S.; ČERNOCKÝ, J. Recovery from Model Inconsistency in Multilingual Speech Recognition. Baltimore: Johns Hopkins University, 2007. p. 0-0.
    Detail

  • 2004

    FOUSEK, P., SVOJANOVSKÝ, P., GRÉZL, F., HEŘMANSKÝ, H. New Nonsense Syllables Database - Analyses and Preliminary ASR Experiments. In Proc. 8th International Conference on Spoken Language Processing. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004. p. 348-351. ISSN: 1225- 4111.
    Detail

  • 2003

    MATĚJKA, P.; SCHWARZ, P.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Phoneme Recognition using Temporal Patterns. Proc. 6th International Conference Text, Speech and Dialogue, TSD2003. Ceske Budejovice: Springer Verlag, 2003. p. 465-472. ISBN: 3-540-20024-X.
    Detail

    MATĚJKA, P., SCHWARZ, P., ČERNOCKÝ, J., HEŘMANSKÝ, H. Phoneme Recognition using Temporal Patterns. In In Proceedings of the conference TSD'2003. International Conference on Text Speech and Dialogue, TSD 2003. 2003. p. 198 ( p.)ISBN: 3-540-20024- X.
    Detail

    GRÉZL, F.; HEŘMANSKÝ, H. Local averaging and differentiating of spectral plane for TRAP-based ASR. Proc. EUROSPEECH 2003. European Conference EUROSPEECH. Geneva: Institute for Perceptual Artificial Intelligence, 2003. p. 0-0. ISSN: 1018-4074.
    Detail | WWW

    SCHWARZ, P., HEŘMANSKÝ, H., MATĚJKA, P. Použití časové dynamiky k rozpoznávání jazyků z mluvené řeči. In Proceedings of Language Recognition Workshop 2003. NIST Gaithersburg, MD USA: 2003. p. 56-62.
    Detail

  • 2002

    GARUDADRI, H.; HEŘMANSKÝ, H.; MORGAN, N.; BENITEZ, C.; BURGET, L.; KAJAREKAR, S.; GRÉZL, F.; JAIN, P.; MOTLÍČEK, P. Distributed Voice Recognition System Utilizing Multistream Network Feature Processing. San Diego: Qualcomm, 2002. p. 0-0.
    Detail

    BURGET, L.; DUPONT, S.; GARUDADRI, H.; GRÉZL, F.; HEŘMANSKÝ, H.; JAIN, P.; KAJAREKAR, S.; MORGAN, N. QUALCOMM-ICSI-OGI Features for ASR. Proc. 7th International Conference on Spoken Language Processing. Denver: International Speech Communication Association, 2002. p. 4-7. ISBN: 1-876346-42-6.
    Detail | WWW

*) Publications are generated once a 24 hours.