prof. Ing.

Hynek Heřmanský

Dr. Eng.

FIT, UPGM – vědecký pracovník

+420 54114 1326
hermansky@fit.vut.cz

Odeslat VUT zprávu

prof. Ing. Hynek Heřmanský, Dr. Eng.

Publikace

  • 2022

    ŠŮSTEK, M.; SADHU, S.; HEŘMANSKÝ, H. Dealing with Unknowns in Continual Learning for End-to-end Automatic Speech Recognition. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Incheon: International Speech Communication Association, 2022. s. 1046-1050. ISSN: 1990-9772.
    Detail | WWW

  • 2019

    ONDEL YANG, L.; LI, R.; SELL, G.; HEŘMANSKÝ, H. Deriving Spectro-temporal Properties of Hearing from Speech Data. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019. s. 411-415. ISBN: 978-1-5386-4658-8.
    Detail | WWW

    YANG, J.; ONDEL YANG, L.; MANOHAR, V.; HEŘMANSKÝ, H. Towards Automatic Methods to Detect Errors in Transcriptions of Speech Recordings. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019. s. 3747-3751. ISBN: 978-1-5386-4658-8.
    Detail | WWW

  • 2015

    PEŠÁN, J.; BURGET, L.; HEŘMANSKÝ, H.; VESELÝ, K. DNN derived filters for processing of modulation spectrum of speech. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. s. 1908-1911. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772.
    Detail | WWW

    MALLIDI, S.; OGAWA, T.; VESELÝ, K.; NIDADAVOLU, P.; HEŘMANSKÝ, H. Autoencoder based multi-stream combination for noise robust speech recognition. In Proceeding of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. s. 3551-3555. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772.
    Detail | WWW

    HSIAO, R.; MA, J.; HARTMANN, W.; KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J.; WATANABE, S.; CHEN, Z.; MALLIDI, S.; HEŘMANSKÝ, H.; TSAKALIDIS, S.; SCHWARTZ, R. Robust Speech Recognition in Unknown Reverberant and Noisy Conditions. In Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop. Scottsdale, Arizona: IEEE Signal Processing Society, 2015. s. 533-538. ISBN: 978-1-4799-7291-3.
    Detail | WWW

    HEŘMANSKÝ, H.; BURGET, L.; COHEN, J.; DUPOUX, E.; FELDMAN, N.; GODFREY, J.; KHUDANPUR, S.; MACIEJEWSKI, M.; MALLIDI, S.; MENON, A.; OGAWA, T.; PEDDINTI, V.; ROSE, R.; STERN, R.; WIESNER, M.; VESELÝ, K. TOWARDS MACHINES THAT KNOW WHEN THEY DO NOT KNOW: SUMMARY OF WORK DONE AT 2014 FREDERICK JELINEK MEMORIAL WORKSHOP. In Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015. s. 5009-5013. ISBN: 978-1-4673-6997-8.
    Detail | WWW

  • 2013

    PLCHOT, O.; MATSOUKAS, S.; MATĚJKA, P.; DEHAK, N.; MA, J.; CUMANI, S.; GLEMBEK, O.; HEŘMANSKÝ, H.; MESGARANI, N.; SOUFIFAR, M.; THOMAS, S.; ZHANG, B.; ZHOU, X. Developing A Speaker Identification System For The DARPA RATS Project. Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013. s. 6768-6772. ISBN: 978-1-4799-0355-9.
    Detail | WWW

  • 2010

    KOMBRINK, S.; HANNEMANN, M.; BURGET, L.; HEŘMANSKÝ, H. Recovery of Rare Words in Lecture Speech. Proc. Text, Speech and Dialogue 2010. Lecture Notes in Computer Science. Brno: Springer Verlag, 2010. s. 330-337. ISBN: 978-3-642-15759-2. ISSN: 0302-9743.
    Detail | WWW

  • 2009

    KOMBRINK, S.; BURGET, L.; MATĚJKA, P.; KARAFIÁT, M.; HEŘMANSKÝ, H. Posterior-based Out of Vocabulary Word Detection in Telephone Speech. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. s. 80-83. ISSN: 1990-9772.
    Detail | WWW

  • 2008

    PINTO, J.; SZŐKE, I.; PRASANNA, S.; HEŘMANSKÝ, H. Fast Approximate Spoken Term Detection from Sequence of Phonemes. The 31st Annual International ACM SIGIR Conference 20-24 July 2008, Singapore. Singapore: Association for Computing Machinery, 2008. s. 28-33. ISBN: 978-90-365-2697-5.
    Detail

    BURGET, L.; SCHWARZ, P.; MATĚJKA, P.; HANNEMANN, M.; RASTROW, A.; WHITE, C.; KHUDANPUR, S.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008. s. 1-4. ISBN: 1-4244-1484-9.
    Detail | WWW

    WHITE, C.; ZWEIG, G.; BURGET, L.; SCHWARZ, P.; HEŘMANSKÝ, H. Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments. Proc. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas: IEEE Signal Processing Society, 2008. s. 1-4. ISBN: 1-4244-1484-9.
    Detail | WWW

  • 2007

    HEŘMANSKÝ, H.; BURGET, L.; SCHWARZ, P.; MATĚJKA, P.; HANNEMANN, M.; RASTROW, A.; WHITE, C.; KHUDANPUR, S.; ČERNOCKÝ, J. Recovery from Model Inconsistency in Multilingual Speech Recognition. Baltimore: Johns Hopkins University, 2007. s. 0-0.
    Detail

  • 2004

    FOUSEK, P.; SVOJANOVSKÝ, P.; GRÉZL, F.; HEŘMANSKÝ, H. New Nonsense Syllables Database - Analyses and Preliminary ASR Experiments. Proc. 8th International Conference on Spoken Language Processing. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004. s. 348-351. ISSN: 1225-4111.
    Detail

  • 2003

    SCHWARZ, P.; HEŘMANSKÝ, H.; MATĚJKA, P. On Use of Temporal Dynamics of Speech for Language Identification. Proceedings of Language Recognition Workshop 2003. NIST Gaithersburg, MD USA: 2003. s. 56-62.
    Detail

    GRÉZL, F.; HEŘMANSKÝ, H. Local averaging and differentiating of spectral plane for TRAP-based ASR. Proc. EUROSPEECH 2003. European Conference EUROSPEECH. Geneva: Institute for Perceptual Artificial Intelligence, 2003. s. 0-0. ISSN: 1018-4074.
    Detail | WWW

    MATĚJKA, P.; SCHWARZ, P.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Phoneme Recognition using Temporal Patterns. Proc. 6th International Conference Text, Speech and Dialogue, TSD2003. Ceske Budejovice: Springer Verlag, 2003. s. 465-472. ISBN: 3-540-20024-X.
    Detail

    MATĚJKA, P., SCHWARZ, P., ČERNOCKÝ, J., HEŘMANSKÝ, H. Phoneme Recognition using Temporal Patterns. In In Proceedings of the conference TSD'2003. International Conference on Text Speech and Dialogue, TSD 2003. 2003. s. 198 ( s.)ISBN: 3-540-20024- X.
    Detail

    HEŘMANSKÝ, H., MATĚJKA, P., SCHWARZ, P. On Use of Temporal Dynamics of Speech for Language Identification. In Language Recognition Workshop. NIST Gaithersburg, MD USA: 2003. s. 56 ( s.)
    Detail

  • 2002

    BURGET, L.; DUPONT, S.; GARUDADRI, H.; GRÉZL, F.; HEŘMANSKÝ, H.; JAIN, P.; KAJAREKAR, S.; MORGAN, N. QUALCOMM-ICSI-OGI Features for ASR. Proc. 7th International Conference on Spoken Language Processing. Denver: International Speech Communication Association, 2002. s. 4-7. ISBN: 1-876346-42-6.
    Detail | WWW

    GARUDADRI, H.; HEŘMANSKÝ, H.; MORGAN, N.; BENITEZ, C.; BURGET, L.; KAJAREKAR, S.; GRÉZL, F.; JAIN, P.; MOTLÍČEK, P. Distributed Voice Recognition System Utilizing Multistream Network Feature Processing. San Diego: Qualcomm, 2002. s. 0-0.
    Detail

  • 2001

    HEŘMANSKÝ, H. Human Speech Perception: Some Lessons from Automatic Speech Recognition. Proc. Text Speech and Dialogue 2001. Železná Ruda: 2001. s. 0-0.
    Detail

  • 2000

    HEŘMANSKÝ, H. Connectionist Feature Extraction for Conventional HMM System. Proc. International Conference on Acoustics, Speech and Signal Processing. Istanbul: 2000. s. 0-0.
    Detail

  • 1998

    HEŘMANSKÝ, H. TRAPS- Classifiers of Temporal Patterns. Proc. International Conference on Spoken Language Processing (ICSLP). Sydney: 1998. s. 0-0.
    Detail

  • 1997

    HEŘMANSKÝ, H. Auditory Modelling for Automatic Recognition of Speech. Proc.The First European Conference on Signal Analysis and Prediction. Praha: 1997. s. 17-21.
    Detail

*) Citace publikací se generují jednou za 24 hodin.