prof. Ing. Hynek Heřmanský, Dr. Eng.

FIT, UPGM – researcher

+420 54114 1326
hermansky@fit.vut.cz

Publications

2022
ŠŮSTEK, M.; SADHU, S.; HEŘMANSKÝ, H. Dealing with Unknowns in Continual Learning for End-to-end Automatic Speech Recognition. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Incheon: International Speech Communication Association, 2022. pp. 1046-1050. ISSN: 1990-9772.
2019
ONDEL YANG, L.; LI, R.; SELL, G.; HEŘMANSKÝ, H. Deriving Spectro-temporal Properties of Hearing from Speech Data. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019. pp. 411-415. ISBN: 978-1-5386-4658-8.
YANG, J.; ONDEL YANG, L.; MANOHAR, V.; HEŘMANSKÝ, H. Towards Automatic Methods to Detect Errors in Transcriptions of Speech Recordings. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019. pp. 3747-3751. ISBN: 978-1-5386-4658-8.
2015
PEŠÁN, J.; BURGET, L.; HEŘMANSKÝ, H.; VESELÝ, K. DNN derived filters for processing of modulation spectrum of speech. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. pp. 1908-1911. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772.
MALLIDI, S.; OGAWA, T.; VESELÝ, K.; NIDADAVOLU, P.; HEŘMANSKÝ, H. Autoencoder based multi-stream combination for noise robust speech recognition. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. pp. 3551-3555. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772.
HSIAO, R.; MA, J.; HARTMANN, W.; KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J.; WATANABE, S.; CHEN, Z.; MALLIDI, S.; HEŘMANSKÝ, H.; TSAKALIDIS, S.; SCHWARTZ, R. Robust Speech Recognition in Unknown Reverberant and Noisy Conditions. In Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop. Scottsdale, Arizona: IEEE Signal Processing Society, 2015. pp. 533-538. ISBN: 978-1-4799-7291-3.
HEŘMANSKÝ, H.; BURGET, L.; COHEN, J.; DUPOUX, E.; FELDMAN, N.; GODFREY, J.; KHUDANPUR, S.; MACIEJEWSKI, M.; MALLIDI, S.; MENON, A.; OGAWA, T.; PEDDINTI, V.; ROSE, R.; STERN, R.; WIESNER, M.; VESELÝ, K. Towards Machines That Know When They Do Not Know: Summary of Work Done at 2014 Frederick Jelinek Memorial Workshop. In Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015. pp. 5009-5013. ISBN: 978-1-4673-6997-8.
2013
PLCHOT, O.; MATSOUKAS, S.; MATĚJKA, P.; DEHAK, N.; MA, J.; CUMANI, S.; GLEMBEK, O.; HEŘMANSKÝ, H.; MESGARANI, N.; SOUFIFAR, M.; THOMAS, S.; ZHANG, B.; ZHOU, X. Developing A Speaker Identification System For The DARPA RATS Project. Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013. pp. 6768-6772. ISBN: 978-1-4799-0355-9.
2010
KOMBRINK, S.; HANNEMANN, M.; BURGET, L.; HEŘMANSKÝ, H. Recovery of Rare Words in Lecture Speech. Proc. Text, Speech and Dialogue 2010. Lecture Notes in Computer Science. Brno: Springer Verlag, 2010. pp. 330-337. ISBN: 978-3-642-15759-2. ISSN: 0302-9743.
2009
KOMBRINK, S.; BURGET, L.; MATĚJKA, P.; KARAFIÁT, M.; HEŘMANSKÝ, H. Posterior-based Out of Vocabulary Word Detection in Telephone Speech. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. pp. 80-83. ISSN: 1990-9772.
2008
PINTO, J.; SZŐKE, I.; PRASANNA, S.; HEŘMANSKÝ, H. Fast Approximate Spoken Term Detection from Sequence of Phonemes. The 31st Annual International ACM SIGIR Conference 20-24 July 2008, Singapore. Singapore: Association for Computing Machinery, 2008. pp. 28-33. ISBN: 978-90-365-2697-5.
BURGET, L.; SCHWARZ, P.; MATĚJKA, P.; HANNEMANN, M.; RASTROW, A.; WHITE, C.; KHUDANPUR, S.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008. pp. 1-4. ISBN: 1-4244-1484-9.
WHITE, C.; ZWEIG, G.; BURGET, L.; SCHWARZ, P.; HEŘMANSKÝ, H. Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments. Proc. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas: IEEE Signal Processing Society, 2008. pp. 1-4. ISBN: 1-4244-1484-9.
2007
HEŘMANSKÝ, H.; BURGET, L.; SCHWARZ, P.; MATĚJKA, P.; HANNEMANN, M.; RASTROW, A.; WHITE, C.; KHUDANPUR, S.; ČERNOCKÝ, J. Recovery from Model Inconsistency in Multilingual Speech Recognition. Baltimore: Johns Hopkins University, 2007.
2004
FOUSEK, P.; SVOJANOVSKÝ, P.; GRÉZL, F.; HEŘMANSKÝ, H. New Nonsense Syllables Database - Analyses and Preliminary ASR Experiments. Proc. 8th International Conference on Spoken Language Processing. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004. pp. 348-351. ISSN: 1225-4111.
2003
SCHWARZ, P.; HEŘMANSKÝ, H.; MATĚJKA, P. On Use of Temporal Dynamics of Speech for Language Identification. Proceedings of Language Recognition Workshop 2003. NIST Gaithersburg, MD USA: 2003. pp. 56-62.
GRÉZL, F.; HEŘMANSKÝ, H. Local averaging and differentiating of spectral plane for TRAP-based ASR. Proc. EUROSPEECH 2003. European Conference EUROSPEECH. Geneva: Institute for Perceptual Artificial Intelligence, 2003. ISSN: 1018-4074.
MATĚJKA, P.; SCHWARZ, P.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Phoneme Recognition using Temporal Patterns. Proc. 6th International Conference Text, Speech and Dialogue, TSD2003. Ceske Budejovice: Springer Verlag, 2003. pp. 465-472. ISBN: 3-540-20024-X.
MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J.; HEŘMANSKÝ, H. Phoneme Recognition using Temporal Patterns. In Proceedings of the conference TSD'2003. International Conference on Text Speech and Dialogue, TSD 2003. 2003. p. 198. ISBN: 3-540-20024-X.
HEŘMANSKÝ, H.; MATĚJKA, P.; SCHWARZ, P. On Use of Temporal Dynamics of Speech for Language Identification. In Language Recognition Workshop. NIST Gaithersburg, MD USA: 2003. p. 56.
2002
BURGET, L.; DUPONT, S.; GARUDADRI, H.; GRÉZL, F.; HEŘMANSKÝ, H.; JAIN, P.; KAJAREKAR, S.; MORGAN, N. QUALCOMM-ICSI-OGI Features for ASR. Proc. 7th International Conference on Spoken Language Processing. Denver: International Speech Communication Association, 2002. pp. 4-7. ISBN: 1-876346-42-6.
GARUDADRI, H.; HEŘMANSKÝ, H.; MORGAN, N.; BENITEZ, C.; BURGET, L.; KAJAREKAR, S.; GRÉZL, F.; JAIN, P.; MOTLÍČEK, P. Distributed Voice Recognition System Utilizing Multistream Network Feature Processing. San Diego: Qualcomm, 2002.
2001
HEŘMANSKÝ, H. Human Speech Perception: Some Lessons from Automatic Speech Recognition. Proc. Text Speech and Dialogue 2001. Železná Ruda: 2001.
2000
HEŘMANSKÝ, H. Connectionist Feature Extraction for Conventional HMM System. Proc. International Conference on Acoustics, Speech and Signal Processing. Istanbul: 2000.
1998
HEŘMANSKÝ, H. TRAPS - Classifiers of Temporal Patterns. Proc. International Conference on Spoken Language Processing (ICSLP). Sydney: 1998.
1997
HEŘMANSKÝ, H. Auditory Modelling for Automatic Recognition of Speech. Proc. The First European Conference on Signal Analysis and Prediction. Praha: 1997. pp. 17-21.
