prof. Ing.
Hynek Heřmanský
Dr. Eng.
FIT, UPGM – vědecký pracovník
+420 54114 1326
hermansky@fit.vut.cz
Publikace
2022
ŠŮSTEK, M.; SADHU, S.; HEŘMANSKÝ, H. Dealing with Unknowns in Continual Learning for End-to-end Automatic Speech Recognition. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Incheon: International Speech Communication Association, 2022.
s. 1046-1050. ISSN: 1990-9772.
Detail | WWW2019
ONDEL YANG, L.; LI, R.; SELL, G.; HEŘMANSKÝ, H. Deriving Spectro-temporal Properties of Hearing from Speech Data. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019.
s. 411-415. ISBN: 978-1-5386-4658-8.
Detail | WWWYANG, J.; ONDEL YANG, L.; MANOHAR, V.; HEŘMANSKÝ, H. Towards Automatic Methods to Detect Errors in Transcriptions of Speech Recordings. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019.
s. 3747-3751. ISBN: 978-1-5386-4658-8.
Detail | WWW2015
PEŠÁN, J.; BURGET, L.; HEŘMANSKÝ, H.; VESELÝ, K. DNN derived filters for processing of modulation spectrum of speech. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015.
s. 1908-1911. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772.
Detail | WWWMALLIDI, S.; OGAWA, T.; VESELÝ, K.; NIDADAVOLU, P.; HEŘMANSKÝ, H. Autoencoder based multi-stream combination for noise robust speech recognition. In Proceeding of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015.
s. 3551-3555. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772.
Detail | WWWHSIAO, R.; MA, J.; HARTMANN, W.; KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J.; WATANABE, S.; CHEN, Z.; MALLIDI, S.; HEŘMANSKÝ, H.; TSAKALIDIS, S.; SCHWARTZ, R. Robust Speech Recognition in Unknown Reverberant and Noisy Conditions. In Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop. Scottsdale, Arizona: IEEE Signal Processing Society, 2015.
s. 533-538. ISBN: 978-1-4799-7291-3.
Detail | WWWHEŘMANSKÝ, H.; BURGET, L.; COHEN, J.; DUPOUX, E.; FELDMAN, N.; GODFREY, J.; KHUDANPUR, S.; MACIEJEWSKI, M.; MALLIDI, S.; MENON, A.; OGAWA, T.; PEDDINTI, V.; ROSE, R.; STERN, R.; WIESNER, M.; VESELÝ, K. TOWARDS MACHINES THAT KNOW WHEN THEY DO NOT KNOW: SUMMARY OF WORK DONE AT 2014 FREDERICK JELINEK MEMORIAL WORKSHOP. In Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015.
s. 5009-5013. ISBN: 978-1-4673-6997-8.
Detail | WWW2013
PLCHOT, O.; MATSOUKAS, S.; MATĚJKA, P.; DEHAK, N.; MA, J.; CUMANI, S.; GLEMBEK, O.; HEŘMANSKÝ, H.; MESGARANI, N.; SOUFIFAR, M.; THOMAS, S.; ZHANG, B.; ZHOU, X. Developing A Speaker Identification System For The DARPA RATS Project. Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013.
s. 6768-6772. ISBN: 978-1-4799-0355-9.
Detail | WWW2010
KOMBRINK, S.; HANNEMANN, M.; BURGET, L.; HEŘMANSKÝ, H. Recovery of Rare Words in Lecture Speech. Proc. Text, Speech and Dialogue 2010. Lecture Notes in Computer Science. Brno: Springer Verlag, 2010.
s. 330-337. ISBN: 978-3-642-15759-2. ISSN: 0302-9743.
Detail | WWW2009
KOMBRINK, S.; BURGET, L.; MATĚJKA, P.; KARAFIÁT, M.; HEŘMANSKÝ, H. Posterior-based Out of Vocabulary Word Detection in Telephone Speech. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009.
s. 80-83. ISSN: 1990-9772.
Detail | WWW2008
PINTO, J.; SZŐKE, I.; PRASANNA, S.; HEŘMANSKÝ, H. Fast Approximate Spoken Term Detection from Sequence of Phonemes. The 31st Annual International ACM SIGIR Conference 20-24 July 2008, Singapore. Singapore: Association for Computing Machinery, 2008.
s. 28-33. ISBN: 978-90-365-2697-5.
DetailBURGET, L.; SCHWARZ, P.; MATĚJKA, P.; HANNEMANN, M.; RASTROW, A.; WHITE, C.; KHUDANPUR, S.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008.
s. 1-4. ISBN: 1-4244-1484-9.
Detail | WWWWHITE, C.; ZWEIG, G.; BURGET, L.; SCHWARZ, P.; HEŘMANSKÝ, H. Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments. Proc. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas: IEEE Signal Processing Society, 2008.
s. 1-4. ISBN: 1-4244-1484-9.
Detail | WWW2007
HEŘMANSKÝ, H.; BURGET, L.; SCHWARZ, P.; MATĚJKA, P.; HANNEMANN, M.; RASTROW, A.; WHITE, C.; KHUDANPUR, S.; ČERNOCKÝ, J. Recovery from Model Inconsistency in Multilingual Speech Recognition. Baltimore: Johns Hopkins University, 2007.
s. 0-0.
Detail2004
FOUSEK, P.; SVOJANOVSKÝ, P.; GRÉZL, F.; HEŘMANSKÝ, H. New Nonsense Syllables Database - Analyses and Preliminary ASR Experiments. Proc. 8th International Conference on Spoken Language Processing. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004.
s. 348-351. ISSN: 1225-4111.
Detail2003
SCHWARZ, P.; HEŘMANSKÝ, H.; MATĚJKA, P. On Use of Temporal Dynamics of Speech for Language Identification. Proceedings of Language Recognition Workshop 2003. NIST Gaithersburg, MD USA: 2003.
s. 56-62.
DetailGRÉZL, F.; HEŘMANSKÝ, H. Local averaging and differentiating of spectral plane for TRAP-based ASR. Proc. EUROSPEECH 2003. European Conference EUROSPEECH. Geneva: Institute for Perceptual Artificial Intelligence, 2003.
s. 0-0. ISSN: 1018-4074.
Detail | WWWMATĚJKA, P.; SCHWARZ, P.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Phoneme Recognition using Temporal Patterns. Proc. 6th International Conference Text, Speech and Dialogue, TSD2003. Ceske Budejovice: Springer Verlag, 2003.
s. 465-472. ISBN: 3-540-20024-X.
DetailMATĚJKA, P., SCHWARZ, P., ČERNOCKÝ, J., HEŘMANSKÝ, H. Phoneme Recognition using Temporal Patterns. In In Proceedings of the conference TSD'2003. International Conference on Text Speech and Dialogue, TSD 2003. 2003.
s. 198 ( s.) ISBN: 3-540-20024- X.
DetailHEŘMANSKÝ, H., MATĚJKA, P., SCHWARZ, P. On Use of Temporal Dynamics of Speech for Language Identification. In Language Recognition Workshop. NIST Gaithersburg, MD USA: 2003.
s. 56 ( s.)
Detail2002
BURGET, L.; DUPONT, S.; GARUDADRI, H.; GRÉZL, F.; HEŘMANSKÝ, H.; JAIN, P.; KAJAREKAR, S.; MORGAN, N. QUALCOMM-ICSI-OGI Features for ASR. Proc. 7th International Conference on Spoken Language Processing. Denver: International Speech Communication Association, 2002.
s. 4-7. ISBN: 1-876346-42-6.
Detail | WWWGARUDADRI, H.; HEŘMANSKÝ, H.; MORGAN, N.; BENITEZ, C.; BURGET, L.; KAJAREKAR, S.; GRÉZL, F.; JAIN, P.; MOTLÍČEK, P. Distributed Voice Recognition System Utilizing Multistream Network Feature Processing. San Diego: Qualcomm, 2002.
s. 0-0.
Detail2001
HEŘMANSKÝ, H. Human Speech Perception: Some Lessons from Automatic Speech Recognition. Proc. Text Speech and Dialogue 2001. Železná Ruda: 2001.
s. 0-0.
Detail2000
HEŘMANSKÝ, H. Connectionist Feature Extraction for Conventional HMM System. Proc. International Conference on Acoustics, Speech and Signal Processing. Istanbul: 2000.
s. 0-0.
Detail1998
HEŘMANSKÝ, H. TRAPS- Classifiers of Temporal Patterns. Proc. International Conference on Spoken Language Processing (ICSLP). Sydney: 1998.
s. 0-0.
Detail1997
HEŘMANSKÝ, H. Auditory Modelling for Automatic Recognition of Speech. Proc.The First European Conference on Signal Analysis and Prediction. Praha: 1997.
s. 17-21.
Detail
*) Citace publikací se generují jednou za 24 hodin.