Project detail

Speech Recognition under Real-World Conditions

Duration: 01.01.2008 — 31.12.2011

Funding resources

Czech Science Foundation - Standardní projekty

- whole funder (2008-01-01 - 2011-12-31)

On the project

Projekt o Rozpoznávání mluvené řeči v reálných podmínkách. Projekt navazuje na předchozí grantově podporovaný výzkum, v němž se řešitelskému týmu podařilo vyvinout a částečně i realizovat základní metody rozpoznávání řeči v českém jazyce. Aby však mohly být úspěšně nasazeny v nejvíce žádaných aplikacích, jako jsou přepisy hovorů, záznamů diskusí nebo jednání v soudních síních, musí být pozornost zaměřena na analýzu a modelování běžné mluvené (hovorové) řeči zaznamenávané v reálných podmínkách za přítomnosti šumu, hluků, případně dalších mluvících osob.

Description in English
This project follows preceding research projects within which the participating teams developed and implemented basic speech recognition algorithms for Czech. For their successful use in the most challenging applications, such as transcription of talks, recordings of court-hearings, etc., the research must continue in analysis and modelling of colloquial speech recorded in real conditions (e.g. with different backgrounds, noises, or with cross-talk). The main goal of this four-year project is to design and test new speech feature extraction techniques, background or noise suppression, speaker change-point detection, quick adaptation to new speaker characteristics, to improve lexical and phonetic inventory of recognition systems for colloquial speech, and also to develop language models with better coverage of inflective nature of Czech. This project will contribute to advancing the state-of-the-art in basic research of speech recognition and it will facilitate the integration of involved teams into European research community.

Keywords
rozpoznávání řeči

Key words in English
speech recognition

Mark

GA102/08/0707

Default language

Czech

People responsible

Černocký Jan, prof. Dr. Ing. - fellow researcher
Müller Luděk - fellow researcher
Nouza Jan - fellow researcher
Pollák Petr - principal person responsible

Units

Faculty of Information Technology
- beneficiary (2011-05-13 - not assigned)
Department of Computer Graphics and Multimedia
- co-beneficiary (2008-01-01 - 2011-12-31)

Results

BURGET, L.; SCHWARZ, P.; MATĚJKA, P.; HANNEMANN, M.; RASTROW, A.; WHITE, C.; KHUDANPUR, S.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008. p. 1-4. ISBN: 1-4244-1484-9.
Detail

GRÉZL, F.; FOUSEK, P. Optimizing bottle-neck features for LVCSR. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas, Nevada: IEEE Signal Processing Society, 2008. p. 4729-4732. ISBN: 1-4244-1484-9.
Detail

PLCHOT, O.; HUBEIKA, V.; BURGET, L.; SCHWARZ, P.; MATĚJKA, P. Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition. Proc. 11th International Conference on Text, Speech and Dialogue. Berlin: Springer Verlag, 2008. p. 477-483. ISBN: 978-3-540-87390-7.
Detail

GLEMBEK, O.; BURGET, L.; DEHAK, N.; BRÜMMER, N.; KENNY, P. Comparison of Scoring Methods used in Speaker Recognition with Joint Factor Analysis. Proc. ICASSP 2009. Taipei: IEEE Signal Processing Society, 2009. p. 1-4. ISBN: 978-1-4244-2354-5.
Detail

KOMBRINK, S.; BURGET, L.; MATĚJKA, P.; KARAFIÁT, M.; HEŘMANSKÝ, H. Posterior-based Out of Vocabulary Word Detection in Telephone Speech. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. p. 80-83. ISSN: 1990-9772.
Detail

BURGET, L.; FAPŠO, M.; HUBEIKA, V.; GLEMBEK, O.; KARAFIÁT, M.; KOCKMANN, M.; MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J. BUT system for NIST 2008 speaker recognition evaluation. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. p. 2335-2338. ISBN: 978-1-61567-692-7. ISSN: 1990-9772.
Detail

MATĚJKA, P.; BURGET, L.; GLEMBEK, O.; SCHWARZ, P.; HUBEIKA, V.; FAPŠO, M.; MIKOLOV, T.; PLCHOT, O.; ČERNOCKÝ, J. BUT language recognition system for NIST 2007 evaluations. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane, Australia: International Speech Communication Association, 2008. p. 1-4. ISSN: 1990-9772.
Detail

HUBEIKA, V.; BURGET, L.; MATĚJKA, P.; SCHWARZ, P. Discriminative Training and Channel Compensation for Acoustic Language Recognition. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane: International Speech Communication Association, 2008. p. 1-4. ISSN: 1990-9772.
Detail

KOCKMANN, M.; BURGET, L. Syllable based Feature-Contours for Speaker Recognition. Proc. 14th International Workshop on Advances in Speech Technology. Maribor: 2008. p. 1-4.
Detail

GLEMBEK, O.; MATĚJKA, P.; BURGET, L.; MIKOLOV, T. Advances in Phonotactic Language Recognition. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane: International Speech Communication Association, 2008. p. 1-4. ISSN: 1990-9772.
Detail

SZŐKE, I.; FAPŠO, M.; BURGET, L.; ČERNOCKÝ, J. Hybrid word-subword decoding for spoken term detection. Proc. SSCS 2008: Speech search workshop at SIGIR. Singapore: Association for Computing Machinery, 2008. p. 1-4. ISBN: 978-90-365-2697-5.
Detail

MIKOLOV, T. LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION OF CZECH LECTURES. Proc. STUDENT EEICT 2008. Brno: Faculty of Electrical Engineering and Communication BUT, 2008. p. 1-5. ISBN: 978-80-214-3617-6.
Detail

KARAFIÁT, M.; BURGET, L.; HAIN, T.; ČERNOCKÝ, J. Discrimininative training of narrow band - wide band adaptated systems for meeting recognition. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane: International Speech Communication Association, 2008. p. 1-4. ISSN: 1990-9772.
Detail

SZŐKE, I.; BURGET, L.; ČERNOCKÝ, J.; FAPŠO, M. Sub-word modeling of out of vocabulary words in spoken term detection. Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008. p. 1-4. ISBN: 978-1-4244-3472-5.
Detail

KOCKMANN, M.; BURGET, L. Contour modeling of prosodic and acoustic features for speaker recognition. Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008. p. 1-4. ISBN: 978-1-4244-3472-5.
Detail

GRÉZL, F.; KARAFIÁT, M.; BURGET, L. Investigation into bottle-neck features for meeting speech recognition. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. p. 2947-2950. ISBN: 978-1-61567-692-7. ISSN: 1990-9772.
Detail

BRÜMMER, N.; STRASHEIM, A.; HUBEIKA, V.; MATĚJKA, P.; BURGET, L.; GLEMBEK, O. Discriminative Acoustic Language Recognition via Channel-Compensated GMM Statistics. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. p. 2187-2190. ISBN: 978-1-61567-692-7. ISSN: 1990-9772.
Detail

KOCKMANN, M.; BURGET, L.; ČERNOCKÝ, J. Brno University of Technology System for Interspeech 2009 Emotion Challenge. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. p. 348-351. ISSN: 1990-9772.
Detail

BURGET, L.; MATĚJKA, P.; HUBEIKA, V.; ČERNOCKÝ, J. Investigation into variants of Joint Factor Analysis for speaker recognition. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. p. 1263-1266. ISBN: 978-1-61567-692-7. ISSN: 1990-9772.
Detail

SANTHOSH KUMAR, C.; LI, H.; TONG, R.; MATĚJKA, P.; BURGET, L.; ČERNOCKÝ, J. Tuning phone decoders for language identification. Proc. International Conference on Acoustics, Speech, and Signal Processing 2010. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010. p. 5010-5013. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149.
Detail

KOCKMANN, M.; BURGET, L.; ČERNOCKÝ, J. Investigations into prosodic syllable contour features for speaker recognition. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010. p. 4418-4421. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149.
Detail

MIKOLOV, T.; PLCHOT, O.; GLEMBEK, O.; MATĚJKA, P.; BURGET, L.; ČERNOCKÝ, J. PCA-based Feature Extraction for Phonotactic Language Recognition. In Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop. Brno: International Speech Communication Association, 2010. p. 251-255. ISBN: 978-80-214-4114-9.
Detail

HANNEMANN, M.; KOMBRINK, S.; KARAFIÁT, M.; BURGET, L. Similarity Scoring for Recognizing Repeated Out-of-VocabularyWords. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. p. 897-900. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.
Detail

JANČÍK, Z.; PLCHOT, O.; BRUMMER, J.; BURGET, L.; GLEMBEK, O.; HUBEIKA, V.; KARAFIÁT, M.; MATĚJKA, P.; MIKOLOV, T.; STRASHEIM, A.; ČERNOCKÝ, J. Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system. In Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop. Brno: International Speech Communication Association, 2010. p. 215-221. ISBN: 978-80-214-4114-9.
Detail

KARAFIÁT, M.; SZŐKE, I.; ČERNOCKÝ, J. Using Gradient Descent Optimization for Acoustics Training from Heterogeneous Data. Proc. Text, Speech and Dialog 2010. Lecture Notes in Computer Science. LNAI 6231. Brno: Springer Verlag, 2010. p. 322-329. ISBN: 978-3-642-15759-2. ISSN: 0302-9743.
Detail

KOMBRINK, S.; HANNEMANN, M.; BURGET, L.; HEŘMANSKÝ, H. Recovery of Rare Words in Lecture Speech. Proc. Text, Speech and Dialogue 2010. Lecture Notes in Computer Science. Brno: Springer Verlag, 2010. p. 330-337. ISBN: 978-3-642-15759-2. ISSN: 0302-9743.
Detail

KOCKMANN, M.; BURGET, L.; GLEMBEK, O.; FERRER, L.; ČERNOCKÝ, J. Prosodic Speaker Verification using Subspace Multinomial Models with Intersession Compensation. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba, Japan: International Speech Communication Association, 2010. p. 1061-1064. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.
Detail

VESELÝ, K.; BURGET, L.; GRÉZL, F. Parallel Training of Neural Networks for Speech Recognition. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. p. 2934-2937. ISSN: 1990-9772.
Detail

KOCKMANN, M.; BURGET, L.; ČERNOCKÝ, J. Brno University of Technology System for Interspeech 2010 Paralinguistic Challenge. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. p. 2822-2825. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.
Detail

GRÉZL, F.; KARAFIÁT, M. Hierarchical Neural Net Architectures for Feature Extraction in ASR. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. p. 1201-1204. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.
Detail

HAIN, T.; BURGET, L.; DINES, J.; GARNER, P.; EL HANNANI, A.; HUIJBREGTS, M.; KARAFIÁT, M.; LINCOLN, M.; WAN, V. The AMIDA 2009 Meeting Transcription System. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. p. 358-361. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.
Detail

SZŐKE, I.; GRÉZL, F.; ČERNOCKÝ, J.; FAPŠO, M. Acoustic keyword spotter - optimization from end-user perspective. Proceedings of the 2010 IEEE Spoken Language Technology Workshop. IEEE Catalog Number: CFP 10SLT-USB. Berkeley, California: IEEE Signal Processing Society, 2010. p. 177-181. ISBN: 978-1-4244-7902-3.
Detail

BRUMMER, J.; BURGET, L.; KENNY, P.; MATĚJKA, P.; DE VILLIERS, E.; KARAFIÁT, M.; KOCKMANN, M.; GLEMBEK, O.; PLCHOT, O.; BAUM, D.; SENOUSSAUOI, M. ABC System description for NIST SRE 2010. Proc. NIST 2010 Speaker Recognition Evaluation. Brno: National Institute of Standards and Technology, 2010. p. 1-20.
Detail

VESELÝ, K.; BURGET, L.; GRÉZL, F. Parallel Training of Neural Networks for Speech Recognition. Prof. Text, Speech and Dialogue 2010. Lecture Notes in Computer Science. LNAI 6231. Brno: Springer Verlag, 2010. p. 439-446. ISBN: 978-3-642-15759-2. ISSN: 0302-9743.
Detail

VESELÝ, K. Parallel training of neural networks for speech recognition. Proceedings of the 16th Conference STUDENT EEICT 2010. Volume 3. Brno: Brno University of Technology, 2010. p. 74-76. ISBN: 978-80-214-4078-4.
Detail

MIKOLOV, T.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J.; KHUDANPUR, S. Recurrent neural network based language model. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. p. 1045-1048. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.
Detail

BRÜMMER, N.; BURGET, L.; GLEMBEK, O.; HUBEIKA, V.; JANČÍK, Z.; KARAFIÁT, M.; MATĚJKA, P.; MIKOLOV, T.; PLCHOT, O.; STRASHEIM, A. BUT-AGNITIO System Description for NIST Language Recognition Evaluation 2009. Proceedings NIST 2009 Language Recognition Evaluation Workshop. Baltimore, Maryland, USA: National Institute of Standards and Technology, 2009. p. 1-7.
Detail

GRÉZL, F.; ČERNOCKÝ, J. Audio Surveillance through Known Event Classification. Radioengineering, 2009, vol. 18, no. 4, p. 671-675. ISSN: 1210-2512.
Detail

BURGET, L.; FAPŠO, M.; HUBEIKA, V.; GLEMBEK, O.; KARAFIÁT, M.; KOCKMANN, M.; MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J. Brno University Of Technology - NIST 2008 SRE. Montreal: 2008. p. 1-28.
Detail

GLEMBEK, O.; BURGET, L.; KENNY, P.; KARAFIÁT, M.; MATĚJKA, P. Simplification and optimization of I-Vector Extraction. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 4516-4519. ISBN: 978-1-4577-0537-3.
Detail

DEORAS, A.; MIKOLOV, T.; KOMBRINK, S.; KARAFIÁT, M.; KHUDANPUR, S. Variational Approximation of Long-span Language Models for LVCSR. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 5532-5535. ISBN: 978-1-4577-0537-3.
Detail

POVEY, D.; BURGET, L.; AGARWAL, M.; AKYAZI, P.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; RASTROW, A.; ROSE, R.; SCHWARZ, P.; THOMAS, S. The subspace Gaussian mixture model-A structured model for speech recognition. COMPUTER SPEECH AND LANGUAGE, 2011, vol. 25, no. 2, p. 404-439. ISSN: 0885-2308.
Detail

BURGET, L.; PLCHOT, O.; CUMANI, S.; GLEMBEK, O.; MATĚJKA, P.; BRÜMMER, N. Discriminatively Trained Probabilistic Linear Discriminant Analysis for Speaker Verification. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 4832-4835. ISBN: 978-1-4577-0537-3.
Detail

CUMANI, S.; BRÜMMER, N.; BURGET, L.; LAFACE, P. Fast Discriminative Speaker Verification in the I-Vector Space. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 4852-4855. ISBN: 978-1-4577-0537-3.
Detail

KOCKMANN, M.; FERRER, L.; BURGET, L.; SHRIBERG, E.; ČERNOCKÝ, J. Recent Progress in Prosodic Speaker Verification. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 4556-4559. ISBN: 978-1-4577-0537-3.
Detail

MATĚJKA, P.; GLEMBEK, O.; CASTALDO, F.; ALAM, J.; PLCHOT, O.; KENNY, P.; BURGET, L.; ČERNOCKÝ, J. Full-covariance UBM and Heavy-tailed PLDA in I-Vector Speaker Verification. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 4828-4831. ISBN: 978-1-4577-0537-3.
Detail

MIKOLOV, T.; KOMBRINK, S.; BURGET, L.; ČERNOCKÝ, J.; KHUDANPUR, S. Extensions of Recurrent Neural Network Language Model. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 5528-5531. ISBN: 978-1-4577-0537-3.
Detail

DEORAS, A.; MIKOLOV, T.; CHURCH, K. A Fast Re-scoring Strategy to Capture Long-Distance Dependencies. Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing July 2011 Edinburgh, Scotland, UK. Edinburgh: Association for Computational Linguistics, 2011. p. 1116-1127. ISBN: 978-1-937284-11-4.
Detail

KOCKMANN, M.; BURGET, L.; ČERNOCKÝ, J. Application of speaker- and language identification state-of-the-art techniques for emotion recognition. Speech Communication, 2011, vol. 53, no. 9, p. 1172-1185. ISSN: 0167-6393.
Detail

GRÉZL, F.; KARAFIÁT, M. Integrating recent MLP feature extraction techniques into TRAP architecture. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011. p. 1229-1232. ISBN: 978-1-61839-270-1. ISSN: 1990-9772.
Detail

MIKOLOV, T.; DEORAS, A.; KOMBRINK, S.; BURGET, L.; ČERNOCKÝ, J. Empirical Evaluation and Combination of Advanced Language Modeling Techniques. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011. p. 605-608. ISBN: 978-1-61839-270-1. ISSN: 1990-9772.
Detail

KOMBRINK, S.; MIKOLOV, T.; KARAFIÁT, M.; BURGET, L. Recurrent Neural Network based Language Modeling in Meeting Recognition. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011. p. 2877-2880. ISBN: 978-1-61839-270-1. ISSN: 1990-9772.
Detail

KARAFIÁT, M.; BURGET, L.; MATĚJKA, P.; GLEMBEK, O.; ČERNOCKÝ, J. iVector-Based Discriminative Adaptation for Automatic Speech Recognition. Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 152-157. ISBN: 978-1-4673-0366-8.
Detail

VESELÝ, K.; KARAFIÁT, M.; GRÉZL, F. Convolutive Bottleneck Network Features for LVCSR. Proceedings of ASRU 2011. Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 42-47. ISBN: 978-1-4673-0366-8.
Detail

GRÉZL, F. The Role of Neural Network Size in TRAP/HATS Feature Extraction. Proceedings Text, Speech and Dialogue 2011. Lecture Notes in Computer Science. LNAI 6836. Plzeň: Springer Verlag, 2011. p. 315-322. ISBN: 978-3-642-23537-5. ISSN: 0302-9743.
Detail

KOCKMANN, M.; FERRER, L.; BURGET, L.; ČERNOCKÝ, J. iVector Fusion of Prosodic and Cepstral Features for Speaker Verification. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011. p. 265-268. ISBN: 978-1-61839-270-1. ISSN: 1990-9772.
Detail

BOŘIL, H.; GRÉZL, F.; HANSEN, J. Front-End Compensation Methods for LVCSR Under Lombard Effect. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011. p. 1257-1260. ISBN: 978-1-61839-270-1. ISSN: 1990-9772.
Detail

MIKOLOV, T.; DEORAS, A.; POVEY, D.; BURGET, L.; ČERNOCKÝ, J. Strategies for Training Large Scale Neural Network Language Models. Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 196-201. ISBN: 978-1-4673-0366-8.
Detail

GRÉZL, F.; KARAFIÁT, M.; JANDA, M. Study of Probabilistic and Bottle-Neck Features in Multilingual Environment. Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 359-364. ISBN: 978-1-4673-0366-8.
Detail

PEŠÁN, J. Rozpoznávání mluvčího na mobilním telefonu. Proceedings of the 17th Conference Student EEICT 2011. Volume 2. Brno: Vysoké učení technické v Brně, 2011. s. 341-343. ISBN: 978-80-214-4272-6.
Detail

KOMBRINK, S.; MIKOLOV, T. Recurrent Neural Network Language Modeling Applied to the Brno AMI/AMIDA 2009 Meeting Recognizer Setup. Proceedings of the 17th Conference STUDENT EEICT 2011. Volume 3. Brno: Brno University of Technology, 2011. p. 527-531. ISBN: 978-80-214-4273-3.
Detail

POVEY, D.; HANNEMANN, M.; BOULIANNE, G.; BURGET, L.; GHOSHAL, A.; JANDA, M.; KARAFIÁT, M.; KOMBRINK, S.; MOTLÍČEK, P.; QIAN, Y.; RIEDHAMMER, K.; VESELÝ, K.; VU, N. Generating Exact Lattices in The WFST Framework. Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. Kyoto: IEEE Signal Processing Society, 2012. p. 4213-4216. ISBN: 978-1-4673-0044-5.
Detail

CUMANI, S.; PLCHOT, O.; KARAFIÁT, M. Independent Component Analysis and MLLR Transforms for Speaker Identification. Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012. p. 4365-4368. ISBN: 978-1-4673-0044-5.
Detail

KOMBRINK, S.; HANNEMANN, M.; BURGET, L. Out-of-Vocabulary Word Detection and Beyond. In Detection and Identification of Rare Audiovisual Cues. Studies in Computational Intelligence, 384. Springer-Verlag Berlin Heidelberg: Springer Verlag, 2012. p. 57-65. ISBN: 978-3-642-24033-1.
Detail

HAIN, T.; BURGET, L.; DINES, J.; GARNER, P.; GRÉZL, F.; EL HANNANI, A.; HUIJBREGTS, M.; KARAFIÁT, M.; LINCOLN, M.; WAN, V. Transcribing Meetings with the AMIDA System. IEEE Transactions on Audio, Speech, and Language Processing, 2012, vol. 20, no. 2, p. 486-498. ISSN: 1558-7916.
Detail

MIKOLOV, T.; KOMBRINK, S.; DEORAS, A.; BURGET, L.; ČERNOCKÝ, J. RNNLM - Recurrent Neural Network Language Modeling Toolkit. Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 1-4. ISBN: 978-1-4673-0366-8.
Detail

DEORAS, A.; MIKOLOV, T.; KOMBRINK, S.; CHURCH, K. Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model. Speech Communication, 2012, vol. 2012, no. 8, p. 1-16. ISSN: 0167-6393.
Detail

VESELÝ, K.: VUT-SW-Search; Neural Network Trainer TNet. http://speech.fit.vutbr.cz/en/software/neural-network-trainer-tnet. URL: http://speech.fit.vutbr.cz/en/software/neural-network-trainer-tnet. (software)
Detail

Link