Project detail

Speech enhancement front-end for robust automatic speech recognition with large amount of training data

Duration: 2.1.2023 — 31.1.2024

Funding resources

Neveřejný sektor - Přímé kontrakty - smluvní výzkum, neveřejné zdroje

On the project

The joint research will aim at investigating and developing speech enhancement and speaker diarization techniques for automatic speech recognition systems that are trained using a large amount of training data.

Description in Czech
Společný výzkum se zaměří na zkoumání a vývoj technik vylepšování řeči a diarizace mluvčího pro systémy automatického rozpoznávání řeči, které jsou trénovány pomocí velkého množství tréninkových dat.

Keywords
speech recognition, speaker diarization, large data, robustness

Key words in Czech
rozpoznávání řeči, diarizace mluvčího, velký objem dat, robustnost

Default language

English

People responsible

Diez Sánchez Mireia, M.Sc., Ph.D. - principal person responsible
Pavlus Ján, Ing. - fellow researcher
Peng Junyi - fellow researcher
Švec Ján, Ing. - fellow researcher

Units

Department of Computer Graphics and Multimedia
- responsible department (9.1.2023 - not assigned)
Speech Data Mining Research Group BUT Speech@FIT
- internal (9.1.2023 - 31.1.2024)
NTT Corporation
- client (9.1.2023 - 31.1.2024)
Department of Computer Graphics and Multimedia
- beneficiary (9.1.2023 - 31.1.2024)

Responsibility: Diez Sánchez Mireia, M.Sc., Ph.D.

VUT

Faculties

University Institutes

Parts

Speech enhancement front-end for robust automatic speech recognition with large amount of training data