Project detail

NTTC-Speech enhancement front-end for robust automatic speech recognition with large amount of training data

Duration: 1.2.2025 — 31.1.2026

Funding resources

Non-public sector - Direct contracts - contract research, non-public sources

On the project

The joint research aims to investigate and develop speech enhancement and speaker diarization techniques for automatic speech recognition systems trained on large amounts of data.


Keywords
speech recognition, speaker diarization, large data, robustness

Default language

English

People responsible

Burget Lukáš, doc. Ing., Ph.D. - principal person responsible
Klement Dominik, Bc. - fellow researcher
Pálka Petr, Bc. - fellow researcher
Pavlus Ján, Ing. - fellow researcher

Units

Department of Computer Graphics and Multimedia
- responsible department (15.1.2025 - not assigned)
Speech Data Mining Research Group BUT Speech@FIT
- internal (15.1.2025 - 31.1.2026)
NTT Corporation
- client (15.1.2025 - 31.1.2026)
Department of Computer Graphics and Multimedia
- beneficiary (15.1.2025 - 31.1.2026)

Responsibility: Burget Lukáš, doc. Ing., Ph.D.
