Detail publikace

Detecting English Speech in the Air Traffic Control Voice Communication

SZŐKE, I. KESIRAJU, S. NOVOTNÝ, O. KOCOUR, M. VESELÝ, K. ČERNOCKÝ, J.

Originální název

Detecting English Speech in the Air Traffic Control Voice Communication

Typ

článek ve sborníku ve WoS nebo Scopus

Jazyk

angličtina

Originální abstrakt

Developing in-cockpit voice enabled applications require a realworlddataset with labels and annotations. We launched a communityplatform for collecting the Air-Traffic Control (ATC)speech, world-wide in the ATCO2 project. Filtering out non-English speech is one of the main components in the data processingpipeline. The proposed English Language Detection(ELD) system is based on the embeddings from Bayesian subspacemultinomial model. It is trained on the word confusionnetwork from an ASR system. It is robust, easy to train, andlight weighted. We achieved 0:0439 equal-error-rate (EER),a 50% relative reduction as compared to the state-of-the-artacoustic ELD system based on x-vectors, in the in-domain scenario.Further, we achieved an EER of 0:1352, a 33% relativereduction as compared to the acoustic ELD, in the unseen language(out-of-domain) condition. We plan to publish the evaluationdataset from the ATCO2 project.

Klíčová slova

speech recognition, language detection, x-vectorextractor, acoustic model, air-traffic communication, data collection, text embeddings, Bayesian methods

Autoři

SZŐKE, I.; KESIRAJU, S.; NOVOTNÝ, O.; KOCOUR, M.; VESELÝ, K.; ČERNOCKÝ, J.

Vydáno

30. 8. 2021

Nakladatel

International Speech Communication Association

Místo

Brno

ISSN

1990-9772

Periodikum

Proceedings of Interspeech

Ročník

2021

Číslo

8

Stát

Francouzská republika

Strany od

3286

Strany do

3290

Strany počet

5

URL

BibTex

@inproceedings{BUT175844,
  author="Igor {Szőke} and Santosh {Kesiraju} and Ondřej {Novotný} and Martin {Kocour} and Karel {Veselý} and Jan {Černocký}",
  title="Detecting English Speech in the Air Traffic Control Voice Communication",
  booktitle="Proceedings Interspeech 2021",
  year="2021",
  journal="Proceedings of Interspeech",
  volume="2021",
  number="8",
  pages="3286--3290",
  publisher="International Speech Communication Association",
  address="Brno",
  doi="10.21437/Interspeech.2021-1033",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/interspeech_2021/szoke21_interspeech.html"
}

Dokumenty