Publication detail

BUT system for DIHARD Speech Diarization Challenge 2018

DIEZ SÁNCHEZ, M. LANDINI, F. BURGET, L. ROHDIN, J. SILNOVA, A. ŽMOLÍKOVÁ, K. NOVOTNÝ, O. VESELÝ, K. GLEMBEK, O. PLCHOT, O. MOŠNER, L. MATĚJKA, P.

Original Title

BUT system for DIHARD Speech Diarization Challenge 2018

Type

conference paper

Language

English

Original Abstract

This paper presents the approach developed by the BUT team for the first DIHARD speech diarization challenge, which is based on our Bayesian Hidden Markov Model with eigenvoice priors system. Besides the description of the approach, we provide a brief analysis of different techniques and data processing methods tested on the development set. We also introduce a simple attempt for overlapped speech detection that we used for attaining cleaner speaker models and reassigning overlapped speech to multiple speakers. Finally, we present results obtained on the evaluation set and discuss findings we made during the development phase and with the help of the DIHARD leaderboard feedback.

Keywords

Speaker Diarization, Variational Bayes, HMM, i-vector, x-vector, Overlapped speech, DIHARD

Authors

DIEZ SÁNCHEZ, M.; LANDINI, F.; BURGET, L.; ROHDIN, J.; SILNOVA, A.; ŽMOLÍKOVÁ, K.; NOVOTNÝ, O.; VESELÝ, K.; GLEMBEK, O.; PLCHOT, O.; MOŠNER, L.; MATĚJKA, P.

Released

2. 9. 2018

Publisher

International Speech Communication Association

Location

Hyderabad

ISBN

1990-9772

Periodical

Proceedings of Interspeech

Year of study

2018

Number

9

State

French Republic

Pages from

2798

Pages to

2802

Pages count

5

URL

BibTex

@inproceedings{BUT155100,
  author="Mireia {Diez Sánchez} and Federico Nicolás {Landini} and Lukáš {Burget} and Johan Andréas {Rohdin} and Anna {Silnova} and Kateřina {Žmolíková} and Ondřej {Novotný} and Karel {Veselý} and Ondřej {Glembek} and Oldřich {Plchot} and Ladislav {Mošner} and Pavel {Matějka}",
  title="BUT system for DIHARD Speech Diarization Challenge 2018",
  booktitle="Proceedings of Interspeech 2018",
  year="2018",
  journal="Proceedings of Interspeech",
  volume="2018",
  number="9",
  pages="2798--2802",
  publisher="International Speech Communication Association",
  address="Hyderabad",
  doi="10.21437/Interspeech.2018-1749",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1749.html"
}