Publication detail
BUT system for DIHARD Speech Diarization Challenge 2018
DIEZ SÁNCHEZ, M. LANDINI, F. BURGET, L. ROHDIN, J. SILNOVA, A. ŽMOLÍKOVÁ, K. NOVOTNÝ, O. VESELÝ, K. GLEMBEK, O. PLCHOT, O. MOŠNER, L. MATĚJKA, P.
Original title
BUT system for DIHARD Speech Diarization Challenge 2018
Type
Conference proceedings paper indexed in WoS or Scopus
Language
English
Original abstract
This paper presents the approach developed by the BUT team for the first DIHARD speech diarization challenge, which is based on our Bayesian Hidden Markov Model with eigenvoice priors system. Besides the description of the approach, we provide a brief analysis of different techniques and data processing methods tested on the development set. We also introduce a simple attempt for overlapped speech detection that we used for attaining cleaner speaker models and reassigning overlapped speech to multiple speakers. Finally, we present results obtained on the evaluation set and discuss findings we made during the development phase and with the help of the DIHARD leaderboard feedback.
Keywords
Speaker Diarization, Variational Bayes, HMM, i-vector, x-vector, Overlapped speech, DIHARD
Authors
DIEZ SÁNCHEZ, M.; LANDINI, F.; BURGET, L.; ROHDIN, J.; SILNOVA, A.; ŽMOLÍKOVÁ, K.; NOVOTNÝ, O.; VESELÝ, K.; GLEMBEK, O.; PLCHOT, O.; MOŠNER, L.; MATĚJKA, P.
Published
2 September 2018
Publisher
International Speech Communication Association
Place
Hyderabad
ISSN
1990-9772
Periodical
Proceedings of Interspeech
Volume
2018
Number
9
Country
French Republic
Pages from
2798
Pages to
2802
Number of pages
5
URL
BibTeX
@inproceedings{BUT155100,
author="Mireia {Diez Sánchez} and Federico Nicolás {Landini} and Lukáš {Burget} and Johan Andréas {Rohdin} and Anna {Silnova} and Kateřina {Žmolíková} and Ondřej {Novotný} and Karel {Veselý} and Ondřej {Glembek} and Oldřich {Plchot} and Ladislav {Mošner} and Pavel {Matějka}",
title="BUT system for DIHARD Speech Diarization Challenge 2018",
booktitle="Proceedings of Interspeech 2018",
year="2018",
journal="Proceedings of Interspeech",
volume="2018",
number="9",
pages="2798--2802",
publisher="International Speech Communication Association",
address="Hyderabad",
doi="10.21437/Interspeech.2018-1749",
issn="1990-9772",
url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1749.html"
}
Documents