Přístupnostní navigace
E-přihláška
Vyhledávání Vyhledat Zavřít
Detail publikace
MATĚJKA, P. PLCHOT, O. GLEMBEK, O. BURGET, L. ROHDIN, J. ZEINALI, H. MOŠNER, L. SILNOVA, A. NOVOTNÝ, O. DIEZ SÁNCHEZ, M. ČERNOCKÝ, J.
Originální název
13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE
Typ
článek v časopise ve Web of Science, Jimp
Jazyk
angličtina
Originální abstrakt
In this paper, we present a brief history and a "longitudinal study" of all important milestone modelling techniques used in text independent speaker recognition since Brno University of Technology (BUT) first participated in the NIST Speaker Recognition Evaluation (SRE) in 2006-GMM MAP, GMM MAP with eigen-channel adaptation, Joint Factor Analysis, i-vector and DNN embedding (x-vector). To emphasize the historical context, the techniques are evaluated on all NIST SRE sets since 2004 on a time-machine principle, i.e. a system is always trained using all data available up till the year of evaluation. Moreover, as user-contributed audiovisual content dominates nowadays Internet, we representatively include the Speakers In The Wild (SITW) and VOiCES challenge datasets in the evaluation of our systems. Not only we present a comparison of the modelling techniques, but we also show the effect of sampling frequency.
Klíčová slova
Speaker recognition, NIST, Evaluations, GMM, Eigen-channel, compensation, JFA, I-vectors, DNN Embedding, X-vectors
Autoři
MATĚJKA, P.; PLCHOT, O.; GLEMBEK, O.; BURGET, L.; ROHDIN, J.; ZEINALI, H.; MOŠNER, L.; SILNOVA, A.; NOVOTNÝ, O.; DIEZ SÁNCHEZ, M.; ČERNOCKÝ, J.
Vydáno
1. 9. 2020
ISSN
0885-2308
Periodikum
COMPUTER SPEECH AND LANGUAGE
Ročník
2020
Číslo
63
Stát
Spojené království Velké Británie a Severního Irska
Strany od
1
Strany do
15
Strany počet
URL
https://www.sciencedirect.com/science/article/pii/S0885230819302797?via%3Dihub
BibTex
@article{BUT162674, author="Pavel {Matějka} and Oldřich {Plchot} and Ondřej {Glembek} and Lukáš {Burget} and Johan Andréas {Rohdin} and Hossein {Zeinali} and Ladislav {Mošner} and Anna {Silnova} and Ondřej {Novotný} and Mireia {Diez Sánchez} and Jan {Černocký}", title="13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE", journal="COMPUTER SPEECH AND LANGUAGE", year="2020", volume="2020", number="63", pages="1--15", doi="10.1016/j.csl.2019.101035", issn="0885-2308", url="https://www.sciencedirect.com/science/article/pii/S0885230819302797?via%3Dihub" }