Publication detail

Toroidal Probabilistic Spherical Discriminant Analysis

SILNOVA, A. BRUMMER, J. SWART, A. BURGET, L.

Original Title

Toroidal Probabilistic Spherical Discriminant Analysis

Type

conference paper

Language

English

Original Abstract

n speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring back-ends are commonly used, namely cosine scoring and PLDA. We have recently proposed PSDA, an analog to PLDA that uses Von Mises-Fisher distributions instead of Gaussians. In this paper, we present toroidal PSDA (T-PSDA). It extends PSDA with the ability to model within and between-speaker variabilities in toroidal submanifolds of the hypersphere. Like PLDA and PSDA, the model allows closed-form scoring and closed-form EM updates for training. On VoxCeleb, we find T-PSDA accu- racy on par with cosine scoring, while PLDA accuracy is infe- rior. On NIST SRE'21 we find that T-PSDA gives large accu- racy gains compared to both cosine scoring and PLDA.

Keywords

speaker recognition, PSDA, Von Mises-Fishe

Authors

SILNOVA, A.; BRUMMER, J.; SWART, A.; BURGET, L.

Released

4. 6. 2023

Publisher

IEEE Signal Processing Society

Location

Rhodes Island

ISBN

978-1-7281-6327-7

Book

Proceedings of ICASSP 2023

Pages from

1

Pages to

5

Pages count

5

URL

BibTex

@inproceedings{BUT185199,
  author="Anna {Silnova} and Johan Nikolaas Langenhoven {Brummer} and Albert du Preez {Swart} and Lukáš {Burget}",
  title="Toroidal Probabilistic Spherical Discriminant Analysis",
  booktitle="Proceedings of ICASSP 2023",
  year="2023",
  pages="1--5",
  publisher="IEEE Signal Processing Society",
  address="Rhodes Island",
  doi="10.1109/ICASSP49357.2023.10095580",
  isbn="978-1-7281-6327-7",
  url="https://ieeexplore.ieee.org/document/10095580"
}

Documents