Detail publikace
Subspace modeling of prosodic features for speaker verification
KOCKMANN, M.
Originální název
Subspace modeling of prosodic features for speaker verification
Typ
dizertace
Jazyk
angličtina
Originální abstrakt
The thesis investigates into speaker verification by means of prosodic features. This includesan appropriate representation of speech by measurements of pitch, energy and durationof speech sounds. Two diverse parameterization methods are investigated: the firstleads to a low-dimensional well-defined set, the second to a large-scale set of heterogeneousprosodic features. The first part of this work concentrates on the development of so calledprosodic contour features. Different modeling techniques are developed and investigated,with a special focus on subspace modeling. The second part focuses on a novel subspacemodeling technique for the heterogeneous large-scale prosodic features. The modelis theoretically derived and experimentally evaluated on official NIST Speaker RecognitionEvaluation tasks. Huge improvements over the current state-of-the-art in prosodic speakerverification were obtained. Eventually, a novel fusion method is presented to elegantlycombine the two diverse prosodic systems. This technique can also be used to fuse thehigher-level systems with a high-performing cepstral system, leading to further significantimprovements.
Klíčová slova
speaker verification, prosody
Autoři
KOCKMANN, M.
Vydáno
21. 5. 2012
Místo
Brno
Strany počet
122
URL
BibTex
@phdthesis{BUT192830,
author="Marcel {Kockmann}",
title="Subspace modeling of prosodic features for speaker verification",
address="Brno",
pages="122",
year="2012",
url="http://www.fit.vutbr.cz/research/groups/speech/publi/2012/phdthesis_kockmann.pdf"
}