Publication detail
Subspace modeling of prosodic features for speaker verification
KOCKMANN, M.
Original Title
Subspace modeling of prosodic features for speaker verification
Type
dissertation
Language
English
Original Abstract
The thesis investigates into speaker verification by means of prosodic features. This includesan appropriate representation of speech by measurements of pitch, energy and durationof speech sounds. Two diverse parameterization methods are investigated: the firstleads to a low-dimensional well-defined set, the second to a large-scale set of heterogeneousprosodic features. The first part of this work concentrates on the development of so calledprosodic contour features. Different modeling techniques are developed and investigated,with a special focus on subspace modeling. The second part focuses on a novel subspacemodeling technique for the heterogeneous large-scale prosodic features. The modelis theoretically derived and experimentally evaluated on official NIST Speaker RecognitionEvaluation tasks. Huge improvements over the current state-of-the-art in prosodic speakerverification were obtained. Eventually, a novel fusion method is presented to elegantlycombine the two diverse prosodic systems. This technique can also be used to fuse thehigher-level systems with a high-performing cepstral system, leading to further significantimprovements.
Keywords
speaker verification, prosody
Authors
KOCKMANN, M.
Released
21. 5. 2012
Location
Brno
Pages count
122
URL
BibTex
@phdthesis{BUT192830,
author="Marcel {Kockmann}",
title="Subspace modeling of prosodic features for speaker verification",
address="Brno",
pages="122",
year="2012",
url="http://www.fit.vutbr.cz/research/groups/speech/publi/2012/phdthesis_kockmann.pdf"
}