ALBERT — All Library Books, journals and Electronic Records Telegrafenberg

Treffer pro Seite

Treffer 1 - 2 | 2 Treffer

Sortierung

Unbekannt

PLDA in the i-supervector space for text-independent speaker verification (2014)

Ye Jiang; Kong Lee; Longbiao Wang

Springer

In: EURASIP Journal on Audio, Speech, and Music Processing

zur Merkliste hinzufügen auf der Merkliste

Details

Publikationsdatum: 2014-07-17

Beschreibung: In this paper, we advocate the use of the uncompressed form of i-vector and depend on subspace modeling using probabilistic linear discriminant analysis (PLDA) in handling the speaker and session (or channel) variability. An i-vector is a low-dimensional vector containing both speaker and channel information acquired from a speech segment. When PLDA is used on an i-vector, dimension reduction is performed twice: first in the i-vector extraction process and second in the PLDA model. Keeping the full dimensionality of the i-vector in the i-supervector space for PLDA modeling and scoring would avoid unnecessary loss of information. We refer to the uncompressed i-vector as the i-supervector. The drawback in using the i-supervector with PLDA is the inversion of large matrices in the estimation of the full posterior distribution, which we show can be solved rather efficiently by portioning large matrices into smaller blocks. We also introduce the Gaussianized rank-norm, as an alternative to whitening, for feature normalization prior to PLDA modeling. We found that the i-supervector performs better during normalization. A better performance is obtained by combining the i-supervector and i-vector at the score level. Furthermore, we also analyze the computational complexity of the i-supervector system, compared with that of the i-vector, at four different stages of loading matrix estimation, posterior extraction, PLDA modeling, and PLDA scoring.

Print ISSN: 1687-4714

Thema: Elektrotechnik, Elektronik, Nachrichtentechnik