Combining log-spectral mean subtraction at different frequency resolutions for handset-channel compensation in single utterance speaker verification

Buyuk, O.; Arslan, L. M.

doi:10.1049/iet-spr.2011.0270

Published January 1, 2012 | Version v1

Journal article | Open access

1. Bogazici Univ, Dept Elect & Elect Engn, Istanbul, Turkey

Cepstral mean subtraction (CMS) is a well-known feature-domain channel compensation technique employed to eliminate the effects of convolutive channel distortion. However, the compensation can instead be applied in the spectral domain, before the filter-bank analysis and at a higher frequency resolution, as the authors do in log-spectral mean subtraction (LSMS). LSMS can also be combined with CMS to further improve recognition performance. In this study, the authors compare the performance of LSMS and CMS using a multi-channel, text-dependent, single-utterance speaker recognition database. In the experiments, the authors observe that LSMS outperforms CMS, especially in the high false acceptance region. Moreover, accuracy improves further when the two methods are combined. With the combination, the authors achieve a 15.5% relative reduction in equal error rate without score normalisation and 9.4% with test normalisation, compared with the baseline CMS experiment.
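The principle behind both methods can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the frequency resolutions, the filter bank, or the combination scheme, so none of those are shown. The sketch assumes an idealised, time-invariant channel that multiplies every frame's power spectrum by the same per-bin gain, so the channel becomes an additive constant in the log-spectral domain and is removed by subtracting the per-bin mean over the utterance. The function names, the toy spectra, and the gain values are all hypothetical. Applying the same mean subtraction to cepstral coefficients, after filter-bank analysis, would correspond to CMS.

```python
import math


def to_log(power_spectra):
    """Convert a list of power-spectrum frames to log-spectrum frames."""
    return [[math.log(p) for p in frame] for frame in power_spectra]


def apply_channel(power_spectra, channel_gain):
    """Simulate an idealised fixed channel: scale each bin by a constant gain."""
    return [[p * g for p, g in zip(frame, channel_gain)] for frame in power_spectra]


def log_spectral_mean_subtraction(log_spectra):
    """Subtract the per-bin mean, taken over all frames of one utterance."""
    n_frames = len(log_spectra)
    n_bins = len(log_spectra[0])
    means = [sum(frame[k] for frame in log_spectra) / n_frames for k in range(n_bins)]
    return [[frame[k] - means[k] for k in range(n_bins)] for frame in log_spectra]


# Hypothetical toy utterance: 3 frames, 4 frequency bins (illustrative values only).
clean = [
    [1.0, 2.0, 4.0, 8.0],
    [2.0, 1.0, 3.0, 5.0],
    [4.0, 4.0, 1.0, 2.0],
]
# Hypothetical handset response, one gain per frequency bin.
handset_gain = [0.5, 2.0, 0.1, 3.0]

clean_norm = log_spectral_mean_subtraction(to_log(clean))
distorted_norm = log_spectral_mean_subtraction(
    to_log(apply_channel(clean, handset_gain))
)

# Largest difference between the normalised clean and distorted features.
max_diff = max(
    abs(a - b)
    for frame_a, frame_b in zip(clean_norm, distorted_norm)
    for a, b in zip(frame_a, frame_b)
)
print("channel removed up to rounding error:", max_diff < 1e-9)
```

In this idealised setting the normalised features of the clean and distorted utterances coincide. Real handset channels, additive noise, and short utterances break the assumption to varying degrees, which is why compensation methods are compared empirically, as in the study.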

Files

Name	Size
bib-f36fd8e7-dbb7-4ae3-8a67-b63e9ffb33b0.txt (md5:dd1fea19be0bf5928f4238d0e0588ed4)	218 Bytes

	All versions	This version
Views	34	34
Downloads	11	11
Data volume	2.4 kB	2.4 kB
