CATALOG-BASED SINGLE-CHANNEL SPEECH-MUSIC SEPARATION FOR AUTOMATIC SPEECH RECOGNITION

Demir, Cemil; Cemgil, A. Taylan; Saraclar, Murat

doi:10.48623/aperta.92323

Yayınlanmış 1 Ocak 2011 | Sürüm v1

Konferans bildirisi Açık

CATALOG-BASED SINGLE-CHANNEL SPEECH-MUSIC SEPARATION FOR AUTOMATIC SPEECH RECOGNITION

1. Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
2. Bogazici Univ, Elect & Elect Engn Dept, Istanbul, Turkey

In this study, we analyze the effect of the catalog-based single-channel speech-music separation method, which we proposed previously, on speech recognition performance. In the proposed method, assuming that we know a catalog of the background music, we developed a generative model for the superposed speech and music spectrograms. We represent the speech spectrogram by a Non-negative Matrix Factorization (NMF) model and the music spectrogram by a conditional Poisson Mixture Model (PMM). In this paper, we propose to recover the speech signals from the mixed signal in time-domain by detecting the active catalog frames using the catalog-based method. We compare the performances of 3 different signal reconstruction techniques; Expectation Based, Posterior-Based and Time-Domain reconstruction. Moreover, we compare the performance of our system with the performance of the traditional NMF model. Our method outperforms the NMF method in ASR performance and separation performance in most experimental conditions.

Dosyalar

bib-76347895-0250-498f-a744-216f1f3b66eb.txt

Dosyalar (189 Bytes)

Ad	Boyut	Hepisini indir
bib-76347895-0250-498f-a744-216f1f3b66eb.txt md5:1a06067e9cd82606f44dd9d36cbca54b	189 Bytes	Ön İzleme İndir

	Tüm sürümler	Bu sürüm
Görüntüleme	82	82
İndirilenler	19	19
Veri miktarı	3.6 kB	3.6 kB

CATALOG-BASED SINGLE-CHANNEL SPEECH-MUSIC SEPARATION FOR AUTOMATIC SPEECH RECOGNITION

Dosyalar

bib-76347895-0250-498f-a744-216f1f3b66eb.txt

Dosyalar (189 Bytes)

TÜBİTAK ULAKBİM

İLETİŞİM

CATALOG-BASED SINGLE-CHANNEL SPEECH-MUSIC SEPARATION FOR AUTOMATIC SPEECH RECOGNITION

Oluşturanlar

Açıklama

Dosyalar

bib-76347895-0250-498f-a744-216f1f3b66eb.txt

Dosyalar (189 Bytes)