Catalog-Based Single-Channel Speech-Music Separation

Demir, Cemil; Cemgil, A. Taylan; Saraclar, Murat

doi:10.81043/aperta.89641

Published January 1, 2010 | Version v1

Conference paper | Open access

1. Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
2. Bogazici Univ, Elect & Elect Engn Dept, Istanbul, Turkey

We propose a new catalog-based speech-music separation method for background music removal. Assuming that a catalog of the background music is known, we develop a generative model for the superposed speech and music spectrograms. We represent the speech spectrogram by a Non-negative Matrix Factorization (NMF) model and the music spectrogram by a conditional Poisson Mixture Model (PMM). By choosing the size of the catalog, i.e., the number of mixture components, we can trade off speed against accuracy. The combined hierarchical model leads to a mixture of multinomial distributions as the joint posterior of music and speech. Separation and hyper-parameter adaptation can be achieved via an Expectation-Maximization (EM) algorithm. Experimental results show that the separation performance of the algorithm is promising. Furthermore, we show that incorporating prior information, such as a volume adjustment parameter, can enhance the separation performance.
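The modeling idea in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation, and it makes several simplifying assumptions that the abstract does not state: frames are processed independently, the speech basis `B` is already learned and held fixed, the prior over catalog entries is uniform, and a single fixed scalar `gain` stands in for the volume adjustment parameter. The function name `separate_frame` and all argument names are hypothetical.

```python
import numpy as np

def separate_frame(x, B, C, gain=1.0, n_iter=50, eps=1e-12):
    """Split one non-negative mixture spectrum x (F,) into speech and music.

    B: (F, R) fixed speech basis (assumed pre-trained NMF dictionary).
    C: (F, K) catalog of candidate music spectra (one per mixture component).
    """
    F, R = B.shape
    K = C.shape[1]
    loglik = np.empty(K)
    speech_k = np.empty((K, F))
    col_sums = B.sum(axis=0) + eps

    for k in range(K):
        music = gain * C[:, k]
        v = np.ones(R)  # speech activations for this catalog hypothesis
        for _ in range(n_iter):
            lam = B @ v + music + eps
            # EM / multiplicative update for a Poisson model with a known
            # additive background term (the catalog entry).
            v *= (B.T @ (x / lam)) / col_sums
        lam = B @ v + music + eps
        # Poisson log-likelihood; the log(x!) term is dropped because it is
        # the same for every catalog entry.
        loglik[k] = np.sum(x * np.log(lam) - lam)
        # Given the sum, superposed Poisson sources are multinomially
        # allocated, so the expected speech share is x * speech_rate / total.
        speech_k[k] = x * (B @ v) / lam

    # Posterior over catalog entries under a uniform prior (assumption).
    resp = np.exp(loglik - loglik.max())
    resp /= resp.sum()

    speech = resp @ speech_k
    return speech, x - speech, resp
```

Under these assumptions the cost grows linearly with the number of catalog entries `K`, which is the speed-versus-accuracy trade-off the abstract mentions. A fuller treatment would also adapt the gain and the speech basis inside the EM loop rather than fixing them.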

Files

Name	Size
bib-363a5b89-1b16-4244-b902-53dc34c59283.txt (md5:c9d281603d5f84eb267b4ba7e0c9adad)	213 Bytes
