Yayınlanmış 1 Ocak 2010
| Sürüm v1
Konferans bildirisi
Açık
Catalog-Based Single-Channel Speech-Music Separation
Oluşturanlar
- 1. Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
- 2. Bogazici Univ, Elect & Elect Engn Dept, Istanbul, Turkey
Açıklama
We propose a new catalog-based speech-music separation method for background music removal. Assuming that we know a catalog of the background music, we develop a generative model for the superposed speech and music spectrograms. We represent the speech spectrogram by a Non-negative Matrix Factorization (NMF) model and the music spectrogram by a conditional Poisson Mixture Model (PMM). By choosing the size of the catalog, i.e., the number of mixture components we can tradeoff speed versus accuracy. The combined hierarchical model leads to a mixture of multinomial distributions as the joint posterior of music and speech. Separation and hyper-parameter adaptation can be achieved via an Expectation Maximization algorithm. Experimental results show that separation performance of the algorithm is promising. Furthermore, we show that incorporating prior information such as volume adjustment parameter can enhance the separation performance.
Dosyalar
bib-363a5b89-1b16-4244-b902-53dc34c59283.txt
Dosyalar
(213 Bytes)
| Ad | Boyut | Hepisini indir |
|---|---|---|
|
md5:c9d281603d5f84eb267b4ba7e0c9adad
|
213 Bytes | Ön İzleme İndir |