Semi-supervised Single-Channel Speech-Music Separation for Automatic Speech Recognition

Demir, Cemil; Cemgil, A. Taylan; Saraclar, Murat

doi:10.81043/aperta.89469

Published January 1, 2011 | Version v1

Conference paper Open

Semi-supervised Single-Channel Speech-Music Separation for Automatic Speech Recognition

1. Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
2. Bogazici Univ, Dept Elect & Elect Engn, Istanbul, Turkey

In this study, we propose a semi-supervised speech-music separation method which uses the speech, music and speech-music segments in a given segmented audio signal to separate speech and music signals from each other in the mixed speech-music segments. In this strategy, we assume, the background music of the mixed signal is partially composed of the repetition of the music segment in the audio. Therefore, we used a mixture model to represent the music signal. The speech signal is modeled using Non-negative Matrix Factorization (NMF) model. The prior model of the template matrix of the NMF model is estimated using the speech segment and updated using the mixed segment of the audio. The separation performance of the proposed method is evaluated in automatic speech recognition task.

Files

bib-230f2e99-ff00-4e64-a252-7a19681829df.txt

Files (244 Bytes)

Name	Size	Download all
bib-230f2e99-ff00-4e64-a252-7a19681829df.txt md5:69d4595af947175f36a6ce361711ed75	244 Bytes	Preview Download

	All versions	This version
Views	72	72
Downloads	52	52
Data volume	12.7 kB	12.7 kB

Semi-supervised Single-Channel Speech-Music Separation for Automatic Speech Recognition

Files

bib-230f2e99-ff00-4e64-a252-7a19681829df.txt

Files (244 Bytes)

TÜBİTAK ULAKBİM

CONTACT

Semi-supervised Single-Channel Speech-Music Separation for Automatic Speech Recognition

Creators

Description

Files

bib-230f2e99-ff00-4e64-a252-7a19681829df.txt

Files (244 Bytes)