Yayınlanmış 1 Ocak 2009 | Sürüm v1
Konferans bildirisi Açık

Speech-Music Segmentation System for Speech Recognition

  • 1. TUBITAK UEKAE, TR-41470 Gebze, Turkey

Açıklama

Using posterior probability based features to segment an audio signal as speech and music has been commonly used method In this study Hidden-Markov-Model (HMM) based acoustic models are used to calculate posterior probabilities. Acoustic Models includes states of context-independent phones as modeling unit. Entropy and Dynamism are found using via the posterior probabilities and these values are used as feature for speech-music discrimination. An HMM based classifier that uses Viterbi decoding is implemented and using discriminative features, audio signals are segmented as speech and music. As a result of the tests, it was found that applied speech-music segmentation method decreases Word-Error-Rate and increases the speed of recognition.

Dosyalar

bib-6ce682ff-322d-4384-81e8-f82fee98e332.txt

Dosyalar (176 Bytes)

Ad Boyut Hepisini indir
md5:c335ed3f045313dfcde2c012d171ebfb
176 Bytes Ön İzleme İndir