Konferans bildirisi Açık Erişim
Iheme, Leonardo O.; Ozan, Sukru; Akagunduz, Erdem
This study presents the development of a voice activity detection (VAD) system tested on call center telephony data obtained from our local site. The concept of bag of audio words (BoAW) combined with a naive Bayes classifier was applied to achieve the task. It was formulated as a binary classification problem with speech as the positive class and silence/background noise as the negative class. All the processing was performed on the Mel-frequency cepstral coefficients (MFCCs) extracted from the audio recordings. The results which are presented as accuracy score and receiver operating characteristics (ROC) indicate an excellent performance of the developed model. The system is to be deployed within our call center to aid data analysis and improve overall efficiency of the center.
Dosya adı | Boyutu | |
---|---|---|
bib-f5254d62-82c8-43d7-a3e8-a22085b8bd8a.txt
md5:fac0405632330264690d29557d3b18c5 |
213 Bytes | İndir |
Görüntülenme | 28 |
İndirme | 8 |
Veri hacmi | 1.7 kB |
Tekil görüntülenme | 26 |
Tekil indirme | 8 |