Conference paper Open Access
Zubari, Unal; Ozan, Ezgi Can; Acar, Banu Oskay; Ciloglu, Tolga; Esen, Ersin; Ates, Tugrul K.; Onur, Duygu Oskay
{ "URL": "https://aperta.ulakbim.gov.tr/record/92905", "abstract": "Speech boundary detection contributes to the performance of speech-based applications such as speech recognition and speaker recognition. The speech boundary detector implemented in this study works on broadcast audio as a pre-processor module of a keyword spotter. Speech boundary detection is handled in 3 steps. In the first step, audio data is segmented into homogeneous regions in an unsupervised manner. After an ACTIVITY/NON-ACTIVITY decision is made for each region, ACTIVITY regions are classified as Speech/Non-speech via Gaussian Mixture Model (GMM) based classification. GMMs are trained using a novel feature, Spectral Flow Direction (SFD), and an improved multi-band harmonicity feature in addition to the widely used Mel Frequency Cepstral Coefficients (MFCCs).", "author": [ { "family": "Zubari", "given": "Unal" }, { "family": "Ozan", "given": "Ezgi Can" }, { "family": "Acar", "given": "Banu Oskay" }, { "family": "Ciloglu", "given": "Tolga" }, { "family": "Esen", "given": "Ersin" }, { "family": "Ates", "given": "Tugrul K." }, { "family": "Onur", "given": "Duygu Oskay" } ], "id": "92905", "issued": { "date-parts": [ [ 2010, 1, 1 ] ] }, "title": "SPEECH DETECTION ON BROADCAST AUDIO", "type": "paper-conference" }
Views | 29 |
Downloads | 10 |
Data volume | 1.8 kB |
Unique views | 28 |
Unique downloads | 10 |