Konferans bildirisi Açık Erişim
Zubari, Unal; Ozan, Ezgi Can; Acar, Banu Oskay; Ciloglu, Tolga; Esen, Ersin; Ates, Tugrul K.; Onur, Duygu Oskay
<?xml version='1.0' encoding='UTF-8'?> <record xmlns="http://www.loc.gov/MARC21/slim"> <leader>00000nam##2200000uu#4500</leader> <datafield tag="245" ind1=" " ind2=" "> <subfield code="a">SPEECH DETECTION ON BROADCAST AUDIO</subfield> </datafield> <datafield tag="024" ind1=" " ind2=" "> <subfield code="a">10.81043/aperta.92905</subfield> <subfield code="2">doi</subfield> </datafield> <controlfield tag="001">92905</controlfield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-tubitak-adresli-yayinlar</subfield> </datafield> <datafield tag="520" ind1=" " ind2=" "> <subfield code="a">Speech boundary detection contributes to performance of speech based applications such as speech recognition and speaker recognition. Speech boundary detector implemented in this study works on broadcast audio as a pre-processor module of a keyword spotter. Speech boundary detection is handled in 3 steps. At first step, audio data is segmented into homogeneous regions in an unsupervised manner. After an ACTIVITY/NON-ACTIVITY decision is made for each region, ACTIVITY regions are classified as Speech/Non-speech via Gaussian Mixture Model (GMM) based classification. GMM's are trained using a novel feature, Spectral Flow Direction (SFD), and an improved multi-band harmonicity feature in addition to widely used Mel Frequency Cepstral Coefficients (MFCC's).</subfield> </datafield> <datafield tag="650" ind1="1" ind2="7"> <subfield code="2">opendefinition.org</subfield> <subfield code="a">cc-by</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Ozan, Ezgi Can</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="u">TUBITAK UZAY, Video & Audio Proc Grp, TR-06531 Ankara, Turkey</subfield> <subfield code="a">Acar, Banu Oskay</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="u">METU, Dept Elect & Elect Engn, TR-06531 Ankara, Turkey</subfield> <subfield code="a">Ciloglu, Tolga</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="u">TUBITAK UZAY, Video & Audio Proc Grp, TR-06531 Ankara, Turkey</subfield> <subfield code="a">Esen, Ersin</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Ates, Tugrul K.</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="u">TUBITAK UZAY, Video & Audio Proc Grp, TR-06531 Ankara, Turkey</subfield> <subfield code="a">Onur, Duygu Oskay</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="b">conferencepaper</subfield> <subfield code="a">publication</subfield> </datafield> <datafield tag="542" ind1=" " ind2=" "> <subfield code="l">open</subfield> </datafield> <datafield tag="100" ind1=" " ind2=" "> <subfield code="u">TUBITAK UZAY, Video & Audio Proc Grp, TR-06531 Ankara, Turkey</subfield> <subfield code="a">Zubari, Unal</subfield> </datafield> <datafield tag="711" ind1=" " ind2=" "> <subfield code="a">18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010)</subfield> </datafield> <datafield tag="260" ind1=" " ind2=" "> <subfield code="c">2010-01-01</subfield> </datafield> <controlfield tag="005">20210316124105.0</controlfield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="n">doi</subfield> <subfield code="a">10.81043/aperta.92904</subfield> <subfield code="i">isVersionOf</subfield> </datafield> <datafield tag="909" ind1="C" ind2="O"> <subfield code="o">oai:zenodo.org:92905</subfield> <subfield code="p">user-tubitak-adresli-yayinlar</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="z">md5:8b97a08e268a47179dc0ea9d801acb76</subfield> <subfield code="s">177</subfield> <subfield code="u">https://aperta.ulakbim.gov.trrecord/92905/files/bib-a2d37fe5-c5aa-4581-9110-d4f29c74467e.txt</subfield> </datafield> <datafield tag="540" ind1=" " ind2=" "> <subfield code="u">http://www.opendefinition.org/licenses/cc-by</subfield> <subfield code="a">Creative Commons Attribution</subfield> </datafield> </record>
Görüntülenme | 29 |
İndirme | 10 |
Veri hacmi | 1.8 kB |
Tekil görüntülenme | 28 |
Tekil indirme | 10 |