Konferans bildirisi Açık Erişim

SPEECH DETECTION ON BROADCAST AUDIO

Zubari, Unal; Ozan, Ezgi Can; Acar, Banu Oskay; Ciloglu, Tolga; Esen, Ersin; Ates, Tugrul K.; Onur, Duygu Oskay


MARC21 XML

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">SPEECH DETECTION ON BROADCAST AUDIO</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.81043/aperta.92905</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <controlfield tag="001">92905</controlfield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-tubitak-adresli-yayinlar</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">Speech boundary detection contributes to performance of speech based applications such as speech recognition and speaker recognition. Speech boundary detector implemented in this study works on broadcast audio as a pre-processor module of a keyword spotter. Speech boundary detection is handled in 3 steps. At first step, audio data is segmented into homogeneous regions in an unsupervised manner. After an ACTIVITY/NON-ACTIVITY decision is made for each region, ACTIVITY regions are classified as Speech/Non-speech via Gaussian Mixture Model (GMM) based classification. GMM's are trained using a novel feature, Spectral Flow Direction (SFD), and an improved multi-band harmonicity feature in addition to widely used Mel Frequency Cepstral Coefficients (MFCC's).</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="2">opendefinition.org</subfield>
    <subfield code="a">cc-by</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Ozan, Ezgi Can</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">TUBITAK UZAY, Video &amp; Audio Proc Grp, TR-06531 Ankara, Turkey</subfield>
    <subfield code="a">Acar, Banu Oskay</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">METU, Dept Elect &amp; Elect Engn, TR-06531 Ankara, Turkey</subfield>
    <subfield code="a">Ciloglu, Tolga</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">TUBITAK UZAY, Video &amp; Audio Proc Grp, TR-06531 Ankara, Turkey</subfield>
    <subfield code="a">Esen, Ersin</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Ates, Tugrul K.</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">TUBITAK UZAY, Video &amp; Audio Proc Grp, TR-06531 Ankara, Turkey</subfield>
    <subfield code="a">Onur, Duygu Oskay</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="b">conferencepaper</subfield>
    <subfield code="a">publication</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">TUBITAK UZAY, Video &amp; Audio Proc Grp, TR-06531 Ankara, Turkey</subfield>
    <subfield code="a">Zubari, Unal</subfield>
  </datafield>
  <datafield tag="711" ind1=" " ind2=" ">
    <subfield code="a">18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010)</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2010-01-01</subfield>
  </datafield>
  <controlfield tag="005">20210316124105.0</controlfield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="a">10.81043/aperta.92904</subfield>
    <subfield code="i">isVersionOf</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="o">oai:zenodo.org:92905</subfield>
    <subfield code="p">user-tubitak-adresli-yayinlar</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="z">md5:8b97a08e268a47179dc0ea9d801acb76</subfield>
    <subfield code="s">177</subfield>
    <subfield code="u">https://aperta.ulakbim.gov.trrecord/92905/files/bib-a2d37fe5-c5aa-4581-9110-d4f29c74467e.txt</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">http://www.opendefinition.org/licenses/cc-by</subfield>
    <subfield code="a">Creative Commons Attribution</subfield>
  </datafield>
</record>
29
10
görüntülenme
indirilme
Görüntülenme 29
İndirme 10
Veri hacmi 1.8 kB
Tekil görüntülenme 28
Tekil indirme 10

Alıntı yap