Konferans bildirisi Açık Erişim

Deep Neural Decision Forest for Acoustic Scene Classification

Sun, Jianyuan; Liu, Xubo; Mei, Xinhao; Zhao, Jinzheng; Plumbley, Mark D.; Kilic, Volkan; Wang, Wenwu


MARC21 XML

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Liu, Xubo</subfield>
    <subfield code="u">Univ Surrey, Ctr Vis Speech &amp; Signal Proc CVSSP, Surrey, England</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Mei, Xinhao</subfield>
    <subfield code="u">Univ Surrey, Ctr Vis Speech &amp; Signal Proc CVSSP, Surrey, England</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Zhao, Jinzheng</subfield>
    <subfield code="u">Univ Surrey, Ctr Vis Speech &amp; Signal Proc CVSSP, Surrey, England</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Plumbley, Mark D.</subfield>
    <subfield code="u">Univ Surrey, Ctr Vis Speech &amp; Signal Proc CVSSP, Surrey, England</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Kilic, Volkan</subfield>
    <subfield code="u">Izmir Katip Celebi Univ, Dept Elect &amp; Elect Engn, Izmir, Turkey</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Wang, Wenwu</subfield>
    <subfield code="u">Univ Surrey, Ctr Vis Speech &amp; Signal Proc CVSSP, Surrey, England</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-tubitak-destekli-proje-yayinlari</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="a">Creative Commons Attribution</subfield>
    <subfield code="u">http://www.opendefinition.org/licenses/cc-by</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.48623/aperta.259606</subfield>
    <subfield code="n">doi</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.48623/aperta.259607</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Deep Neural Decision Forest for Acoustic Scene Classification</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Sun, Jianyuan</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="o">oai:aperta.ulakbim.gov.tr:259607</subfield>
    <subfield code="p">user-tubitak-destekli-proje-yayinlari</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="2">opendefinition.org</subfield>
    <subfield code="a">cc-by</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2022-01-01</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="u">https://aperta.ulakbim.gov.trrecord/259607/files/bib-cb09ede4-9792-4f39-8a6d-fa305bf053be.txt</subfield>
    <subfield code="z">md5:ebeb551f753b7da10db35136af049ede</subfield>
    <subfield code="s">205</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <controlfield tag="005">20230729095932.0</controlfield>
  <controlfield tag="001">259607</controlfield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">conferencepaper</subfield>
  </datafield>
  <datafield tag="711" ind1=" " ind2=" ">
    <subfield code="a">2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022)</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">Acoustic scene classification (ASC) aims to classify an audio clip based on the characteristic of the recording environment. In this regard, deep learning based approaches have emerged as a useful tool for ASC problems. Conventional approaches to improving the classification accuracy include integrating auxiliary methods such as attention mechanism, pre-trained models and ensemble multiple sub-networks. However, due to the complexity of audio clips captured from different environments, it is difficult to distinguish their categories without using any auxiliary methods for existing deep learning models using only a single classifier. In this paper, we propose a novel approach for ASC using deep neural decision forest (DNDF). DNDF combines a fixed number of convolutional layers and a decision forest as the final classifier. The decision forest consists of a fixed number of decision tree classifiers, which have been shown to offer better classification performance than a single classifier in some datasets. In particular, the decision forest differs substantially from traditional random forests as it is stochastic, differentiable, and capable of using the back-propagation to update and learn feature representations in neural network. Experimental results on the DCASE2019 and ESC-50 datasets demonstrate that our proposed DNDF method improves the ASC performance in terms of classification accuracy and shows competitive performance as compared with state-of-the-art baselines.</subfield>
  </datafield>
</record>
13
2
görüntülenme
indirilme
Tüm sürümler Bu sürüm
Görüntülenme 1313
İndirme 22
Veri hacmi 410 Bytes410 Bytes
Tekil görüntülenme 1313
Tekil indirme 22

Alıntı yap