Konferans bildirisi Açık Erişim

Deep Neural Decision Forest for Acoustic Scene Classification

Sun, Jianyuan; Liu, Xubo; Mei, Xinhao; Zhao, Jinzheng; Plumbley, Mark D.; Kilic, Volkan; Wang, Wenwu


JSON

{
  "conceptdoi": "10.48623/aperta.259606", 
  "conceptrecid": "259606", 
  "created": "2023-07-29T09:59:32.203681+00:00", 
  "doi": "10.48623/aperta.259607", 
  "files": [
    {
      "bucket": "d1a18fb8-e0f5-4b0b-8264-e0f38ef820d6", 
      "checksum": "md5:ebeb551f753b7da10db35136af049ede", 
      "key": "bib-cb09ede4-9792-4f39-8a6d-fa305bf053be.txt", 
      "links": {
        "self": "https://aperta.ulakbim.gov.tr/api/files/d1a18fb8-e0f5-4b0b-8264-e0f38ef820d6/bib-cb09ede4-9792-4f39-8a6d-fa305bf053be.txt"
      }, 
      "size": 205, 
      "type": "txt"
    }
  ], 
  "id": 259607, 
  "links": {
    "badge": "https://aperta.ulakbim.gov.tr/badge/doi/10.48623/aperta.259607.svg", 
    "bucket": "https://aperta.ulakbim.gov.tr/api/files/d1a18fb8-e0f5-4b0b-8264-e0f38ef820d6", 
    "conceptbadge": "https://aperta.ulakbim.gov.tr/badge/doi/10.48623/aperta.259606.svg", 
    "conceptdoi": "https://doi.org/10.48623/aperta.259606", 
    "doi": "https://doi.org/10.48623/aperta.259607", 
    "html": "https://aperta.ulakbim.gov.tr/record/259607", 
    "latest": "https://aperta.ulakbim.gov.tr/api/records/259607", 
    "latest_html": "https://aperta.ulakbim.gov.tr/record/259607"
  }, 
  "metadata": {
    "access_right": "open", 
    "access_right_category": "success", 
    "communities": [
      {
        "id": "tubitak-destekli-proje-yayinlari"
      }
    ], 
    "creators": [
      {
        "name": "Sun, Jianyuan"
      }, 
      {
        "affiliation": "Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Surrey, England", 
        "name": "Liu, Xubo"
      }, 
      {
        "affiliation": "Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Surrey, England", 
        "name": "Mei, Xinhao"
      }, 
      {
        "affiliation": "Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Surrey, England", 
        "name": "Zhao, Jinzheng"
      }, 
      {
        "affiliation": "Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Surrey, England", 
        "name": "Plumbley, Mark D."
      }, 
      {
        "affiliation": "Izmir Katip Celebi Univ, Dept Elect & Elect Engn, Izmir, Turkey", 
        "name": "Kilic, Volkan"
      }, 
      {
        "affiliation": "Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Surrey, England", 
        "name": "Wang, Wenwu"
      }
    ], 
    "description": "Acoustic scene classification (ASC) aims to classify an audio clip based on the characteristic of the recording environment. In this regard, deep learning based approaches have emerged as a useful tool for ASC problems. Conventional approaches to improving the classification accuracy include integrating auxiliary methods such as attention mechanism, pre-trained models and ensemble multiple sub-networks. However, due to the complexity of audio clips captured from different environments, it is difficult to distinguish their categories without using any auxiliary methods for existing deep learning models using only a single classifier. In this paper, we propose a novel approach for ASC using deep neural decision forest (DNDF). DNDF combines a fixed number of convolutional layers and a decision forest as the final classifier. The decision forest consists of a fixed number of decision tree classifiers, which have been shown to offer better classification performance than a single classifier in some datasets. In particular, the decision forest differs substantially from traditional random forests as it is stochastic, differentiable, and capable of using the back-propagation to update and learn feature representations in neural network. Experimental results on the DCASE2019 and ESC-50 datasets demonstrate that our proposed DNDF method improves the ASC performance in terms of classification accuracy and shows competitive performance as compared with state-of-the-art baselines.", 
    "doi": "10.48623/aperta.259607", 
    "has_grant": false, 
    "license": {
      "id": "cc-by"
    }, 
    "meeting": {
      "title": "2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022)"
    }, 
    "publication_date": "2022-01-01", 
    "related_identifiers": [
      {
        "identifier": "10.48623/aperta.259606", 
        "relation": "isVersionOf", 
        "scheme": "doi"
      }
    ], 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "259607"
          }, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "259606"
          }
        }
      ]
    }, 
    "resource_type": {
      "subtype": "conferencepaper", 
      "title": "Konferans bildirisi", 
      "type": "publication"
    }, 
    "science_branches": [
      "Di\u011fer"
    ], 
    "title": "Deep Neural Decision Forest for Acoustic Scene Classification"
  }, 
  "owners": [
    1
  ], 
  "revision": 1, 
  "stats": {
    "downloads": 2.0, 
    "unique_downloads": 2.0, 
    "unique_views": 13.0, 
    "version_downloads": 2.0, 
    "version_unique_downloads": 2.0, 
    "version_unique_views": 13.0, 
    "version_views": 13.0, 
    "version_volume": 410.0, 
    "views": 13.0, 
    "volume": 410.0
  }, 
  "updated": "2023-07-29T09:59:32.267764+00:00"
}
13
2
görüntülenme
indirilme
Tüm sürümler Bu sürüm
Görüntülenme 1313
İndirme 22
Veri hacmi 410 Bytes410 Bytes
Tekil görüntülenme 1313
Tekil indirme 22

Alıntı yap