Conference paper, Open Access

DIAGNOSIS OF DIABETES DISEASE USING MACHINE LEARNING METHODS IN AN IMBALANCED DIABETES DATASET

İsmail Buğra Bölükbaşı; Betül Yağmahan


JSON

{
  "conceptdoi": "10.48623/aperta.286135", 
  "conceptrecid": "286135", 
  "created": "2025-07-31T16:13:44.683492+00:00", 
  "doi": "10.48623/aperta.286136", 
  "files": [
    {
      "bucket": "ac0de865-bc5a-465f-af18-a64e3eba17ca", 
      "checksum": "md5:7263bbe549773ced2254b23b669f0165", 
      "key": "DIAGNOSIS OF DIABETES DISEASE USING MACHINE LEARNING METHODS IN AN IMBALANCED DIABETES DATASET.pdf", 
      "links": {
        "self": "https://aperta.ulakbim.gov.tr/api/files/ac0de865-bc5a-465f-af18-a64e3eba17ca/DIAGNOSIS%20OF%20DIABETES%20DISEASE%20USING%20MACHINE%20LEARNING%20METHODS%20IN%20AN%20IMBALANCED%20DIABETES%20DATASET.pdf"
      }, 
      "size": 270011, 
      "type": "pdf"
    }
  ], 
  "id": 286136, 
  "links": {
    "badge": "https://aperta.ulakbim.gov.tr/badge/doi/10.48623/aperta.286136.svg", 
    "bucket": "https://aperta.ulakbim.gov.tr/api/files/ac0de865-bc5a-465f-af18-a64e3eba17ca", 
    "conceptbadge": "https://aperta.ulakbim.gov.tr/badge/doi/10.48623/aperta.286135.svg", 
    "conceptdoi": "https://doi.org/10.48623/aperta.286135", 
    "doi": "https://doi.org/10.48623/aperta.286136", 
    "html": "https://aperta.ulakbim.gov.tr/record/286136", 
    "latest": "https://aperta.ulakbim.gov.tr/api/records/286136", 
    "latest_html": "https://aperta.ulakbim.gov.tr/record/286136"
  }, 
  "metadata": {
    "access_right": "open", 
    "access_right_category": "success", 
    "creators": [
      {
        "affiliation": "Yalova \u00dcniversitesi", 
        "name": "\u0130smail Bu\u011fra B\u00f6l\u00fckba\u015f\u0131", 
        "orcid": "0000-0002-9405-0900"
      }, 
      {
        "affiliation": "Bursa Uluda\u011f \u00dcniversitesi", 
        "name": "Bet\u00fcl Ya\u011fmahan", 
        "orcid": "0000-0003-1744-3062"
      }
    ], 
    "description": "<p>In recent years, the number of people with diabetes has been increasing daily. Diabetes is an important<br>\ndisease that can cause serious damage to the body in the future and even cause death if precautions are<br>\nnot taken. Early and accurate detection of ever-increasing diabetes is gaining more importance in the<br>\nmedical world. The number of studies using machine learning methods to diagnose diabetes has<br>\nincreased significantly in the literature.<br>\nIn this study, type-2 diabetes disease was classified using different data preprocessing and machine<br>\nlearning methods on real-world data taken from a public hospital in Turkey. Logistic regression, Naive<br>\nBayes, C4.5, and Random Forest classification models were used in the study. In the classification<br>\nmodels, the patient&#39;s age, gender, complete blood count, biochemistry, and hormone test results were<br>\nused as input variables, and the disease diagnosis made by specialist doctors was used as output variable.<br>\nIn total, 43 different variables were studied. When the dataset was examined, it was noticed that there<br>\nwas an imbalance between the classes in the target variable. In cases where there is a class imbalance,<br>\nthe classification models can make incorrect assignments to the classes. To eliminate the class imbalance<br>\nin the data set used in the study, three different resampling methods were used: random undersampling<br>\n(RUS), random oversampling (ROS), and synthetic minority oversampling (SMOTE).<br>\nThe performances of four different machine learning methods were compared on each of the original<br>\ntraining dataset, random undersampled training dataset, random oversampled training dataset, and<br>\nsynthetic minority oversampled training dataset. A total of 16 different scenarios were studied.<br>\nAs a result of the analysis of all scenarios, four combinations that give the best results were determined.<br>\nThese are Naive Bayes working with original training dataset, Random Forest working with random<br>\nundersampled training and synthetic minority oversampled training datasets, and C4.5 algorithm<br>\nworking with random oversampled training dataset. The algorithm that takes the first place among the<br>\nfour scenarios that show the best results is the Random Forest algorithm working with random<br>\nundersampled training dataset.</p>",
    "doi": "10.48623/aperta.286136", 
    "has_grant": false, 
    "imprint": {
      "isbn": "987-625-8246-29-2", 
      "place": "Adana", 
      "publisher": "IKSAD Publishing"
    }, 
    "keywords": [
      "Diabetes Diagnosis", 
      "Type-2 Diabetes", 
      "Machine Learning", 
      "Classification", 
      "Imbalanced Dataset", 
      "Resampling Methods"
    ], 
    "license": {
      "id": "cc-by-sa"
    }, 
    "meeting": {
      "dates": "October 09-11, 2022", 
      "place": "Adana", 
      "title": "CUKUROVA 9th INTERNATIONAL SCIENTIFIC RESEARCHES CONFERENCE", 
      "url": "https://www.iksadkongre.net/_files/ugd/262ebf_bea0ec7391e34e8884032d2c83362198.pdf"
    }, 
    "part_of": {
      "pages": "330-331", 
      "title": "ABSTRACT BOOK"
    }, 
    "publication_date": "2022-10-22", 
    "related_identifiers": [
      {
        "identifier": "10.48623/aperta.286135", 
        "relation": "isVersionOf", 
        "scheme": "doi"
      }
    ], 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "286136"
          }, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "286135"
          }
        }
      ]
    }, 
    "resource_type": {
      "subtype": "conferencepaper", 
      "title": "Konferans bildirisi", 
      "type": "publication"
    }, 
    "science_branches": [
      "Sa\u011fl\u0131k Bilimleri > T\u0131p > Dahili T\u0131p Bilimleri > \u0130\u00e7 Hastal\u0131klar\u0131 > Endokrinoloji ve Metabolizma Hastal\u0131klar\u0131"
    ], 
    "title": "DIAGNOSIS OF DIABETES DISEASE USING MACHINE LEARNING METHODS IN AN IMBALANCED DIABETES DATASET"
  }, 
  "owners": [
    2749
  ], 
  "revision": 2, 
  "stats": {
    "downloads": 0.0, 
    "unique_downloads": 0.0, 
    "unique_views": 0.0, 
    "version_downloads": 0.0, 
    "version_unique_downloads": 0.0, 
    "version_unique_views": 0.0, 
    "version_views": 0.0, 
    "version_volume": 0.0, 
    "views": 0.0, 
    "volume": 0.0
  }, 
  "updated": "2025-07-31T16:21:01.047620+00:00"
}
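
The abstract in the record above describes balancing the training set with random undersampling (RUS) and random oversampling (ROS) and then crossing four classifiers with four training sets. The following is a minimal sketch of that idea, not the authors' code: the function names, the toy data, and the seed are assumptions made for illustration, and SMOTE is only listed by name because it needs synthetic interpolation that is not shown here.

```python
import random
from collections import Counter
from itertools import product


def _group_by_class(rows, labels):
    """Collect rows into a dict keyed by class label."""
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    return by_class


def random_undersample(rows, labels, seed=0):
    """RUS: keep a random subset of each class, sized to the smallest class."""
    rng = random.Random(seed)
    by_class = _group_by_class(rows, labels)
    target = min(len(group) for group in by_class.values())
    pairs = [(row, label)
             for label, group in by_class.items()
             for row in rng.sample(group, target)]
    rng.shuffle(pairs)
    return [r for r, _ in pairs], [l for _, l in pairs]


def random_oversample(rows, labels, seed=0):
    """ROS: duplicate random rows of each class until it matches the largest class."""
    rng = random.Random(seed)
    by_class = _group_by_class(rows, labels)
    target = max(len(group) for group in by_class.values())
    pairs = []
    for label, group in by_class.items():
        extra = [rng.choice(group) for _ in range(target - len(group))]
        pairs.extend((row, label) for row in group + extra)
    rng.shuffle(pairs)
    return [r for r, _ in pairs], [l for _, l in pairs]


# The four classifiers and four training sets named in the abstract.
CLASSIFIERS = ["Logistic regression", "Naive Bayes", "C4.5", "Random Forest"]
TRAINING_SETS = ["original", "RUS", "ROS", "SMOTE"]
SCENARIOS = list(product(CLASSIFIERS, TRAINING_SETS))

# Hypothetical toy data: 8 negative and 2 positive cases, one feature each.
rows = [[float(i)] for i in range(10)]
labels = [0] * 8 + [1] * 2

rus_rows, rus_labels = random_undersample(rows, labels)
ros_rows, ros_labels = random_oversample(rows, labels)

print("RUS class counts:", dict(Counter(rus_labels)))
print("ROS class counts:", dict(Counter(ros_labels)))
print("Number of scenarios:", len(SCENARIOS))
```

Resampling should be applied to the training split only, as the abstract does, so that the test set keeps the original class distribution. In practice a library such as imbalanced-learn would typically replace the hand-written helpers.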