Published January 1, 2021 | Version v1
Journal article Open

Machine Learning Models for Classification of Cushing's Syndrome Using Retrospective Data

  • 1. TUBITAK BILGEM Informat & Informat Secur Res Ctr, TR-41470 Kocaeli, Turkey
  • 2. Izmir Katip Celebi Univ, Fac Med, Dept Endocrinol, TR-35620 Izmir, Turkey
  • 3. Dokuz Eylul Univ, Dept Internal Med, Div Endocrinol & Metab, Med Sch, TR-35340 Izmir, Turkey
  • 4. Fraunhofer Inst Mfg Engn & Automat IPA, Dept Biomechatron Syst, Nobelstr 12, D-70569 Stuttgart, Germany

Description

Accurate classification of Cushing's Syndrome (CS) plays a critical role in providing the early and correct diagnosis of CS that may facilitate treatment and improve patient outcomes. Diagnosis of CS is a complex process, which requires careful and concurrent interpretation of signs and symptoms, multiple biochemical test results, and findings of medical imaging by physicians with a high degree of specialty and knowledge to make correct judgments. In this article, we explore the state of the art machine learning algorithms to demonstrate their potential as a clinical decision support system to analyze and classify CS to facilitate the diagnosis, prognosis, and treatment of CS. Prominent algorithms are compared using nested cross-validation and various class comparison strategies including multiclass, one vs. all, and one vs. one binary classification. Our findings show that Random Forest (RF) algorithm is most suitable for the classification of CS. We demonstrate that the proposed approach can classify CS with an average accuracy of 92% and an average F1 score of 91.5%, depending on the class comparison strategy and selected features. RF-based one vs. all binary classification model achieves sensitivity of 97.6%, precision of 91.1%, and specificity of 87.1% to discriminate CS from non-CS on the test dataset. RF-based multiclass classification model achieves average per class sensitivity of 91.8%, average per class specificity of 97.1%, and average per class precision of 92.1% to classify different subtypes of CS on the test dataset. Clinical performance evaluation suggests that the developed models can help improve physicians' judgment in diagnosing CS.

Files

bib-af8d5c70-7b93-4538-a313-1e84712c055e.txt

Files (217 Bytes)

Name Size Download all
md5:2072bd6d0b860f3a1b547c353f690bfe
217 Bytes Preview Download