Published January 1, 2021 | Version v1
Journal article Open

Prediction of tumour pathological subtype from genomic profile using sparse logistic regression with random effects

  • 1. Ankara Univ, Dept Stat, Ankara, Turkey
  • 2. Prince Sattam Bin Abdulaziz Univ, Coll Sci & Humanitarian Studies, Dept Math, Al Kharj, Saudi Arabia
  • 3. St Jamess Univ Leeds, Leeds Inst Med Res, Leeds, W Yorkshire, England
  • 4. Univ Leeds, Dept Stat, Leeds, W Yorkshire, England

Description

The purpose of this study is to highlight the application of sparse logistic regression models in dealing with prediction of tumour pathological subtypes based on lung cancer patients' genomic information. We consider sparse logistic regression models to deal with the high dimensionality and correlation between genomic regions. In a hierarchical likelihood (HL) method, it is assumed that the random effects follow a normal distribution and its variance is assumed to follow a gamma distribution. This formulation considers ridge and lasso penalties as special cases. We extend the HL penalty to include a ridge penalty (called 'HLnet') in a similar principle of the elastic net penalty, which is constructed from lasso penalty. The results indicate that the HL penalty creates more sparse estimates than lasso penalty with comparable prediction performance, while HLnet and elastic net penalties have the best prediction performance in real data. We illustrate the methods in a lung cancer study.

Files

bib-d678d957-74af-4525-9652-2400ea7332fb.txt

Files (222 Bytes)

Name Size Download all
md5:d575bfdcea0ea1b0ee3e4525bcaf1aad
222 Bytes Preview Download