Published January 1, 2021 | Version v1
Journal article
Open
Prediction of tumour pathological subtype from genomic profile using sparse logistic regression with random effects
- 1. Ankara Univ, Dept Stat, Ankara, Turkey
- 2. Prince Sattam Bin Abdulaziz Univ, Coll Sci & Humanitarian Studies, Dept Math, Al Kharj, Saudi Arabia
- 3. St Jamess Univ Leeds, Leeds Inst Med Res, Leeds, W Yorkshire, England
- 4. Univ Leeds, Dept Stat, Leeds, W Yorkshire, England
Description
This study applies sparse logistic regression models to the prediction of tumour pathological subtypes from the genomic information of lung cancer patients. Sparse models are used to handle the high dimensionality of the data and the correlation between genomic regions. In the hierarchical likelihood (HL) method, the random effects are assumed to follow a normal distribution whose variance follows a gamma distribution. This formulation includes the ridge and lasso penalties as special cases. We extend the HL penalty with a ridge penalty (the resulting penalty is called 'HLnet'), on the same principle by which the elastic net penalty is constructed from the lasso penalty. The results indicate that the HL penalty produces sparser estimates than the lasso penalty with comparable prediction performance, while the HLnet and elastic net penalties give the best prediction performance on real data. We illustrate the methods in a lung cancer study.
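To make the relationship between the lasso, ridge and elastic net penalties concrete, the following is a minimal sketch of penalized logistic regression fitted by proximal gradient descent. It is not the authors' method: the HL and HLnet penalties are not specified in this description and are not implemented here. The function names, the glmnet-style parameterization `lam * (alpha * |b|_1 + (1 - alpha) / 2 * |b|_2^2)`, the tuning values and the synthetic data are all assumptions chosen for illustration.

```python
import numpy as np

def soft_threshold(z, t):
    """Proximal operator of t * |z|: shrinks towards zero, exact zeros when |z| <= t."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def fit_penalized_logistic(X, y, lam=0.1, alpha=0.5, n_iter=500):
    """Hypothetical helper: logistic regression with an elastic net penalty.

    alpha = 1 gives the lasso, alpha = 0 gives ridge, values in between
    give the elastic net. The intercept is left unpenalized.
    """
    n, p = X.shape
    beta = np.zeros(p)
    intercept = 0.0
    # Step size from an upper bound on the curvature of the smooth part
    # (mean logistic loss plus the ridge term).
    X_aug = np.hstack([X, np.ones((n, 1))])
    lipschitz = np.linalg.norm(X_aug, 2) ** 2 / (4.0 * n) + lam * (1.0 - alpha)
    step = 1.0 / lipschitz
    for _ in range(n_iter):
        eta = np.clip(X @ beta + intercept, -30.0, 30.0)
        prob = 1.0 / (1.0 + np.exp(-eta))
        resid = prob - y
        grad = X.T @ resid / n + lam * (1.0 - alpha) * beta
        intercept -= step * resid.mean()
        # Gradient step on the smooth part, then soft-threshold for the L1 part.
        beta = soft_threshold(beta - step * grad, step * lam * alpha)
    return intercept, beta

# Synthetic stand-in for a high-dimensional genomic profile (assumed, not real data):
# 200 samples, 50 features, only the first 3 features carry signal.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 50))
true_beta = np.zeros(50)
true_beta[:3] = 2.0
y = (rng.random(200) < 1.0 / (1.0 + np.exp(-(X @ true_beta)))).astype(float)

_, beta_lasso = fit_penalized_logistic(X, y, lam=0.1, alpha=1.0)
_, beta_ridge = fit_penalized_logistic(X, y, lam=0.1, alpha=0.0)
_, beta_enet = fit_penalized_logistic(X, y, lam=0.1, alpha=0.5)

n_zero_lasso = int(np.sum(beta_lasso == 0))
n_zero_ridge = int(np.sum(beta_ridge == 0))
n_zero_enet = int(np.sum(beta_enet == 0))
```

Comparing `n_zero_lasso`, `n_zero_enet` and `n_zero_ridge` shows why sparsity is discussed separately from prediction performance: the L1 term sets coefficients exactly to zero, whereas the ridge term only shrinks them.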
Files
bib-d678d957-74af-4525-9652-2400ea7332fb.txt