Exploiting Relevance for Online Decision-Making in High-Dimensions

Turgay, Eralp; Bulucu, Cem; Tekin, Cem

doi:10.1109/TSP.2020.3048223

Yayınlanmış 1 Ocak 2021 | Sürüm v1

Dergi makalesi Açık

Exploiting Relevance for Online Decision-Making in High-Dimensions

1. Bilkent Univ, Dept Elect & Elect Engn, TR-06800 Ankara, Turkey

Many sequential decision-making tasks require choosing at each decision step the right action out of the vast set of possibilities by extracting actionable intelligence from high-dimensional data streams. Most of the times, the high-dimensionality of actions and data makes learning of the optimal actions by traditional learning methods impracticable. In this work, we investigate how to discover and leverage sparsity in actions and data to enable fast learning. As our learning model, we consider a structured contextual multi-armed bandit (CMAB) with high-dimensional arm (action) and context (data) sets, where the rewards depend only on a few relevant dimensions of the joint context-arm set, possibly in a non-linear way. We depart from the prior work by assuming a high-dimensional, continuum set of arms, and allow relevant context dimensions to vary for each arm. We propose a new online learning algorithm called CMAB with Relevance Learning (CMAB-RL). CMAB-RL enjoys a substantially improved regret bound compared to classical CMAB algorithms whose regrets depend on the number of dimensions d(x) and d(a) of the context and arm sets. Importantly, we showthat when the learner has prior knowledge on sparsity, given in terms of upper bounds (d) over bar (x) and (d) over bar (a) on the number of relevant context and arm dimensions, then CMABRL achieves (O) over tilde (T1-1/((2+2 (d) over barx+(d) over bara))) regret. Finally, we illustrate how CMAB algorithms can be used for optimal personalized blood glucose control in type 1 diabetes mellitus patients, and show that CMAB-RL outperforms other contextual MAB algorithms in this task.

Dosyalar

bib-04565dc0-242b-4b5b-8e22-1bc5d2cbaf9e.txt

Dosyalar (171 Bytes)

Ad	Boyut	Hepisini indir
bib-04565dc0-242b-4b5b-8e22-1bc5d2cbaf9e.txt md5:c3cd59682935551f5a77730a570b2ed5	171 Bytes	Ön İzleme İndir

	Tüm sürümler	Bu sürüm
Görüntüleme	28	28
İndirilenler	17	17
Veri miktarı	2.9 kB	2.9 kB

Exploiting Relevance for Online Decision-Making in High-Dimensions

Dosyalar

bib-04565dc0-242b-4b5b-8e22-1bc5d2cbaf9e.txt

Dosyalar (171 Bytes)

TÜBİTAK ULAKBİM

İLETİŞİM

Exploiting Relevance for Online Decision-Making in High-Dimensions

Oluşturanlar

Açıklama

Dosyalar

bib-04565dc0-242b-4b5b-8e22-1bc5d2cbaf9e.txt

Dosyalar (171 Bytes)