Learning the Pareto Set Under Incomplete Preferences: Pure Exploration in Vector Bandits

Karagozlu, Efe Mert; Yildirim, Yasar Cahit; Ararat, Cagin; Tekin, Cem

doi:10.48623/aperta.279431

Published January 1, 2024 | Version v1

Conference paper | Open access

1. Bilkent Univ, Ankara, Turkiye

We study pure exploration in bandit problems with vector-valued rewards, where the goal is to (approximately) identify the Pareto set of arms given incomplete preferences induced by a polyhedral convex cone. We address the open problem of designing sample-efficient learning algorithms for such problems. We propose Pareto Vector Bandits (PaVeBa), an adaptive elimination algorithm that nearly matches the gap-dependent and worst-case lower bounds on the sample complexity of (ε, δ)-PAC Pareto set identification. Finally, we provide an in-depth numerical investigation of PaVeBa and its heuristic variants by comparing them with state-of-the-art multi-objective and vector optimization algorithms on several real-world datasets with conflicting objectives.
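To make the target of the identification problem concrete, the following is a minimal sketch of how a Pareto set under a cone-induced partial order could be computed when the mean reward vectors are known exactly. It is not PaVeBa and involves no sampling or elimination; it only illustrates the set that such an algorithm tries to learn. The representation of the cone as C = {x : Wx ≥ 0}, the dominance convention (arm i is dominated when some different mean vector μ_j satisfies μ_j − μ_i ∈ C), and the names `in_cone` and `pareto_set` are assumptions made for illustration and are not taken from this record.

```python
def in_cone(W, x, tol=1e-12):
    """Check whether x lies in the polyhedral cone C = {x : W x >= 0}.

    W is a list of rows (assumed representation of the cone).
    """
    return all(sum(w_k * x_k for w_k, x_k in zip(row, x)) >= -tol for row in W)


def pareto_set(means, W, tol=1e-12):
    """Return indices of arms whose mean vectors are not dominated under C.

    Assumed convention: arm i is dominated by arm j when the mean vectors
    differ and means[j] - means[i] lies in C.
    """
    optimal = []
    for i, mu_i in enumerate(means):
        dominated = False
        for j, mu_j in enumerate(means):
            diff = [b - a for a, b in zip(mu_i, mu_j)]
            differs = any(abs(d) > tol for d in diff)
            if i != j and differs and in_cone(W, diff, tol):
                dominated = True
                break
        if not dominated:
            optimal.append(i)
    return optimal


# Hypothetical mean reward vectors for four arms with two objectives.
means = [[1.0, 0.0], [0.0, 1.0], [0.6, 0.6], [0.2, 0.2]]

# Positive orthant: the usual componentwise (multi-objective) order.
orthant = [[1.0, 0.0], [0.0, 1.0]]

# A wider cone: more pairs of arms become comparable.
wider = [[1.0, 0.8], [0.8, 1.0]]

print(pareto_set(means, orthant))
print(pareto_set(means, wider))
```

With the positive orthant, only the last arm is dominated; widening the cone makes more arms comparable, so the Pareto set shrinks to the single balanced arm. This is one way to see how the choice of cone encodes how incomplete the preferences are.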

Files

Name	Size
bib-87c85442-1a50-4a48-be5f-e8e6f8661b49.txt (md5:187a213f24d74497cf648dda6f61b450)	227 Bytes

	All versions	This version
Views	7	7
Downloads	2	2
Data volume	454 Bytes	454 Bytes
