A Cultural Algorithm for POMDPs from Stochastic Inventory Control

Prestwich, S. D.; Tarim, S. A.; Rossi, R.; Hnich, B.

doi:10.81043/aperta.42035

Yayınlanmış 1 Ocak 2008 | Sürüm v1

Konferans bildirisi Açık

A Cultural Algorithm for POMDPs from Stochastic Inventory Control

1. Cork Constraint Computat Ctr, Cork, Ireland
2. Hacettepe Univ, Dept Management, Ankara, Turkey
3. Izmir Univ Econ, Fac Comp Sci, Izmir, Turkey

Reinforcement Learning algorithms such as SARSA with an eligibility trace, and Evolutionary Computation methods such as genetic algorithms, are competing approaches to solving Partially Observable Markov Decision Processes (POMDPs) which occur in many fields of Artificial Intelligence. A powerful form of evolutionary algorithm that has not previously been applied to POMDPs is the cultural algorithm, in which evolving agents share knowledge in a belief space that is used to guide their evolution. We describe a cultural algorithm for POMDPs that hybridises SARSA with a noisy genetic algorithm, and inherits the latter's convergence properties. Its belief space is a common set of state-action values that are updated during genetic exploration, and conversely used to modify chromosomes. We use it to solve problems from stochastic inventory control by finding memoryless policies for nondeterministic POMDPs. Neither SARSA nor the genetic algorithm dominates the other on these problems, but the cultural algorithm outperforms the genetic algorithm, and on highly non-Markovian instances also outperforms SARSA.

Dosyalar

bib-22aa6c78-8528-498b-a2d7-b365a285c40f.txt

Dosyalar (157 Bytes)

Ad	Boyut	Hepisini indir
bib-22aa6c78-8528-498b-a2d7-b365a285c40f.txt md5:0cfdbb6e441e2e11430dca122f6f5602	157 Bytes	Ön İzleme İndir

	Tüm sürümler	Bu sürüm
Görüntüleme	54	54
İndirilenler	15	15
Veri miktarı	2.4 kB	2.4 kB

A Cultural Algorithm for POMDPs from Stochastic Inventory Control

Dosyalar

bib-22aa6c78-8528-498b-a2d7-b365a285c40f.txt

Dosyalar (157 Bytes)

TÜBİTAK ULAKBİM

İLETİŞİM

A Cultural Algorithm for POMDPs from Stochastic Inventory Control

Oluşturanlar

Açıklama

Dosyalar

bib-22aa6c78-8528-498b-a2d7-b365a285c40f.txt

Dosyalar (157 Bytes)