Tactical UAV path optimization under radar threat using deep reinforcement learning

Alpdemir, M. Nedim

doi:10.1007/s00521-021-06702-3

Published 1 January 2022 | Version v1

Journal article, Open access

Alpdemir, M. Nedim¹

1. TÜBİTAK, Informatics and Information Security Research Center (BİLGEM), Gebze, Turkey

Most research on UAV path optimization in a Reinforcement Learning (RL) setting assumes closed spaces or urban areas as the operating environment. Path planning for a Tactical UAV (TUAV) under hostile radar tracking threat has peculiarities that distinguish it from these typical problems. In particular, (1) spatial regions delineated by threat probabilities may be legitimately penetrable under conditions that do not impair the survivability of the UAV, and (2) a TUAV is detectable by a radar via its Radar Cross Section (RCS), which is a function of multiple parameters such as the radar operating frequency, the shape of the UAV and, more importantly, the engagement geometry between the radar and the UAV. The latter implies that any maneuver performed by the UAV may change several of the angles that specify the engagement geometry. This paper proposes an RL-based solution to this complex problem by (1) implementing a Markov Decision Process (MDP) compliant RL environment that incorporates comprehensive probabilistic radar behavior models, and (2) integrating a core RL algorithm, namely DQN with Prioritized Experience Replay (DQN-PER), with a specific variant of transfer learning, namely learning from demonstrations (LfD), in a single framework. The framework demonstrates the utility of combining a core RL algorithm with a machine learning scheme to boost the performance of a learning agent and, more importantly, to alleviate the sparse reward problem.
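The abstract names prioritized experience replay and learning from demonstrations without giving implementation details, so the following is only a minimal illustrative sketch of how the two ideas might be combined in one replay buffer. It is not the paper's implementation. The class name `PrioritizedReplay`, the `demo_bonus` parameter, the eviction policy that protects demonstrations, and all default values are assumptions made for this example.

```python
import random


class PrioritizedReplay:
    """Proportional prioritized replay (list-based, O(n) sampling).

    Hypothetical sketch: demonstration transitions are never evicted and
    receive an extra priority bonus, so they keep being replayed.
    """

    def __init__(self, capacity, alpha=0.6, eps=1e-3, demo_bonus=1.0, seed=0):
        self.capacity = capacity
        self.alpha = alpha            # how strongly priorities skew sampling
        self.eps = eps                # keeps every priority strictly positive
        self.demo_bonus = demo_bonus  # assumed extra priority for demonstrations
        self.items = []               # list of (transition, is_demo)
        self.priorities = []
        self.rng = random.Random(seed)

    def add(self, transition, is_demo=False):
        # New transitions get the current maximum priority so that each one
        # has a good chance of being replayed before its TD error is known.
        priority = max(self.priorities, default=1.0)
        if len(self.items) >= self.capacity:
            # Evict the oldest agent-generated transition; keep demonstrations.
            for i, (_, demo) in enumerate(self.items):
                if not demo:
                    del self.items[i]
                    del self.priorities[i]
                    break
            else:
                return  # buffer holds only demonstrations; drop the newcomer
        self.items.append((transition, is_demo))
        self.priorities.append(priority)

    def sample(self, batch_size, beta=0.4):
        # Sampling probability is proportional to priority ** alpha.
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        n = len(self.items)
        idx = self.rng.choices(range(n), weights=probs, k=batch_size)
        # Importance-sampling weights correct the bias of non-uniform
        # sampling; here they are normalized by the largest weight in the batch.
        weights = [(n * probs[i]) ** (-beta) for i in idx]
        max_w = max(weights)
        weights = [w / max_w for w in weights]
        batch = [self.items[i][0] for i in idx]
        return idx, batch, weights

    def update_priorities(self, indices, td_errors):
        # After a learning step, priority tracks the magnitude of the TD error.
        for i, err in zip(indices, td_errors):
            bonus = self.demo_bonus if self.items[i][1] else 0.0
            self.priorities[i] = abs(err) + self.eps + bonus
```

In such a scheme the buffer would be seeded with demonstration transitions before training starts, which is one way an agent can encounter rewarding trajectories early when rewards are sparse. The Q-network update itself is omitted here.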

Files (156 Bytes)

Name	Size
bib-da3b3c80-91c2-4950-a65a-f5db8c45fad1.txt (md5:cbcc82e40c693d4637cb9b3f3e3cbfd2)	156 Bytes

	All versions	This version
Views	30	30
Downloads	12	12
Data volume	1.9 kB	1.9 kB

TÜBİTAK ULAKBİM
