Distortions in Judged Spatial Relations in Large Language Models

Fulman, Nir; Memduhoglu, Abdulkadir; Zipf, Alexander

doi:10.1080/00330124.2024.2372792

Yayınlanmış 1 Ocak 2024 | Sürüm v1

Dergi makalesi Açık

Distortions in Judged Spatial Relations in Large Language Models

1. Heidelberg Univ, GISci Geoinformat Chair, Dept Geog, D-69120 Heidelberg, Germany

We present a benchmark for assessing the capability of large language models (LLMs) to discern intercardinal directions between geographic locations and apply it to three prominent LLMs: GPT-3.5, GPT-4, and Llama-2. This benchmark specifically evaluates whether LLMs exhibit a hierarchical spatial bias similar to humans, where judgments about individual locations' spatial relationships are influenced by the perceived relationships of the larger groups that contain them. To investigate this, we formulated fourteen questions focusing on well-known U.S. cities. Seven questions were designed to challenge the LLMs with scenarios potentially influenced by the orientation of larger geographical units, such as states or countries, whereas the remaining seven targeted locations were less susceptible to such hierarchical categorization. Among the tested models, GPT-4 exhibited superior performance with 55 percent accuracy, followed by GPT-3.5 at 47 percent and Llama-2 at 45 percent. The models showed significantly reduced accuracy on tasks with suspected hierarchical bias. For example, GPT-4's accuracy dropped to 33 percent on these tasks, compared to 86 percent on others. The models identified the nearest cardinal direction in most cases, however, reflecting their associative learning mechanism, thereby embodying human-like misconceptions. We discuss avenues for improving the spatial reasoning capabilities of LLMs.

Dosyalar

bib-2980ee2b-fc82-4027-9405-fe080a72a119.txt

Dosyalar (154 Bytes)

Ad	Boyut	Hepisini indir
bib-2980ee2b-fc82-4027-9405-fe080a72a119.txt md5:35ec73adc46f58f1fe1216430b07170d	154 Bytes	Ön İzleme İndir

	Tüm sürümler	Bu sürüm
Görüntüleme	11	11
İndirilenler	9	9
Veri miktarı	1.4 kB	1.4 kB

Distortions in Judged Spatial Relations in Large Language Models

Dosyalar

bib-2980ee2b-fc82-4027-9405-fe080a72a119.txt

Dosyalar (154 Bytes)

TÜBİTAK ULAKBİM

İLETİŞİM

Distortions in Judged Spatial Relations in Large Language Models

Oluşturanlar

Açıklama

Dosyalar

bib-2980ee2b-fc82-4027-9405-fe080a72a119.txt

Dosyalar (154 Bytes)