Algorithms for within-cluster searches using inverted files

Altingovde, Ismail Sengor; Can, Fazli; Ulusoy, Ozgur

doi:10.48623/aperta.37989

Yayınlanmış 1 Ocak 2006 | Sürüm v1

Konferans bildirisi Açık

Algorithms for within-cluster searches using inverted files

1. Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkey

Information retrieval over clustered document collections has two successive stages: first identifying the best-clusters and then the best-documents in these clusters that are most similar to the user query. In this paper, we assume that an inverted file over the entire document collection is used for the latter stage. We propose and evaluate algorithms for within-cluster searches, i.e., to integrate the best-clusters with the best-documents to obtain the final output including the highest ranked documents only from the best-clusters. Our experiments on a TREC collection including 210,158 documents with several query sets show that an appropriately selected integration algorithm based on the query length and system resources can significantly improve the query evaluation efficiency.

Dosyalar

bib-24fda802-2bdb-4b0e-8e73-e20edcdf61f7.txt

Dosyalar (165 Bytes)

Ad	Boyut	Hepisini indir
bib-24fda802-2bdb-4b0e-8e73-e20edcdf61f7.txt md5:208fe6650433161431606088347e3757	165 Bytes	Ön İzleme İndir

	Tüm sürümler	Bu sürüm
Görüntüleme	63	63
İndirilenler	18	18
Veri miktarı	3.0 kB	3.0 kB

Algorithms for within-cluster searches using inverted files

Dosyalar

bib-24fda802-2bdb-4b0e-8e73-e20edcdf61f7.txt

Dosyalar (165 Bytes)

TÜBİTAK ULAKBİM

İLETİŞİM

Algorithms for within-cluster searches using inverted files

Oluşturanlar

Açıklama

Dosyalar

bib-24fda802-2bdb-4b0e-8e73-e20edcdf61f7.txt

Dosyalar (165 Bytes)