Published January 1, 2006 | Version v1
Conference paper Open

Effect of inverted index partitioning schemes on performance of query processing in parallel text retrieval systems

Description

Shared-nothing, parallel text retrieval systems require an inverted index, representing a document collection, to be partitioned among a number of processors. In general, the index can be partitioned based on either the terms or documents in the collection, and the way the partitioning is (lone greatly affects the query processing performance of the parallel system. In this work, we investigate the effect of these two index partitioning schemes on query processing. We conduct experiments on a 32-node PC cluster, considering the case where index is completely stored in disk. Performance results are reported for a large (30 GB) document collection using an MPI-based parallel query processing implementation.

Files

bib-37f25d72-762e-4441-8402-fbe1ba7436c8.txt

Files (224 Bytes)

Name Size Download all
md5:8e1b903f34d56f8396ea889f71cbfd10
224 Bytes Preview Download