Title: Reducing the bandwidth requirements of P2P keyword indexing

Authors: John Casey, Wanlei Zhou

Addresses: UXC Applications Development, Hobart, TAS 7001, Australia; School of Information Technology, Deakin University, Victoria, Australia

Abstract: This paper describes the design and evaluation of a peer-to-peer indexing system that integrates the resources of local document database systems into a globally addressable index using a distributed hash table. The salient feature of the indexing system's design is the efficient dissemination of term-document indices using a combination of duplicate elimination, ring-based forwarding, and conventional techniques such as aggressive index pruning and batching. Together, these indexing strategies help to reduce the number of RPC operations required to locate the nodes responsible for a section of the index, as well as the bandwidth utilisation and the latency of the indexing service.

Keywords: DHT; distributed hash tables; document indexing; difference encoding; P2P; peer-to-peer indexing; bandwidth reduction; latency.

DOI: 10.1504/IJHPCN.2009.027462

International Journal of High Performance Computing and Networking, 2009 Vol.6 No.2, pp.119 - 129

Published online: 26 Jul 2009
