Authors: John Casey, Wanlei Zhou
Addresses: UXC Applications Development, Hobart, TAS 7001, Australia. ' School of Information Technology, Deakin University, Victoria, Australia
Abstract: This paper describes the design and evaluation of a peer-to-peer indexing system to integrate the resources of local document database systems into a globally addressable index using a distributed hash table. The salient feature of the indexing systems design is the efficient dissemination of term-document indices using a combination of duplicate elimination, ring based forwarding and conventional techniques such as aggressive index pruning, and batching. Together these indexing strategies help to reduce the number of RPC operations required to locate the nodes responsible for a section of the index, the bandwidth utilisation and the latency of the indexing service.
Keywords: DHT; distributed hash tables; document indexing; difference encoding; P2P; peer-to-peer indexing; bandwidth reduction; latency.
International Journal of High Performance Computing and Networking, 2009 Vol.6 No.2, pp.119 - 129
Available online: 26 Jul 2009Full-text access for editors Access for subscribers Purchase this article Comment on this article