Title: A synchronous checkpointing protocol for mobile distributed systems: probabilistic approach

Authors: Lalit Kumar Awasthi, P. Kumar

Addresses: Department of Computer Science and Engineering, National Institute of Technology, Hamirpur 177005, India. ' Computer Center, National Institute of Technology, Hamirpur 177005, India

Abstract: Coordinated checkpointing is a method that minimises number of processes to checkpoint for an initiation. It may require blocking of processes, extra synchronisation messages or useless checkpoints. We propose a minimum process coordinated checkpointing algorithm where the number of useless checkpoints and blocking are reduced using a probabilistic approach that computes an interacting set of processes on checkpoint initiation. A process checkpoints if the probability that it will get a checkpoint request in current initiation is high. A few processes may be blocked but they can continue their normal computation and may send messages. We also modified methodology to maintain exact dependencies.

Keywords: fault tolerance; distributed systems security; mobile systems; coordinated checkpointing; information security; computer security; systems assurance.

DOI: 10.1504/IJICS.2007.013957

International Journal of Information and Computer Security, 2007 Vol.1 No.3, pp.298 - 314

Published online: 05 Jun 2007 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article