A Fault Tolerance Management Framework for Wireless Sensor Networks

Iman Saleh1, Mohamed Eltoweissy1, Adnan Agbaria2, and Hesham El-Sayed3
1. Bradley Department of Electrical and Computer Engineering, Virginia Tech
2. IBM Haifa Research Laboratory, Haifa
3. College of Information Technology, United Arab Emirates University
Abstract—Wireless Sensor Networks (WSNs) have the potential to significantly enhance our ability to monitor and interact with our physical environment. Realizing fault-tolerant operation is critical to the success of WSNs. The main challenge is providing fault tolerance (FT) while conserving the limited resources of the network. Many schemes have been proposed in this area. Our main contribution in this paper is a general framework for fault tolerance in WSNs. The proposed framework can be used to guide the design and development of FT solutions and to evaluate existing ones. We present a comparative study of the existing schemes and identify potential enhancements. A primary module of the framework is the learning and refinement module, which enables an FT solution to be adaptive and self-configurable based on changes in network conditions. We view this as vital to resource-constrained and highly dynamic WSNs. To the best of our knowledge, we are the first to propose the implementation of such a module in FT solutions for WSNs.

Index Terms—Networked sensor systems, Fault tolerance, Checkpoint/Restart

