Mansouri, Houssem and Aliouat, Makhlouf and Badache, Nadjib and Khan, Al Sakib Pathan (2015) A non-blocking coordinated checkpointing algorithm for message-passing systems. In: International Conference on Intelligent Information Processing, Security and Advanced Communication (IPAC 2015), 23rd–25th November 2015, Batna, Algeria.
Abstract
This paper proposes an efficient non-blocking coordinated checkpointing algorithm for distributed message-passing systems that uses transitive dependency information. The processes synchronize their checkpointing activities so that a globally consistent set of checkpoints is always maintained in the system. The algorithm does not require channels to be FIFO (First-In, First-Out) and ensures that each checkpoint taken is part of a consistent global checkpoint. Our scheme also records a minimum number of checkpoints by making sure that only a few processes are required to take checkpoints in any execution, and it incurs a much lower control-message cost than other related works.
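The role of transitive dependency information can be illustrated with a minimal sketch. This is not the algorithm from the paper; it only shows the general idea behind minimum-process checkpointing: when an initiator takes a checkpoint, any process whose messages it has received since its last checkpoint must also checkpoint, and so on transitively. The function name `processes_to_checkpoint`, the `depends_on` map, and the process labels are hypothetical and chosen for illustration only.

```python
from collections import deque


def processes_to_checkpoint(initiator, depends_on):
    """Return the processes that must checkpoint together with `initiator`.

    `depends_on[p]` is assumed to hold the processes from which `p` has
    received at least one message since its last checkpoint (a hypothetical
    direct-dependency map, not a structure defined in the paper).
    """
    must_checkpoint = {initiator}
    queue = deque([initiator])
    while queue:
        p = queue.popleft()
        # Every sender that p depends on must record the send event too,
        # otherwise p's checkpoint would contain an orphan message.
        for q in depends_on.get(p, set()):
            if q not in must_checkpoint:
                must_checkpoint.add(q)
                queue.append(q)
    return must_checkpoint


# Illustrative dependency map: P0 received from P1, P1 from P2, P3 from P0.
deps = {
    "P0": {"P1"},
    "P1": {"P2"},
    "P2": set(),
    "P3": {"P0"},
}
print(sorted(processes_to_checkpoint("P0", deps)))  # ['P0', 'P1', 'P2']
```

In this example P3 is not forced to checkpoint when P0 initiates, because P0 has received nothing from P3. That is the sense in which only a subset of processes takes part in a given checkpointing round.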
Item Type: | Conference or Workshop Item (Invited Papers) |
---|---|
Additional Information: | 6481/50744 |
Uncontrolled Keywords: | Fault tolerance, Checkpointing, Consistent global checkpoint. |
Subjects: | T Technology > T Technology (General) |
Kulliyyahs/Centres/Divisions/Institutes: | Kulliyyah of Information and Communication Technology > Department of Computer Science |
Depositing User: | Dr. Al-Sakib Khan Pathan |
Date Deposited: | 21 May 2016 16:08 |
Last Modified: | 04 May 2017 09:28 |
URI: | http://irep.iium.edu.my/id/eprint/50744 |