Application and middleware transparent checkpointing with TCKPT on ClusterGrids

J. Kovács, P. Kacsuk, Radoslaw Januszewski, Gracjan Jankowski

Research output: Contribution to journal (Article)

2 Citations (Scopus)

Abstract

This paper introduces a combination of existing parallel checkpointing techniques for software-heterogeneous ClusterGrid infrastructures. Most existing solutions aim to support application transparency (no checkpoint-related code development in the application), while some others provide middleware transparency (no service modification). The main contribution of this paper is a solution that provides both application and middleware transparency at the same time. Compatibility and integrity requirements are identified, and corresponding conditions are established using Abstract State Machines. The most relevant checkpointing systems are checked against these conditions to examine their conformity. Based on the conditions, a novel checkpointing method is defined, and a proof-of-concept checkpointing tool called TotalCheckpoint (TCKPT) is introduced.

Original language: English
Pages (from-to): 498-503
Number of pages: 6
Journal: Future Generation Computer Systems
Volume: 26
Issue number: 3
DOI: 10.1016/j.future.2009.07.013
Publication status: Published - Mar 2010

Fingerprint

Middleware
Transparency

ASJC Scopus subject areas

  • Hardware and Architecture
  • Software
  • Computer Networks and Communications

Cite this

Application and middleware transparent checkpointing with TCKPT on ClusterGrids. / Kovács, J.; Kacsuk, P.; Januszewski, Radoslaw; Jankowski, Gracjan.

In: Future Generation Computer Systems, Vol. 26, No. 3, 03.2010, p. 498-503.

Kovács, J. ; Kacsuk, P. ; Januszewski, Radoslaw ; Jankowski, Gracjan. / Application and middleware transparent checkpointing with TCKPT on ClusterGrids. In: Future Generation Computer Systems. 2010 ; Vol. 26, No. 3. pp. 498-503.
@article{070ea7b69ab2489c83fa9d7b56854e15,
title = "Application and middleware transparent checkpointing with TCKPT on ClusterGrids",
abstract = "This paper introduces a combination of the existing parallel checkpointing techniques for software heterogeneous ClusterGrid infrastructures. Most of the existing solutions are aiming at supporting application transparency (no checkpoint related code development in application), but some others build middleware transparent (no service modification) solutions. The main contribution of this paper is to introduce a solution providing both application and middleware transparency at the same time. Compatibility and integrity requirements are identified and corresponding conditions are established using Abstract State Machines. The most relevant checkpointing systems are checked against the conditions in order to examine their conformity. Based on the conditions, a novel checkpointing method is defined and a proof of concept checkpointing tool, called TotalCheckpoint (TCKPT) is introduced.",
author = "J. Kov{\'a}cs and P. Kacsuk and Radoslaw Januszewski and Gracjan Jankowski",
year = "2010",
month = "3",
doi = "10.1016/j.future.2009.07.013",
language = "English",
volume = "26",
pages = "498--503",
journal = "Future Generation Computer Systems",
issn = "0167-739X",
publisher = "Elsevier",
number = "3",
}

TY - JOUR
T1 - Application and middleware transparent checkpointing with TCKPT on ClusterGrids
AU - Kovács, J.
AU - Kacsuk, P.
AU - Januszewski, Radoslaw
AU - Jankowski, Gracjan
PY - 2010/3
Y1 - 2010/3
N2 - This paper introduces a combination of the existing parallel checkpointing techniques for software heterogeneous ClusterGrid infrastructures. Most of the existing solutions are aiming at supporting application transparency (no checkpoint related code development in application), but some others build middleware transparent (no service modification) solutions. The main contribution of this paper is to introduce a solution providing both application and middleware transparency at the same time. Compatibility and integrity requirements are identified and corresponding conditions are established using Abstract State Machines. The most relevant checkpointing systems are checked against the conditions in order to examine their conformity. Based on the conditions, a novel checkpointing method is defined and a proof of concept checkpointing tool, called TotalCheckpoint (TCKPT) is introduced.
AB - This paper introduces a combination of the existing parallel checkpointing techniques for software heterogeneous ClusterGrid infrastructures. Most of the existing solutions are aiming at supporting application transparency (no checkpoint related code development in application), but some others build middleware transparent (no service modification) solutions. The main contribution of this paper is to introduce a solution providing both application and middleware transparency at the same time. Compatibility and integrity requirements are identified and corresponding conditions are established using Abstract State Machines. The most relevant checkpointing systems are checked against the conditions in order to examine their conformity. Based on the conditions, a novel checkpointing method is defined and a proof of concept checkpointing tool, called TotalCheckpoint (TCKPT) is introduced.
UR - http://www.scopus.com/inward/record.url?scp=70449722858&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70449722858&partnerID=8YFLogxK
U2 - 10.1016/j.future.2009.07.013
DO - 10.1016/j.future.2009.07.013
M3 - Article
AN - SCOPUS:70449722858
VL - 26
SP - 498
EP - 503
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
SN - 0167-739X
IS - 3
ER -