Four level provenance support to achieve portable reproducibility of scientific workflows

A. Bánáti, P. Kacsuk, M. Kozlovszky

Research output: Chapter in Book/Report/Conference proceedingConference contribution

11 Citations (Scopus)

Abstract

In the scientist's community one of the most vital challenges is the issue of reproducibility of workflow execution. In order to reproduce the results of an experiment, on one hand provenance information must be collected and on the other hand the dependencies of the execution need to be eliminated. Concerning the workflow execution environment we have differentiated four levels of provenance: infrastructural, environmental, workflow and data provenance. During the re-execution at all levels the components can change and capturing the data of each levels targets different problems to solve. For example storing the environmental and infrastructural parameters enables the portability of workflows between the different parallel and distributed systems (grid, HPC, cloud). The describers of the workflow model enable tracking the different versions of the workflow and their impacts on the execution. Our goal is to capture the most optimal parameters in number and type as well and reconstruct the way of data production independently from the environment. In this paper we investigate the necessary and satisfactory parameters of workflow reproducibility and give a mathematical formula to determine the rate of reproducibility. These measurements allow the scientist to make a decision about the next steps toward the creation of reproducible workflows.

Original languageEnglish
Title of host publication2015 38th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2015 - Proceedings
EditorsVlado Sruk, Zeljko Butkovic, Boris Vrdoljak, Andrej Sokolic, Stjepan Gros, Petar Biljanovic, Karolj Skala, Slobodan Ribaric, Branko Mikac, Marina Cicin-Sain, Mladen Mauher
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages241-244
Number of pages4
ISBN (Electronic)9789532330854
DOIs
Publication statusPublished - Jul 15 2015
Event38th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2015 - Opatija, Croatia
Duration: May 25 2015May 29 2015

Publication series

Name2015 38th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2015 - Proceedings

Other

Other38th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2015
CountryCroatia
CityOpatija
Period5/25/155/29/15

    Fingerprint

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Hardware and Architecture
  • Electrical and Electronic Engineering

Cite this

Bánáti, A., Kacsuk, P., & Kozlovszky, M. (2015). Four level provenance support to achieve portable reproducibility of scientific workflows. In V. Sruk, Z. Butkovic, B. Vrdoljak, A. Sokolic, S. Gros, P. Biljanovic, K. Skala, S. Ribaric, B. Mikac, M. Cicin-Sain, & M. Mauher (Eds.), 2015 38th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2015 - Proceedings (pp. 241-244). [7160272] (2015 38th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2015 - Proceedings). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/MIPRO.2015.7160272