Connecting scientific data to scientific experiments with provenance

S Miles, E Deelman, P Groth, K Vahi, G Mehta, L Moreau, K Chiu (Editor), R Buyya (Editor)

Research output: Chapter in Book/Report/Conference proceedingConference paper

27 Citations (Scopus)
195 Downloads (Pure)

Abstract

As scientific workflows and the data they operate on, grow in size and complexity, the task of defining how those workflows should execute (which resources to use, where the resources must be in readiness for processing etc.) becomes proportionally more difficult. While "workflow compilers", such as Pegasus, reduce this burden, a further problem arises: since specifying details of execution is now automatic, a workflow's results are harder to interpret, as they are partly due to specifics of execution. By automating steps between the experiment design and its results, we lose the connection between them, hindering interpretation of results. To reconnect the scientific data with the original experiment, we argue that scientists should have access to the full provenance of their data, including not only parameters, inputs and intermediary data, but also the abstract experiment, refined into a concrete execution by the "workflow compiler". In this paper, we describe preliminary work on adapting Pegasus to capture the process of workflow refinement in the PASOA provenance system
Original languageEnglish
Title of host publicationThird IEEE International Conference on e-Science and Grid Computing
Subtitle of host publicationBangalore, India. 10-13 December 2007
EditorsGeoffry Fox, Kenneth Chiu, Rajkumar Buyya
Place of PublicationLos Alamitos, CA
PublisherIEEE Computer Society
Pages179-186
Number of pages8
ISBN (Print)9780769530642, 0769530648
DOIs
Publication statusPublished - 2007
Event3rd IEEE International Conference on e-Science and Grid Computing - Bangalore, India
Duration: 1 Jan 2007 → …

Conference

Conference3rd IEEE International Conference on e-Science and Grid Computing
Country/TerritoryIndia
CityBangalore
Period1/01/2007 → …

Fingerprint

Dive into the research topics of 'Connecting scientific data to scientific experiments with provenance'. Together they form a unique fingerprint.

Cite this