Incremental Inference of Provenance Types

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

195 Downloads (Pure)

Abstract

Long-running applications nowadays are increasingly instrumented to continuously log provenance. In that context, we observe an emerging need for processing fragments of provenance continuously produced by applications. Thus, there is an increasing requirement for a mode of incremental processing of provenance, while the application is still running, to replace batch processing of a complete provenance dataset available only after the application has completed. A process of particular interest is summarising provenance graphs, which has been proposed as an effective way of extracting key features of provenance and storing them in an efficient manner. To that goal, summarisation makes use of provenance types, which, in loose terms, are an encoding of the neighbourhood of nodes.

This paper shows that the process of creating provenance summaries of continuously provided data can benefit from a mode of incremental processing of provenance types. We also introduce the concept of a library of types to reduce the need for storing copies of the same string representations for types multiple times. Further, we show that the computational complexity associated with the task of inferring types is, in most common cases, the best possible: only new nodes have to be processed. We also identify and analyse the exception scenarios. Finally, although our library of types, in theory, can be exponentially large, we present empirical results that show it is in practice quite compact.
Original languageEnglish
Title of host publicationProvenance and Annotation of Data and Processes
Subtitle of host publication8th and 9th International Provenance and Annotation Workshop, IPAW 2020 + IPAW 2021, Virtual Event, July 19–22, 2021, Proceedings
PublisherSpringer, Cham
Pages145-162
Number of pages18
ISBN (Electronic)978-3-030-80960-7
ISBN (Print)978-3-030-80959-1
DOIs
Publication statusPublished - 9 Jul 2021

Fingerprint

Dive into the research topics of 'Incremental Inference of Provenance Types'. Together they form a unique fingerprint.

Cite this