Aggregation by Provenance Types: A Technique for Summarising Provenance Graphs

Luc Moreau
(University of Southampton)

As users become confronted with a deluge of provenance data, dedicated techniques are required to make sense of this kind of information. We present Aggregation by Provenance Types, a provenance graph analysis that is capable of generating provenance graph summaries. It proceeds by converting provenance paths up to some length k to attributes, referred to as provenance types, and by grouping nodes that have the same provenance types. The summary also includes numeric values representing the frequency of nodes and edges in the original graph. A quantitative evaluation and a complexity analysis show that this technique is tractable; with small values of k, it can produce useful summaries and can help detect outliers. We illustrate how the generated summaries can further be used for conformance checking and visualization.
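
The grouping step described above can be illustrated with a minimal sketch. It is not the paper's implementation: the graph encoding (a dict of node labels plus a list of labelled edges), the function name `summarise`, and the representation of a provenance type as a set of label sequences are all assumptions made for illustration. For each node, the sketch collects the label sequences of every outgoing path of length 0 to k, groups nodes whose collections are identical, and counts how many original nodes and edges fall into each group.

```python
from collections import Counter, defaultdict


def summarise(nodes, edges, k):
    """Group nodes by their outgoing label paths of length 0..k.

    nodes: dict mapping node id -> base label (hypothetical encoding)
    edges: list of (source, edge_label, target) triples
    Returns (signature per node, node counts per group, edge counts per group pair).
    """
    out = defaultdict(list)
    for src, label, dst in edges:
        out[src].append((label, dst))

    # Length 0: a node is described by its own base label only.
    paths = {n: frozenset({(base,)}) for n, base in nodes.items()}
    signature = {n: (paths[n],) for n in nodes}

    # Length i: prefix each length-(i-1) path of a successor with the edge label.
    for _ in range(k):
        paths = {
            n: frozenset((label,) + p for label, dst in out[n] for p in paths[dst])
            for n in nodes
        }
        for n in nodes:
            signature[n] += (paths[n],)

    # Nodes with equal signatures collapse into one summary node;
    # the counters record frequencies in the original graph.
    node_counts = Counter(signature.values())
    edge_counts = Counter(
        (signature[s], label, signature[d]) for s, label, d in edges
    )
    return signature, node_counts, edge_counts


# Toy graph: two entities generated by two activities that used a third entity.
nodes = {"e1": "entity", "e2": "entity", "e3": "entity",
         "a1": "activity", "a2": "activity"}
edges = [("e1", "wasGeneratedBy", "a1"), ("e2", "wasGeneratedBy", "a2"),
         ("a1", "used", "e3"), ("a2", "used", "e3")]

signature, node_counts, edge_counts = summarise(nodes, edges, k=1)
```

In this toy graph, k = 0 only separates entities from activities, whereas k = 1 also distinguishes the entity that was used (`e3`) from the two that were generated. This reflects the abstract's point that small values of k can already expose structure.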

In Arend Rensink and Eduardo Zambon: Proceedings Graphs as Models (GaM 2015), London, UK, 11-12 April 2015, Electronic Proceedings in Theoretical Computer Science 181, pp. 129–144.
Published: 10th April 2015.

ArXived at: https://dx.doi.org/10.4204/EPTCS.181.9