Version 4.2 by adavison on 2020/08/05 08:40

Show last authors
1 == Introduction ==
2
3 Computational provenance is a record of all the steps in a computational scientific workflow, including the code that was run, input data, the computational environment (hardware, OS, compiler versions, library version...), and output data.
4
5 Capturing computational provenance facilitates:
6
7 * reproducibility of results
8 * management and tracking of workflows/projects by the scientists/engineers involved
9 * evaluation/review by other scientists and engineers
10
11
12
13 == Standards ==
14
15 Information about the W3C PROV ontology and related tools
16
17
18 == Storage of provenance in the Knowledge Graph ==
19
20
21 == Tools for automated capture of provenance ==
22
23 * on different systems:
24 ** HPC systems
25 ** neuromorphic systems
26 ** Jupyter notebooks
27 ** users' own computers
28 * prospective/pre-emptive vs run-time provenance capture
29 * capture of metadata vs capture of artefacts
30
31 == Communication between computer systems and the KG ==
32
33 * local cache and synchronization?
34
35
36 == User interfaces for browsing, visualizing, and searching provenance information ==
37
38