Provenance of the Result Data Asset

Tracking with Lineage Graph

Result Data Assets automatically have a Lineage Graph generated that shows how the results were generated and whether they are reproducible.

In the graph, you can see which Capsules, Pipelines and Data Assets were used to produce a particular Results Data Asset. The color coding shows you if/how the final Results can be reproduced, and if a Data Asset can’t be reproduced, you’ll see exactly which step of the process or requisite data are no longer available.

Color Coding

  • Green: The Capsule(s)/Pipeline(s) and the Data Asset(s) required are versioned and kept on the platform and it is fully reproducible.

  • Red: A Capsule, Pipeline, or Data Asset is missing for reproducing the Results Data Asset. You can hover over the Results and see why it's no longer reproducible on the platform.

  • Blue: This Results Data Asset might be reproducible, but it cannot be guaranteed. This happens when a Capsule or Pipeline used to generate this Results Data Asset has not been released.

Preserve the Parameters when running with the App panel

The Result Data Asset can come from the Run with Parameters when working with the App panel. In this type of Reproducible Run, the user can run the Capsule with different parameters without modifying the script. Hence, the code version will not be sufficient to capture the full reproducible process.

Parameters Values are the values input to the App Panel and listed in the Provenance section of the Result details to record the parameters used in creating that results.