Components of a Pipeline

A standard pipeline consists of a data asset followed by a series of capsules that write results to a Results Bucket.

Each capsule is a standalone and fully reproducible process that reads data from the data folder and writes results to the results folder. When implemented in a pipeline, the contents of each capsule’s data folder are ignored. The data asset is passed into the first capsule’s data folder, and the results of the first capsule are passed to the data folder of the second capsule. Results from each capsule will only be saved if it is connected to the Results Bucket.

This section covers the main components of a pipeline:

  • Capsules

  • Data

  • Results Bucket

  • Map Paths

  • Capsule Settings

  • Pipeline Settings

Last updated