Create a pipeline in 5 minutes (Update)
This demonstration provides the steps for building a simple pipeline and processing data in parallel using Code Ocean Pipelines.
Create an example dataset containing reads from different samples. All reads for each sample will be in a separate folder. The pipeline will pass each folder to the downstream alignment, allowing each sample to process in parallel.
Bucket Name - codeocean-public-data
Path - example_datasets/Normox
Click My Pipelines to navigate to Pipelines
Click New Properties, Create New to create the pipeline
Click Manage Data Assets
Search for the Data Asset - Normox
Drag and drop the data asset onto the grid, to add it to the interface
My Capsules are from the Repository
Code Ocean Apps are prebuilt capsules from Code Ocean
Add FastQC capsules to the pipeline
Select the latest version
Connect Reads to FastQC
Connect FastQC to Results Bucket
Select Default as the Connection type
Click Confirm
Click Commit Changes in Reproducible Run
To change where the output files are stored adjust the Destination parameter
Click Reproducible Run
Normox had 6 samples with sequencing files
Instances of FastQC are spun up to process these samples in parallel
Once the run is complete, the results can be downloaded locally or included into a Data Asset
Pipeline with Parameters
The previous implementation used default parameters of the capsule. This demonstration provides parameters via App Builder for pipelines.
Click App Builder, Create App
Provide a subtitle for the pipeline
Include instructions explaining the function of the use cases of the pipeline in App instructions
Click Add to add data
Complete the fields, see App Panel Builder for details
Click Save
Under Parameters capsules can be arranged into any order and will be displayed on the App Panel in this order.
Click Finish
Click Run
The number of threads as per the FastQC instance, logging and levels, Normox has six samples with sequencing files. There are six instances of FastQC are spun up to process these samples in parallel
Once the run is complete, results can be downloaded locally or put into a data asset
Last updated