Working with Data Assets in a Capsule

Attaching Data Assets to a Capsule

You can attach a data asset to one or more capsules to use in your computations. This means that the data won’t be copied back and forth to your Cloud Workstation, which reduces the capsule's size and speeds up the cloud workstation.

From the IDE

  1. Go to the data folder and click Manage Datasets.

  2. Attach/Detach Data side panel will appear. Click on the paper clip icon to attach the Data Asset.

From a Cloud Workstation (e.g., RStudio, Terminal, or Jupyter)

  1. Go to the data folder and click Manage Datasets.

  2. Attach/Detach Data side panel will appear. Click on the paper clip icon to attach the Data Asset.

When attaching an external dataset, the following points should be considered:

  1. To link to an external dataset, you need to add the secret or Assumed Role to the capsule for accessing AWS S3. Check out Accessing a Secret in a capsule for more information.

  2. Due to AWS S3 behavior, it takes time to reach files.

Viewing the Detail of the Data Asset when attaching

From the Attach/Detach Data page, metadata can be viewed without leaving the Cloud Workstation page.

To retrieve this information:

  1. Hover over the data asset, which does not have to be attached

  2. Click Data Details

While in the Cloud Workstation, the same Data information will display as it's displayed on the Data dashboard.

Renaming the Mounting Point (Folder)

After you have attached a dataset to your capsule, you can change the name of the mounting point (that is, the folder name) in the Code Ocean IDE.

  1. Hover over the attached dataset under the /data folder.

  2. Click on the down arrow.

  3. Select Rename.