Data Connectors
Each data connector consists of two Capsules, one which performs a query and outputs a data file (in parquet, text or other tabular format) and one which will create a Data Asset automatically from the results.

Databricks - Data Connector
Query a Databricks metastore to pull data using SQL.
None

Databricks - Data Asset Generation
Create a Data Asset from a SQL query selecting from a Databricks metastore.
None

BigQuery - Data
This Capsule makes a connection to Google Cloud's BigQuery, runs a query and downloads the results as a file.
None

BigQuery - Data Asset Generator
A Data Asset generator for Google BigQuery.
None

MySQL - Data Connector
This Capsule will query a MySQL database and output a file containing the requested data.
None

MySQL - Data Asset Generation
This Capsule will run the MySQL - Data Connector Capsule to query a MySQL database and automatically create a Data Asset from the result.
None

Snowflake - Data Connector
This Capsule will query a Snowflake database and output a file containing the requested data.
None

Snowflake - Data Asset Generation
This Capsule will run the Snowflake - Data Connector capsule to query a Snowflake database and automatically create a Data Asset from the result.
None

AWS Athena (Glue) Data Connector
This Capsule will submit a SQL query using Athena and will output a result file for use within Code Ocean.
None

AWS Athena (Glue) Data Asset Generation
This Capsule will perform an AWS Athena query by calling an external Capsule (AWS Glue Athena connector) and generating an output Data Asset.
None

Redshift Data Connector
This Capsule will query a Redshift database and output a file containing the requested data.
None

Redshift Data Connector - Data Asset Generation
This Capsule will run the Redshift Data Connector capsule to query a Redshift database and automatically create a Data Asset from the result.
None

Fetch data with ffq
This Capsule pulls data from GEO, SRA, EMBL-EBI, DDBJ or Biosample by accession number.
None

Download data from BaseSpace
This Capsule downloads demultiplexed (fastq.gz) or raw (bcl) Illumina sequencing data through the Illumina BaseSpace CLI. This Capsule requires a BaseSpace account and NGS data owned or shared with the user.
None

Publishing data to Tableau
The application is a Streamlit-based interface that simplifies interactions with Tableau. It assists in publishing new data sources to Tableau Cloud from local spreadsheet files.
.csv or .tsv files to upload to Tableau.
Was this helpful?