Skip to content

Instantly share code, notes, and snippets.

@edmondop
Created April 8, 2024 15:27
Show Gist options
  • Save edmondop/eaeb50202535af6c438c11ff46267804 to your computer and use it in GitHub Desktop.
Save edmondop/eaeb50202535af6c438c11ff46267804 to your computer and use it in GitHub Desktop.
datasets:
- from: databricks:spiceai.datasets.my_awesome_table // A reference to a table in the Databricks unity catalog
name: my_delta_lake_table
params:
endpoint: "https://dbc-a1b2345c-d6e7.cloud.databricks.com"
catalogs:
- name: mycatalog1
type: databricks
endpoint: https://dbc-a1b2345c-d6e7.cloud.databricks.com
- name: mycatalog2
type: aws
catalog_name: foobar
datasets:
- from: databricks:spiceai.datasets.catalogs.mycatalog1 // A reference to a table in the Databricks unity catalog
name: parquet_table // this could be a parquet table registered in the unity catalog
- from: databricks:spiceai.datasets.catalogs.mycatalog2
name: parquet_table // this could be a delta table registered in the glue catalog
- from: databricks:spiceai.datasets.object_store // A reference to a table in the Databricks unity catalog
name: my_csv_table
uri: s3://example-bucket/path/to/object
format: csv // here the format is necessary, since the metadata is not registered in the catalog
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment