How to specify distribution/sort style and column encoding for dbt seeds in Redshift?

evgeniy · September 1, 2022, 1:19pm

How to specify distribution and sort style for dbt seeds?

There are no .sql files associated with seeds. How do we define the config? And how do we define column encoding algorithms?

joellabes · September 4, 2022, 11:29pm

Hey @evgeniy, as far as I know you can’t define dist/sort style for seeds. In general, you probably shouldn’t need to though as they’re small files, so even if the seed isn’t on the same node it would be trivial for Redshift to copy it over.

With that said, if you do need to configure this then there’s a workaround: you could make a model that just passes the seed through:

-- models/my_seed_with_dist_key_set.sql
{{ config(materialized='table',  sort='id',  dist='received_at') }}

select * from {{ ref('my_seed') }}

and when that gets materialized it will behave as you’d expect. It’s a bit suboptimal because you’d have two copies of the table in your warehouse, but you can materialize your seeds into a different schema that your BI tools etc can’t access if you want to.

This would also be a good thing to add as an issue on the dbt-redshift repo.

system · September 11, 2022, 11:29pm

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Seed data column type changes aren't being applied Help seeds	5	4523	September 4, 2022
dbt seed error with schema config Help seeds , redshift , custom-schema , dbt-core	2	439	September 18, 2024
Why is DBT changing datatypes, column lengths and column order Archive	1	3083	December 31, 2021
loading seed with SUPER type to Redshift Help seeds , redshift	1	1090	January 4, 2023
Is it possible to import a redshift table structure into a source definition (schema.yml) Help redshift	5	1088	July 3, 2023

How to specify distribution/sort style and column encoding for dbt seeds in Redshift?

Related topics