Models as source for snapshots

jesse · July 15, 2019, 8:59pm

Hey there, I have a Redshift Spectrum table that contains a bunch of JSON data. One column has JSON like this:

   "user_identities":[  
      {  
         "identity_type":"email",
         "identity":"bob@bob.com",
         "timestamp_unixtime_ms":1540483995196
      },
      {  
         "identity_type":"customer_id",
         "identity":"12345",
         "timestamp_unixtime_ms":1540483995196
      }
   ]
}

So far as I can tell, in Redshift Spectrum, I can only access the data in this object by declaring the full path, which means I can’t just snapshot the Spectrum table. With this in mind, I have a model that creates a table like this:

| email       | customer_id | updated_at          |
|-------------|-------------|---------------------|
| bob@bob.com | 12345       | 2019-07-06 21:41:10 |

What I’m wondering is if there are any downsides to using this model as a snapshot source. The potential issue I can see is if the dbt runs overlap, but I think I can mostly mitigate that. Any other potential issues?

drew · July 23, 2019, 1:09am

Hey @jesse - one of the cool things about Snapshots is that you can now specify a query to snapshot. I think it was a pretty common workaround to “archive” a model, but that definitely leads to all sorts of quirks in dbt runs. You’d need to run some of your models, then your archive, then the rest of your models!

Instead, I think the move is definitely to implement the select logic that you’re outlining here in the body of your snapshot block. Does it sound like that would fit your needs?

jesse · July 23, 2019, 2:28pm

Sounds like it should work perfectly! Thanks again for your help.

Topic		Replies	Views
Using fromjson for JSON files Archive	0	2545	September 6, 2020
Using DBT Archive Archive	2	4644	September 14, 2018
can i create an scd type 2 model in dbt without using snapshot? Help	4	3066	March 1, 2023
dbt cloud +Red-shift not recognizing columns while trying to run snapshot Help snapshots , redshift , postgres , dbt-core	0	68	August 2, 2024
Error creating sql table <schema>.base_node_relationships Help redshift , dbt-cloud	0	1451	October 17, 2023

Models as source for snapshots

Related topics