Snapshotting from an existing SCD Type 2 table

bdenner · April 5, 2022, 1:05pm

Background:
We have a ‘lakehouse’ set up in which the source data is brought into a staging schema, then history is tracked and stored in an ods schema on top of that using SCD type 2.
The tables I am now modelling for the warehouse side have many months of history already stored in this ods layer as SCD type 2. The changes are tracked across all columns in this ods layer.
However, for our dimensional data warehouse, only some of the columns in this table need to be tracked for changes in order to build a timestamped accumulating snapshot fact table.

Problem:
dbt's snapshot capability seems to be solely for tracking changes that occur in an SCD type 1 table (rows overwritten in place).
If I follow this rule, I would need to create the snapshot from our staging schema. The baseline snapshot would then be the most recent version of each row, and I would lose all of the accumulated history already stored in the ods.
Alternatively, I could try to create an SCD type 2 table over the existing SCD type 2 table (ods schema), tracking fewer columns using the dbt ‘check’ strategy. There are then two issues to tackle:

1. We need an initial load which reconstructs the SCD type 2 history (but tracking changes over fewer columns) for all the existing history.
2. dbt needs to be able to track changes which are loaded as new rows rather than overwrites.
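
To make the first issue concrete, here is a minimal sketch in plain Python (outside dbt) of what the initial load has to do: merge consecutive ods versions of a row whenever the tracked columns did not change between them. The column names (`id`, `status`, `note`, `valid_from`, `valid_to`) and the function name `collapse_scd2` are assumptions for illustration only, and the sketch assumes the versions of each key are contiguous with no gaps.

```python
def collapse_scd2(rows, key, tracked, valid_from="valid_from", valid_to="valid_to"):
    """Merge consecutive SCD2 versions whose tracked columns are identical.

    Assumes the versions of each key are contiguous (no gaps between
    one row's valid_to and the next row's valid_from).
    """
    collapsed = []
    # Order by business key, then by the start of each version's validity.
    for row in sorted(rows, key=lambda r: (r[key], r[valid_from])):
        # Keep only the key, the tracked columns and the validity interval.
        slim = {
            key: row[key],
            **{c: row[c] for c in tracked},
            valid_from: row[valid_from],
            valid_to: row[valid_to],
        }
        prev = collapsed[-1] if collapsed else None
        unchanged = (
            prev is not None
            and prev[key] == slim[key]
            and all(prev[c] == slim[c] for c in tracked)
        )
        if unchanged:
            # Same tracked values: extend the previous interval.
            prev[valid_to] = slim[valid_to]
        else:
            collapsed.append(slim)
    return collapsed


# Hypothetical ods history: "note" changes are tracked in ods but not wanted here.
ods_rows = [
    {"id": 1, "status": "open", "note": "a", "valid_from": "2022-01-01", "valid_to": "2022-02-01"},
    {"id": 1, "status": "open", "note": "b", "valid_from": "2022-02-01", "valid_to": "2022-03-01"},
    {"id": 1, "status": "closed", "note": "b", "valid_from": "2022-03-01", "valid_to": None},
]
history = collapse_scd2(ods_rows, key="id", tracked=["status"])
```

With only `status` tracked, the two "open" versions collapse into a single row valid from 2022-01-01 to 2022-03-01, followed by the current "closed" row.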

Issue number (2) in particular I would expect dbt to have a solution for. It is not uncommon to have a source table which is loaded incrementally (each change appended as a new row) rather than updated in place as type 1. We have plenty of data sources which are event-based XML, so each time a change is made the whole row is extracted and sent to us as XML. If dbt is unable to create an SCD on this type of incrementally loaded source data, this is a serious limitation.
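
As an illustration of the behaviour wanted for this kind of source, the following minimal sketch (plain Python, not a dbt feature) derives SCD type 2 intervals from append-only rows: a new version starts only when a tracked column changes, and the previous version is closed at that load time. The names `scd2_from_appends`, `loaded_at`, `price` and `payload` are hypothetical.

```python
def scd2_from_appends(events, key, tracked, loaded_at="loaded_at"):
    """Build SCD2 versions from append-only rows, tracking only some columns."""
    history = []
    # Process each key's rows in the order they were loaded.
    for event in sorted(events, key=lambda e: (e[key], e[loaded_at])):
        prev = history[-1] if history else None
        same_key = prev is not None and prev[key] == event[key]
        if same_key and all(prev[c] == event[c] for c in tracked):
            continue  # a new row arrived, but no tracked column changed
        if same_key:
            # Close the previous version at the moment the change was loaded.
            prev["valid_to"] = event[loaded_at]
        history.append({
            key: event[key],
            **{c: event[c] for c in tracked},
            "valid_from": event[loaded_at],
            "valid_to": None,  # open-ended until a later change arrives
        })
    return history


# Hypothetical event rows: every change resends the whole row.
events = [
    {"id": 7, "price": 10, "payload": "<a/>", "loaded_at": "2022-01-01"},
    {"id": 7, "price": 10, "payload": "<b/>", "loaded_at": "2022-02-01"},
    {"id": 7, "price": 12, "payload": "<c/>", "loaded_at": "2022-03-01"},
    {"id": 8, "price": 5, "payload": "<d/>", "loaded_at": "2022-01-15"},
]
versions = scd2_from_appends(events, key="id", tracked=["price"])
```

Here the second row for `id` 7 is ignored because only the untracked `payload` changed, so the result has two versions for `id` 7 and one for `id` 8.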

Has anyone found a solution to this?

