I have a question about unique column in Snapshot comparision while merge.

adhakal321 · August 4, 2025, 10:25am

The problem I’m having with dbt snapshot on dbt_scd_id.

The context of why I’m trying to do this:

I need to create an SCD Type 2 table. While generating the snapshot, a hashed column dbt_scd_id is created by concatenating the unique key columns with a timestamp (e.g., using current_timestamp()).

However, during comparison, the hash column includes the timestamp, which differs with each execution.

When I check the query history, I see that three different statements were executed at different times. These merge statements check for matches on specific columns, but I don’t understand how the hash is matched on the merge statement. Especially since dbt_scd_id includes current_timestamp(), which changes every run.

Am I doing something wrong, or am I misunderstanding how the hashing and merge logic work?

marcelo · August 4, 2025, 6:05pm

Hi Anil,

The dbt_scd_id column is not used to find changes. Only after dbt finds a change does it create a new version of the row. It’s at that moment that it generates a new dbt_scd_id to identify this new version.

Topic		Replies	Views
How is dbt_scd_id calculated? Archive	2	9520	October 1, 2021
Snapshots failing with Duplicate DBT_SCD_IDs Help snapshots , snowflake	1	2032	February 9, 2024
unique_key config for snapshots Help snapshots	1	2900	October 27, 2022
DBT Snapshot creating duplicate rows even though there's no change in source data Help snapshots , incremental , snowflake	6	4339	September 27, 2024
Snapshot behavior Help	1	2030	October 10, 2023

I have a question about unique column in Snapshot comparision while merge.

The problem I’m having with dbt snapshot on dbt_scd_id.

The context of why I’m trying to do this:

Related topics