I was wondering if there’s a way to perform a full refresh on a subset of an incremental model, for example based on a date column.
Delete from my_table where date_column > "yyyy-mm-dd" would be the logical choice here, but this is a restricted environment, and I can only run dbt commands from the cluster’s CLI.
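For concreteness, the statement I have in mind is roughly the sketch below. my_table and date_column are placeholder names, and the cutoff date is made up for illustration.

```sql
-- Remove every row after the cutoff so the next incremental run can reload them.
-- The date literal is a hypothetical example; string quoting rules vary by warehouse.
DELETE FROM my_table
WHERE date_column > '2024-01-01';
```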
I also thought of using a pre_hook, but that means I have to modify my model and open a PR, which isn’t very convenient.
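To illustrate, the pre_hook version would look something like this sketch. It is not my actual model: my_upstream_model and the cutoff date are made-up placeholders, and I'm assuming the target table already exists when the hook runs (on a very first run the delete would fail).

```sql
-- Sketch of an incremental model that deletes a date range before each run.
-- 'my_upstream_model' and the cutoff date are hypothetical.
{{ config(
    materialized='incremental',
    pre_hook="delete from {{ this }} where date_column > '2024-01-01'"
) }}

select *
from {{ ref('my_upstream_model') }}
{% if is_incremental() %}
-- the pre_hook has already run, so this reloads everything after the cutoff
where date_column > (select max(date_column) from {{ this }})
{% endif %}
```

The hardcoded date is the inconvenient part: changing it means editing the model and going through a PR each time.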
Do you guys have any idea how I could solve this issue, or a workaround to suggest?
It’s very normal to want to run these types of DML statements on the data warehouse from time to time for maintenance, schema evolution, etc. Ten or so years ago this was a huge part of the day-to-day work of a data engineer.
So the logical solution is to give your data/analytics engineers the ability to run DML statements against the data warehouse (probably through just-in-time (JIT) privilege escalation), and to have a process to make sure they don’t screw anything up too much when they do.