Execute models a max of N times per day

Sheridan · March 15, 2023, 7:29pm

Hello, my dbt project is starting get large and models are getting used in multiple data flows. It’s great to re-use existing models but scheduling jobs efficiently is becoming a challenge. Currently, we have jobs to build “core” models like this: dbt run --select +mart_1+.

The problem is that a dependency for mart_1 can also be a dependency for mart_2. Mart_1 and mart_2 only really have 1 common table. If we schedule mart_2 as dbt run --select +mart_2+ then both lineage paths get run entirely. I really only want to run each model distinctly once, but in order of their dependencies.

My thought was to indicate to dbt that I want to only run models N times per day (once in this case). If a job triggers that model to run a second time, skip it and move to the next model. Is there a mechanism to do this or a more elegant way to solve the problem?

I could put the only dependency in as a source but that feels like it could create circular logic as time passes and the project gets more and more nested.

Topic		Replies	Views
Does dbt cloud support more than one job dependency Help dbt-cloud	4	175	November 15, 2024
Model run scheduling patterns Archive	2	4918	March 5, 2020
Running two dbt Cloud jobs back to back Help orchestration-and-deployment , dbt-cloud	6	4261	April 8, 2024
Schedule job dbt Help dbt-core	2	941	April 17, 2024
How to change execution commands with dbt jobs Help variables , orchestration-and-deployment	5	2141	April 17, 2023

Execute models a max of N times per day

Related topics