How we (used to) structure our dbt projects

claire · May 20, 2019, 6:14pm

It’s totally a matter of preference! I thought it was overkill when I started at Fishtown, but now I like it!

As rules of thumb:

We always like to have ref or source functions at the top of a file, as we feel that it makes it easy for us to understand the dependencies when first looking at a model. “Importing” them as CTEs helps us with this.
We pretty much always finish off our queries with a select * from my_final_cte. I find that this pattern helps me when writing SQL for a model, as it means I can keep chaining together CTEs easily, without having to go back and add a CTE name, parens, and indents to the part of the query I was just working on! I also find it easier for debugging during development

Overall these conventions help us ensure every analyst that works on our project writes SQL that looks consistent, which improves readability in the long run!

We’ve also done some quick investigation and found that on modern data warehouses it doesn’t impact the performance of your queries, more on that over here CTEs are Passthroughs--Some research!

Topic		Replies	Views
Your dbt Project Checklist Archive	3	19493	February 3, 2023
Resources written by community members Archive	1	5584	March 25, 2019
How do you structure your marts & database schemas? Archive	2	4730	November 15, 2021
Modifying best practices for older, complicated, somewhat messy DAGs In-Depth Discussions best-practice	0	856	October 12, 2023
Seeking Advice on Streamlining Data Models in dbt Help	1	103	August 3, 2024

How we (used to) structure our dbt projects

Related topics