How we (used to) structure our dbt projects

claire · January 16, 2020, 1:28am

Hey @maria! Great question!

Since we have control over seeds, we’ll make sure that the data is already in a “staging” format, and name it with the stg_ prefix. Since there isn’t usually a data source for these models (typically they are codified business logic), we’ll often end up with names like stg_country_codes or stg_email_exclusion_list, rather than following a stg_<source>_<object> format. We aren’t super strict on this convention though, so we’re open to feedback here!

This is kind of hacky, but you can actually document and test seeds in a .yml files in the models/ directory. We’ll improve this in a future version of dbt, but for now you can do:

version: 2
models:
  - name: stg_country_codes
    columns:
       - name: country_code
         tests:
           - unique
           - not_null

Topic		Replies	Views
Your dbt Project Checklist Archive	3	19461	February 3, 2023
Resources written by community members Archive	1	5576	March 25, 2019
How do you structure your marts & database schemas? Archive	2	4678	November 15, 2021
Modifying best practices for older, complicated, somewhat messy DAGs In-Depth Discussions best-practice	0	850	October 12, 2023
Seeking Advice on Streamlining Data Models in dbt Help	1	92	August 3, 2024

How we (used to) structure our dbt projects

Related topics