schema.yml reading a view as a source

lbert · December 13, 2022, 1:30am

For the yml file, are sources view type that can be used as a source? Our source systems have logical views, and we don’t want to duplicate the complex logic in dbt schema.yml to avoid sync issues. It seems that yml doesn’t have a views type option.

Models are materializing as a view, so it compiles a nested view reading a view or CTA reading a view. Hence, it produces an unnecessary nested view redundancy. Is there another better practice?

Thanks,
Laura B

Sample schema.yml

sources:
  - name: table label
    schema: schema name
    tables:
      - name: database table name

joellabes · December 13, 2022, 9:08pm

Welcome @lbert!

Does this mean you are building your dbt project inside of the same production database where the data is created, as opposed to using an ETL tool to move the data into a separate analytics warehouse?

Can you give an example of what you mean here? There won’t be any complex transformation logic in a yaml file, and there is no yaml in a view definition so I don’t understand what logic you’re worried about duplicating.

In the dbt paradigm, it’s extremely common to have a stack of views reading from one another, as this enables you to break up your modelling logic into individual steps. Taken to an extreme this results in performance issues, so we recommend limiting the number of views chained together without a table along the way, but otherwise this isn’t an antipattern or something you need to worry too much about.

lbert · December 14, 2022, 4:44pm

Thank you for your response. There is a table_type option. The data provider owns the source view, and it contains business logic. DBT project can read a view vs creating a model that uses view and then creates a view of view.

scheme.yml
sources:

name: bi
schema: bi
tables:
- name: fact_email_funnel
  identifier: fact_email_funnel
  table_type: table
  description: email campaign engagement metrics
- name: fact_clickstream
  identifier: fact_clickstream
  table_type: view
  description: web clickstream events

joellabes · December 15, 2022, 1:40am

Can you link me to the dbt documentation where you found this setting? I don’t see it in the source properties documentation and have never heard of it.

Regardless, you can query sources that are views if you want to. Our modelling best practice strongly recommends using a staging layer to ensure that you only directly access sources once (and then build other models on top of the staging layer), but you can query them directly if you want to remove a layer of abstraction.

If you choose to go down that path, keep in mind that dbt projects pretty much always have views stacked on top of each other. It’s not clear to me why it’s important to you that the source not be a view on top of a view, when your downstream models are very very likely to be views on top of views anyway.

lbert · December 15, 2022, 6:55pm

I understand that one of DBT mainframe work is stacked views. In the past, we have seen performance issue with nested views. Sometimes phyical tables are the best options.

The great benefit of views is having business logic can be change quickly with litle impact and quick turnaround for the stackholders and little backfill efforts.

Topic		Replies	Views
dbt tries to materilize downstream models incorrectly as specifies in dbt_project.yml Help dbt-core	0	810	December 4, 2023
How do I specify a different schema for my source at run time? Help variables , yaml	3	19827	September 9, 2019
Cant change materialized from view to table Help	4	896	May 19, 2024
How to overwrite schema set in `profile.yml`? Archive	4	8547	December 1, 2021
Dbt Model Not Picking Up Source Schema From source.yml File Help postgres , dbt-core	7	195	May 13, 2025

schema.yml reading a view as a source

Related topics