Python model injecting SQL

Jan · March 27, 2023, 12:54pm

Welcome everybody!

The problem I’m having

I just started using python models and right on the first one I am experiencing some weird behaviour. My model bases on an SQL model (which I think is valid given the documentation here). However, when generating the .py file, it injects pure SQL into the code

Input (model.py)


def model(dbt, session):
    dbt.config(materialized = "table")
    df = dbt.ref("my-sql-model")

    dx = df
    ...

Output (.py file on GCS)

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('smallTest').getOrCreate()

spark.conf.set("viewsEnabled","true")
spark.conf.set("temporaryGcsBucket","my-bucket-name")

with __dbt__cte__dictionary__xxx as (
select
    something1 as column1,
    something2 as column2
from
    `my-project`.`my-dataset`.`my-table`
where
    xxx
)def model(dbt, session):
    dbt.config(materialized = "table")
    import pandas as pd
    df = dbt.ref("my-sql-model")

    dx = df

Setup

BigQuery, GCS, dbt Cloud v1.3

There is not much I was able to do here as I really don’t understand where it is coming from. Did you ever face that issue before?

Jan · March 27, 2023, 1:59pm

I spent some more time with it and I noticed that that the base model being set to ephemeral is causing this issue. Probably, I should’ve caught it much earlier cause the SQL is pretty distinct.

Still, isn’t it somewhat of a bug? For example, ephemeral model could still be passed as a python query or there could be a validation that would prevent from using such models in python models.

joellabes · March 28, 2023, 5:46am

You’re correct that your issue is because you’re ref-ing an ephemeral model. I would open an issue on the core repo for this - I suspect this is a bug as opposed to an intentional decision to not support accessing ephemeral models. If it is intentional then it should have a better error message at least!

system · April 4, 2023, 5:47am

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Python Incremental Model Help python-models	0	1457	March 3, 2023
dbt Python model (dbt-py) best practices In-Depth Discussions best-practice , python-models	1	14335	January 19, 2023
Schema error on python-model Help snowflake , python-models	0	1243	July 31, 2023
Schema mismatch on dbt python model run Help bigquery , python-models , dbt-cloud	1	779	November 29, 2024
Python model running in Snowflake and IDE but not as dbtCloud job Help python-models , dbt-cloud	2	1482	August 24, 2023

Python model injecting SQL

The problem I’m having

Input (model.py)

Output (.py file on GCS)

Setup

Related topics