Error when attempting to convert a dataframe to pandas.

darren.hickey · January 30, 2023, 4:58pm

Hi everyone,

I’m getting the error ‘Schema has to be provided to write_pandas when a database is provided’ when attempting convert a dataframe to pandas in a python script. I’m using snowflake.

I’ve seen the following error related to using a custom schema name: [CT-1813] [CT-1378] [Bug] Python models not picking up custom schema · Issue #393 · dbt-labs/dbt-snowflake · GitHub however when testing without a custom schema name the error persists.

Anyone else experienced this?

Thanks

joellabes · January 30, 2023, 7:56pm

Hey @darren.hickey, are you able to post the code you’re trying to run?

darren.hickey · January 31, 2023, 9:46am

Hey Joel,

Yeah the offending piece of code is the following: final_df = tickets_df.to_pandas()

Thanks

joellabes · February 1, 2023, 12:28am

The good news is that issue you linked above has been resolved and will come out in dbt-snowflake 1.4.1.

We’ve only seen this with custom schema names configured - is it possible that when you changed to testing without a custom schema, a config file didn’t save or something?

troyel · February 3, 2023, 2:49pm

+1 on this-one! Any timeline for 1.4.1 ?

Kavnag · February 9, 2023, 3:02am

I am having the same issue.
Running with dbt=1.4.1

joellabes · February 9, 2023, 3:07am

dbt-snowflake v1.4.1 came out earlier today, have you checked that you have the patch for dbt-snowflake installed, not just dbt-core 1.4.1?

troyel · February 9, 2023, 9:12am

I just tested dbt-snowflake 1.4.1 and can confirm the issue still persist @joellabes . The changelog of 1.4.1 also does not address this issue, so I am not surprised.

Kavnag · February 11, 2023, 4:22am

Is there any other alternative to “to_pandas()”

darren.hickey · February 17, 2023, 11:17am

Hi Joel, we have now tried on 1.4.1 and having fully removed the custom schema name and we are still getting the error.

joellabes · February 20, 2023, 6:00pm

OK - we don’t really have enough to reproduce this, can you post the full code file (not just the final line of code) for the model you’re trying to run as well as the logs from when you try doing a dbt run?

joellabes · February 21, 2023, 10:10pm

@troyel yes my mistake, it didn’t go out in 1.4.1, but confirmed with the PM that it is actively being worked on! I don’t have a version to share sorry

paul.schmidt · March 29, 2023, 7:19pm

We’re experiencing the same issue for one of our Python dbt models that uses a custom schema. Wanted to hop on this thread to be notified of when this would be resolved! (Should also mention that we are on dbt-snowflake==1.4.1)

jerco · April 25, 2023, 8:25am

cross-posting from: [CT-1813] [CT-1378] [Bug] Python models not picking up custom schema · Issue #393 · dbt-labs/dbt-snowflake · GitHub

@patkearns10 and I managed to get to the bottom of this by live-debugging with a very helpful & generous user who was running into the issue!

I’m not sure why this bug is cropping up for some Snowflake users, and not others; I believe it should have been solved at the source in snowflake-connector-python==3.0 (included in dbt-snowflake>=1.4).

For anyone still experiencing the issue, this seems to be a valid workaround:

def model(dbt, session):
    dbt.config(schema="custom_schema")
    pandas_df = dbt.ref("my_model").to_pandas()
    
    # add these lines
    session.use_database(dbt.this.database)
    session.use_schema(dbt.this.schema)
    
    return pandas_df

We’ll see if there’s a way to include those session.use_* calls within the dbt materialization code, so that you don’t need to write it in every Python model that returns a Pandas dataframe.

system · May 2, 2023, 8:26am

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Schema error on python-model Help snowflake , python-models	0	1247	July 31, 2023
Python model running in Snowflake and IDE but not as dbtCloud job Help python-models , dbt-cloud	2	1488	August 24, 2023
Python model on dbt cloud Help python-models	0	902	February 10, 2023
ModuleNotFoundError: No module named 'pandas' Help snowflake , python-models	5	2891	November 2, 2022
Schema mismatch on dbt python model run Help bigquery , python-models , dbt-cloud	1	784	November 29, 2024

Error when attempting to convert a dataframe to pandas.

Related topics