Training, saving, and running machine learning workloads with dbt/Snowflake

TLDR: For those of you deploying data science models using dbt & Snowflake, which parts of the pipeline do you run in dbt vs. directly in Snowflake?


Hi,

We’re working on a project that involves training a predictive machine learning model in Snowflake. We create our training dataset by transforming raw data with SQL in dbt, and then want to:

  1. train an ML model on this dataset and save the model, e.g. as a pickle file
  2. separately, load the model and make predictions
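
To make this concrete, here’s a minimal sketch of what we have in mind for step 1 as a dbt Python model running on Snowpark. The stage (`@ml_models`), model/table, and column names are all placeholders, and we’re assuming the stage has been created up front:

```python
# models/train_model.py
# Rough sketch only -- stage, table, and column names are placeholders.
# Assumes an internal stage created beforehand, e.g.: CREATE STAGE ml_models;
import io
import pickle

from sklearn.linear_model import LogisticRegression


def model(dbt, session):
    # scikit-learn is available via Snowflake's Anaconda channel
    dbt.config(materialized="table", packages=["scikit-learn"])

    # Training set built by an upstream dbt SQL model (placeholder name)
    train = dbt.ref("training_dataset").to_pandas()
    X = train.drop(columns=["TARGET"])  # TARGET = placeholder label column
    y = train["TARGET"]

    clf = LogisticRegression(max_iter=1000).fit(X, y)

    # Pickle the fitted model and upload it to the internal stage
    session.file.put_stream(
        io.BytesIO(pickle.dumps(clf)),
        "@ml_models/clf.pkl",
        auto_compress=False,
        overwrite=True,
    )

    # A dbt Python model must return a DataFrame, so return a small audit row
    return session.create_dataframe(
        [("clf.pkl", float(clf.score(X, y)))],
        schema=["model_file", "train_accuracy"],
    )
```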

We know it’s possible to run Python models with dbt, and we’ve successfully trained the model in dbt. For storing the serialized model, the options we can think of are:

  • Snowflake internal stages
  • External stages, such as S3
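
And a corresponding sketch of step 2 as a separate dbt Python model, reading the pickle back from the internal stage via Snowpark’s file API (same placeholder names as above):

```python
# models/predict.py
# Same placeholder names as the training sketch; assumes scoring_dataset
# has exactly the feature columns the model was trained on.
import pickle


def model(dbt, session):
    dbt.config(materialized="table", packages=["scikit-learn"])

    # Pull the pickled model back down from the internal stage
    with session.file.get_stream("@ml_models/clf.pkl") as f:
        clf = pickle.load(f)

    scoring = dbt.ref("scoring_dataset").to_pandas()
    scoring["PREDICTION"] = clf.predict(scoring)

    return session.create_dataframe(scoring)
```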

We’d be eager to understand: are there any best practices for running SQL + Python workloads in Snowflake that involve saving and later loading machine learning models?

Thanks!