Hi all,
I recently began using dbt-core for data processing at our company. When working with raw or evolving datasets, I often find it easier to perform initial data exploration and preprocessing in Python before structuring transformations in dbt; a simplified sketch of what I mean is below. This helps me understand the data before defining dbt models. However, I wonder whether this hybrid approach is best practice, or if there are better ways to integrate Python into a dbt workflow.
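For context, here is roughly what that Python step looks like for me, heavily simplified (the file name, columns, and cleaning steps are all invented for this example):

```python
import pandas as pd

# Load a raw export whose schema I don't fully know yet
# (file name and columns are made up for illustration).
df = pd.read_csv("raw_events_export.csv")

# Initial exploration: inferred dtypes, null counts, value ranges.
print(df.dtypes)
print(df.isna().sum())
print(df.describe(include="all"))

# Light preprocessing before handing anything to dbt:
# normalize column names, parse timestamps, drop exact duplicates.
df.columns = df.columns.str.strip().str.lower().str.replace(" ", "_")
df["event_ts"] = pd.to_datetime(df["event_ts"], errors="coerce")
df = df.drop_duplicates()

# Let pandas promote columns to the best available nullable dtypes;
# this is my rough answer to "schema inference" today.
df = df.convert_dtypes()

# Write the cleaned file somewhere the warehouse can load it,
# so dbt models can treat it as a source.
df.to_parquet("clean/events.parquet", index=False)
```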
Some specific questions I have:
- Is it common to use Python for raw data preparation before feeding it into dbt?
- How do teams typically manage schema inference when working with unknown datasets?
- Are there recommended ways to combine dbt’s SQL-based transformations with Python-based processing (e.g., via dbt Python models, external preprocessing scripts, or other tools)? I've put a rough sketch of the dbt Python model option after this list.
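On the dbt Python model option: this is the shape of it as I understand it from the dbt docs. It is only supported on certain adapters (e.g., Snowflake, Databricks, BigQuery), and the model name `stg_events` and the transformation here are placeholders, not my real code:

```python
# models/events_enriched.py
# A dbt Python model: dbt calls model(dbt, session) and materializes
# whatever DataFrame it returns. The concrete DataFrame type depends
# on the adapter (e.g., Snowpark on Snowflake, PySpark on Databricks).

def model(dbt, session):
    dbt.config(materialized="table")

    # Upstream dbt model, referenced like {{ ref() }} in a SQL model.
    events = dbt.ref("stg_events")

    # Placeholder transformation; dropna() works on both Snowpark
    # and PySpark DataFrames. Real logic would go here.
    enriched = events.dropna()

    return enriched
```

As far as I can tell, `dbt run` then builds this alongside the SQL models, which seems like the most integrated way to keep Python inside the DAG rather than in a separate script.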
Would love to hear how others balance Python and dbt in their workflows!