How I Used ChatGPT to Auto-Generate dbt Model Descriptions from SQL Logic

benof · July 8, 2025, 11:50am

Hello

One small but impactful improvement I have made to our dbt workflow is automating the generation of model descriptions by using ChatGPT to analyze the underlying SQL. Descriptions are often neglected / rushed; yet they’re incredibly important for data discovery and documentation.

I built a simple Python script that sends the contents of .sql model files to the OpenAI API & returns concise, human-readable descriptions, which I then inject into schema.yml.

This saves a lot of manual writing and helps maintain consistency across the team. It’s particularly useful when onboarding new team members / reviewing legacy models.

I have set up the script to run locally but I’m exploring adding it to our CI pipeline so model documentation stays fresh with every commit. Has anyone else tried something similar or taken it a step further with test or exposure generation? I checked About documentation | dbt Developer Hub guide for reference .

When a teammate asked me what is ChatGPT , this small tool I hacked together turned out to be the best example of practical usage turning raw SQL into structured, documented knowledge. Sharing here in case others want to improve model documentation workflows using LLMs.

Thank you !!

Topic		Replies	Views
Here is a way to write dbt docs as SQL comment Show and Tell dbt-docs	1	7989	September 27, 2020
Accelerate your documentation workflow: Generate docs for whole folders at once In-Depth Discussions devblog	3	1771	May 31, 2024
Dynamically create staging dbt yaml and sql from JSON Schema file Help	0	50	September 12, 2025
Dynamic model generation Help	1	688	December 9, 2024
Documentation best practice In-Depth Discussions dbt-core	0	987	March 11, 2024

How I Used ChatGPT to Auto-Generate dbt Model Descriptions from SQL Logic

Related topics