I’m exploring ways to parse dbt-core
run logs (JSON format) to extract execution statistics programmatically. Specifically, I’d like to:
- Count success/failure: identify how many models passed/failed (e.g., from `PASS=2 WARN=0 ERROR=0` in the logs).
- Error diagnostics: extract error messages/reasons for failed models (e.g., SQL errors, connection issues).
- Impact analysis: determine affected rows (e.g., `SELECT 4` for a table model) or DDL operations (e.g., `CREATE VIEW`).
- Timing metrics: compile per-model timing (compile/execute phases) and total run duration.
From my testing, I’ve observed:
- Final stats like `PASS`/`ERROR` appear in log lines with `code: "Z023"` or `"E047"`.
- Model-specific results include `adapter_response` (e.g., `rows_affected`) and `status` fields.
- Errors may appear with `level: "error"` and stack traces.
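To show the direction I've started in, here's a rough sketch that tallies levels and collects error messages from `dbt run --log-format json` output. One caveat I've noticed: the field layout varies across dbt-core versions (some releases put `code`/`level`/`msg` at the top level, others nest them under an `info` key), so this handles both; treat the key names as assumptions, not a verified schema:

```python
import json
from collections import Counter

def tally_log_statuses(log_path):
    """Tally log levels and collect error messages from a dbt JSON log file.

    Assumes one JSON object per line. Field layout differs across dbt-core
    versions: some put `code`/`level`/`msg` at the top level, others nest
    them under an `info` key, so we check both.
    """
    levels = Counter()
    errors = []
    with open(log_path) as fh:
        for line in fh:
            line = line.strip()
            if not line:
                continue
            try:
                event = json.loads(line)
            except json.JSONDecodeError:
                continue  # skip any stray non-JSON lines
            info = event.get("info", event)  # tolerate both layouts
            level = info.get("level", "")
            levels[level] += 1
            if level == "error":
                errors.append((info.get("code", ""), info.get("msg", "")))
    return levels, errors
```

This gives me crude pass/fail counts and a list of `(code, message)` pairs for errors, but it feels fragile against schema changes, which is part of why I'm asking.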
Questions:
- Are there consistent patterns/log codes to reliably extract these metrics?
- How do you handle logs for partial runs (e.g., `--fail-fast` or interrupted jobs)?
- Are there hidden fields (e.g., in `run_results.json`) that simplify this analysis?
- Any tools/libraries (e.g., Python parsers) you recommend for log processing?