Best Practices for CI/CD Deployment

Hey there!

I am a relatively new dbt user who is nearing the stage of deploying to production, and my team is going to use CI/CD on GitLab. Are there any best practices or words of wisdom we should be aware of as we start setting this up? I'll be working with a dev who is much better at CI/CD but has almost no exposure to dbt architecture/setup.

Thanks in advance!


dbt Cloud is a great option depending on the size of your team and your data engineering maturity.

At GitLab, we run dbt in production via Airflow. Our DAGs are defined in this part of our repo. We run Airflow on Kubernetes in GCP. Our Docker images are stored in this project.
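To give a rough feel for that setup, here is a minimal sketch of a dbt task run through Airflow's KubernetesPodOperator. This is my own illustration, not our actual DAG code; the DAG id, namespace, image, and schedule are all placeholder assumptions:

```python
# Hypothetical sketch of a dbt task run via KubernetesPodOperator;
# the DAG id, namespace, and image are placeholders for illustration.
from datetime import datetime

from airflow import DAG
from airflow.providers.cncf.kubernetes.operators.pod import KubernetesPodOperator

with DAG(
    dag_id="dbt_production_run",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    dbt_run = KubernetesPodOperator(
        task_id="dbt_run",
        name="dbt-run",
        namespace="data",
        image="registry.example.com/data/dbt:latest",  # assumed image
        cmds=["dbt"],
        arguments=["run", "--profiles-dir", "/dbt"],
        get_logs=True,
    )
```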

For CI, we use GitLab CI. In merge requests, our jobs are set to run in a separate Snowflake database (a clone). Here are all the job definitions for dbt. The rest of the CI pipeline is defined here.
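As a hedged sketch of what a job like that can look like (not our actual definitions; the job name, image tag, and variable names are assumptions), a merge-request job can point dbt at a per-MR clone database:

```yaml
# Hypothetical sketch of a dbt CI job; the image tag, stage, and
# variable names are placeholders, not GitLab's actual definitions.
dbt-run-mr:
  stage: test
  image: ghcr.io/dbt-labs/dbt-snowflake:1.7.0  # assumed image/tag
  variables:
    # Point dbt at a per-MR clone database so CI never touches prod
    SNOWFLAKE_DATABASE: "MR_CLONE_${CI_MERGE_REQUEST_IID}"
  script:
    - dbt deps
    - dbt run
    - dbt test
  rules:
    - if: $CI_PIPELINE_SOURCE == "merge_request_event"
```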

The general principle, I think, is that you want your MRs to run dbt against real data while writing either to a dev schema or to a separate database clone, as we do. If you have dbt read environment variables to decide where to write, you can control that quite cleanly. (See our profile here for the details; a sketch of the idea follows below.)
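For illustration only (this is not our actual profile, and the env var names are assumptions), a profiles.yml can pull its write location from environment variables using dbt's built-in env_var() function, so CI and local dev set different targets without changing any code:

```yaml
# Hypothetical profiles.yml sketch; env var names are assumptions.
my_project:
  target: dev
  outputs:
    dev:
      type: snowflake
      account: "{{ env_var('SNOWFLAKE_ACCOUNT') }}"
      user: "{{ env_var('SNOWFLAKE_USER') }}"
      password: "{{ env_var('SNOWFLAKE_PASSWORD') }}"
      role: "{{ env_var('SNOWFLAKE_ROLE') }}"
      warehouse: "{{ env_var('SNOWFLAKE_WAREHOUSE') }}"
      # CI sets this to the MR clone; local dev points at a dev database
      database: "{{ env_var('SNOWFLAKE_DATABASE') }}"
      # The second argument to env_var() is a default if the var is unset
      schema: "{{ env_var('SNOWFLAKE_SCHEMA', 'dbt_dev') }}"
      threads: 4
```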

Hope this is useful!


@tmurphy Thank you for sharing the knowledge; this is super helpful. I was wondering how to pass Airflow's macros, especially {{ ds }} and {{ execution_date }}, into dbt. A possible solution I was considering was using environment variables, so I was encouraged to see the repository. A sketch of that approach is below. Many thanks!
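To make that concrete, here is a minimal sketch of the environment-variable approach (my own assumption of how it could look, not GitLab's actual DAG). BashOperator's env field is templated, so Airflow renders the macros before the task runs, and a dbt model can then read them with env_var(); the DBT_DS and DBT_EXECUTION_DATE names are placeholders:

```python
# Hypothetical sketch: pass Airflow template values to dbt via env vars.
# The DAG id, schedule, and env var names are assumptions.
from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

with DAG(
    dag_id="dbt_daily",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    dbt_run = BashOperator(
        task_id="dbt_run",
        bash_command="dbt run --profiles-dir /dbt",
        env={
            # env is a templated field, so Airflow renders these macros
            # at runtime; dbt models can read them via
            # {{ env_var('DBT_DS') }} in SQL or in dbt_project.yml.
            "DBT_DS": "{{ ds }}",
            # ts is the ISO-8601 string form of the logical date;
            # execution_date renders a datetime object, not a string.
            "DBT_EXECUTION_DATE": "{{ ts }}",
        },
    )
```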