Hi,
I’m trying to get started with dbt, but I’m stuck:
I have Snowplow data in BigQuery, and I was able to set up a profile, install the snowplow package, and create a dbt_project.yml file.
Now when I do a test run, I get error messages:
Database Error in test unique_snowplow_base_events_event_id (models/base/schema.yml)
Table name "pageviews" missing dataset while no default dataset is set in the request.
compiled SQL at target/compiled/snowplow/schema_test/unique_snowplow_base_events_event_id.sql
Database Error in test not_null_snowplow_base_events_app_id (models/base/schema.yml)
Table name "pageviews" missing dataset while no default dataset is set in the request.
compiled SQL at target/compiled/snowplow/schema_test/not_null_snowplow_base_events_app_id.sql
Database Error in test not_null_snowplow_base_events_event_id (models/base/schema.yml)
Table name "pageviews" missing dataset while no default dataset is set in the request.
compiled SQL at target/compiled/snowplow/schema_test/not_null_snowplow_base_events_event_id.sql
and more, of course…
I’m not sure where to set the dataset other than in the profiles.yml file. Am I missing something here?
This is my profile:
my-bigquery-snowplow-db:
  target: prod
  outputs:
    prod:
      type: bigquery
      method: oauth
      project: hereismygcpprojectname
      dataset: snowplow
      threads: 1
      timeout_seconds: 300
      location: EU
      priority: interactive
And my dbt_project.yml:
# settings specifically for this models directory
# config other dbt settings within ~/.dbt/profiles.yml
name: 'snowplow'
version: '0.0.1'
source-paths: ["models"]
target-path: "target"
clean-targets: ["target"]
test-paths: ["test"]
analysis-paths: ["analysis"]
data-paths: ["data"]
macro-paths: ["macros"]
models:
  snowplow:
    base:
      materialized: ephemeral
      optional:
        enabled: false
    page_views:
      optional:
        enabled: false
    vars:
      'snowplow:events': pageviews
      'snowplow:context:web_page': null
      #'snowplow:context:performance_timing': TABLE OR {{ REF() }} or FALSE
      #'snowplow:context:useragent': TABLE OR {{ REF() }} or FALSE
      'snowplow:timezone': 'Europe/Berlin'
      'snowplow:page_ping_frequency': 10
      'snowplow:app_ids': ['MZ Webseite']
      'snowplow:pass_through_columns': []
My dataset in BigQuery:
hereismygcpprojectname:snowplow:pageviews
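Since the error says the table name "pageviews" is missing a dataset, one thing I could try is qualifying the table name in the variable. This is only a sketch, assuming the snowplow package inserts the value of 'snowplow:events' verbatim into its SQL (I have not verified this) and that the dataset.table form is accepted there:

```yaml
models:
  snowplow:
    vars:
      # Assumption: dataset-qualified name instead of the bare 'pageviews',
      # so BigQuery does not need a default dataset to resolve the table
      'snowplow:events': snowplow.pageviews
```

If that is the right direction, the project-qualified form hereismygcpprojectname.snowplow.pageviews might be another option, but I am not sure which one the package expects.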
I’m wondering what’s wrong. Can you help me?
Cheers
Andreas