Use 3-level namespace when catalog is set. #94


Merged · 10 commits · Jun 9, 2022

Conversation

@ueshin (Collaborator) commented May 12, 2022

Description

Uses 3-level namespace when catalog is set.
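As a rough sketch of what "3-level namespace" means here (the `Relation` class and render logic below are illustrative, not the actual dbt-databricks implementation): when a catalog (dbt's `database` field) is set, a relation renders as `catalog.schema.identifier`; otherwise it falls back to the 2-level `schema.identifier`.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Relation:
    # On Databricks, dbt's "database" field maps to the Unity Catalog name.
    identifier: str
    schema: str
    database: Optional[str] = None

    def render(self) -> str:
        # Use the 3-level namespace only when a catalog is set.
        parts = [self.database, self.schema, self.identifier]
        return ".".join(p for p in parts if p is not None)


print(Relation("orders", "analytics").render())          # analytics.orders
print(Relation("orders", "analytics", "main").render())  # main.analytics.orders
```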

@ueshin ueshin changed the title first try at catalog support [WIP] first try at catalog support May 12, 2022
@ewengillies commented

Thrilled to see work in this direction! I was just looking for this feature three days ago and am happy to see it's already in a PR. Is there anything I can do to help? Thanks :)

@ueshin ueshin changed the title [WIP] first try at catalog support Use 3-level namespace when setting catalog May 21, 2022
@ueshin ueshin marked this pull request as ready for review May 21, 2022 01:15
@ueshin ueshin changed the title Use 3-level namespace when setting catalog Use 3-level namespace when catalog is set. May 21, 2022
@ueshin ueshin marked this pull request as draft May 23, 2022 17:45
@ueshin (Collaborator, Author) commented May 23, 2022

Let me separate some changes to another PR.

@ueshin (Collaborator, Author) commented May 23, 2022

Now this is based on #98.

@ueshin ueshin marked this pull request as ready for review May 26, 2022 00:03
@ueshin ueshin mentioned this pull request May 26, 2022
@@ -67,3 +80,113 @@ def execute(
finally:
if staging_table is not None:
self.drop_relation(staging_table)

def list_relations_without_caching(
Collaborator:

It would be nice to add a new method in dbt-spark that returns the catalog field, to avoid duplicated code in the future (but we'd need to know dbt-spark's release schedule).

Collaborator (Author):

Yes, we can ask dbt-spark to make the change. For now, let's keep these here and work on it in a separate PR.

{% set tmp_relation = base_relation.incorporate(path = {
    "identifier": tmp_identifier,
    "schema": None,
    "database": None
}) %}
Collaborator:
If we don't specify "database": None here, what's the default value for it?

Collaborator (Author):
The tmp_relation will inherit the database of base_relation, which results in database.identifier as the temp relation name when base_relation has the database field set.

Collaborator:
Makes sense. Thanks for the explanation!
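The inheritance behavior discussed in this thread can be sketched as follows (a hypothetical `incorporate`, not dbt's actual implementation): path keys omitted from the override dict are carried over from the base relation, so `"database"` must be set to `None` explicitly to keep the temp relation name at a single level.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class Relation:
    identifier: str
    schema: Optional[str] = None
    database: Optional[str] = None

    def incorporate(self, path: dict) -> "Relation":
        # Only the keys present in `path` are overridden; the rest
        # are inherited from the base relation.
        return replace(self, **path)

    def render(self) -> str:
        return ".".join(
            p for p in (self.database, self.schema, self.identifier) if p is not None
        )


base = Relation("orders", schema="analytics", database="main")

# Omitting "database" inherits it from base: renders as main.orders__tmp.
kept = base.incorporate({"identifier": "orders__tmp", "schema": None})
print(kept.render())

# Explicitly clearing it yields the bare identifier: orders__tmp.
cleared = base.incorporate({"identifier": "orders__tmp", "schema": None, "database": None})
print(cleared.render())
```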

@@ -58,7 +73,7 @@

{{ adapter.valid_snapshot_target(target_relation) }}

{% set staging_table = spark_build_snapshot_staging_table(strategy, sql, target_relation) %}
{% set staging_table = databricks_build_snapshot_staging_table(strategy, sql, target_relation) %}
Collaborator:
Not urgent but should we use the dispatch pattern here to make it backward compatible?

Collaborator (Author):
That's a good point.
Actually, I think that if users override the spark_build_snapshot_staging_table macro, it will very likely break catalog support.
How about using spark_build_snapshot_staging_table only when target_relation.database is None?
We should probably also mention this in the changelog and release notes.
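The backward-compatibility idea proposed above could look roughly like this (a hedged Python sketch rather than Jinja; the two macro names match the PR, but the dispatching helper and its signatures are hypothetical): fall back to the legacy spark_ macro only when no catalog is involved.

```python
from typing import Optional


def spark_build_snapshot_staging_table(strategy, sql, target_schema: str) -> str:
    # Legacy 2-level behavior (stand-in for the dbt-spark macro,
    # which users may have overridden).
    return f"CREATE TABLE {target_schema}.snapshot_staging AS {sql}"


def databricks_build_snapshot_staging_table(
    strategy, sql, target_database: str, target_schema: str
) -> str:
    # Catalog-aware 3-level behavior (stand-in for the dbt-databricks macro).
    return f"CREATE TABLE {target_database}.{target_schema}.snapshot_staging AS {sql}"


def build_snapshot_staging_table(
    strategy, sql, database: Optional[str], schema: str
) -> str:
    # Use the legacy macro only when no catalog is set, so existing
    # user overrides keep working for 2-level projects.
    if database is None:
        return spark_build_snapshot_staging_table(strategy, sql, schema)
    return databricks_build_snapshot_staging_table(strategy, sql, database, schema)


print(build_snapshot_staging_table(None, "SELECT 1", None, "analytics"))
print(build_snapshot_staging_table(None, "SELECT 1", "main", "analytics"))
```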

# Insert a row into the seed model with an invalid id.
self.run_sql_file("insert_invalid_id.sql")
self.run_and_check_failure(
model_name,
err_msg="CHECK constraint id_greater_than_zero",
)
self.check_staging_table_cleaned()
self.run_sql(f"delete from {schema}.seed where id = 0")
self.run_sql("delete from {database_schema}.seed where id = 0")
Collaborator:
Interesting, so if we use {database_schema} in the query then dbt will automatically change it to use the target database and target schema?

Collaborator (Author):
Actually, I made some changes in tests/integration/base.py to handle database_schema.

Collaborator:
👍 it's in transform_sql
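A minimal sketch of what a transform_sql-style test helper might do (the real helper in tests/integration/base.py may differ): expand placeholders like `{schema}` and `{database_schema}` into the target namespace before running the SQL.

```python
from typing import Optional


def transform_sql(sql: str, schema: str, database: Optional[str] = None) -> str:
    # Illustrative placeholder expansion: {database_schema} becomes
    # catalog.schema when a catalog is set, plain schema otherwise.
    database_schema = f"{database}.{schema}" if database is not None else schema
    return sql.format(schema=schema, database_schema=database_schema)


sql = "delete from {database_schema}.seed where id = 0"
print(transform_sql(sql, "analytics"))          # delete from analytics.seed where id = 0
print(transform_sql(sql, "analytics", "main"))  # delete from main.analytics.seed where id = 0
```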

@ueshin ueshin requested a review from allisonwang-db June 8, 2022 18:05

@ueshin (Collaborator, Author) commented Jun 9, 2022

Thanks! merging.

@ueshin ueshin merged commit 6f462d4 into databricks:main Jun 9, 2022
@ueshin ueshin deleted the catalog branch June 9, 2022 19:35
ueshin added a commit to ueshin/dbt-databricks that referenced this pull request Jun 10, 2022
ueshin added a commit that referenced this pull request Jun 15, 2022