Unity Catalog Flashcards
What is Unity Catalog?
The catalog and governance layer for the Databricks Lakehouse
What problem does Unity Catalog solve?
1/Centralized governance of data and AI across multiple workspaces
2/Greater visibility through automatic data discovery and search features, with permissions that give the right people access to the right data
Why do customers care about Unity Catalog?
1/Simple
2/Cloud-agnostic (available on all 3 public clouds)
3/Improved auditing and visibility
How can you position Unity Catalog with a customer?
1/Raise it with administrators and platform teams, in conversations around data access, DevOps, and SecOps
2/UC is a top priority at Databricks; all customers should be introduced to it so they can protect their data and manage permissions
How does Unity Catalog work?
1/Users and groups are resolved at the account level via the Account Console
2/Unity Catalog metastore: a top-level container for UC objects such as credentials, access control lists, and external cloud locations
3/Access control lists can be managed via SQL, the UI, or the API
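Example (SQL route for access control): a minimal sketch of building a GRANT statement as a string. The helper name grant_statement, the table main.sales.orders, and the group data-analysts are hypothetical and not from these notes. The sketch only builds the statement text; actually running it assumes a Databricks SQL session attached to a Unity Catalog metastore.

```python
def grant_statement(privilege: str, securable_type: str,
                    securable_name: str, principal: str) -> str:
    """Build a GRANT statement string (hypothetical helper for illustration)."""
    # Backtick-quote the principal so names with hyphens or spaces stay valid;
    # a literal backtick inside the name is escaped by doubling it.
    quoted = "`" + principal.replace("`", "``") + "`"
    return f"GRANT {privilege} ON {securable_type} {securable_name} TO {quoted}"


# Hypothetical table and group names, used only for illustration.
stmt = grant_statement("SELECT", "TABLE", "main.sales.orders", "data-analysts")
print(stmt)
```

The same permission could instead be set through the UI or the API; the SQL form is shown only because it is the easiest to read on a flashcard.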
Key points of Unity Catalog?
1/Centralized data access controls and user management
2/Robust auditing
3/Secure data sharing outside the Databricks account via Delta Sharing
4/Lineage to track data throughout its lifecycle
5/Data search and discovery
True or False : You can only use UC within the Databricks Platform
True. You cannot use UC outside of Databricks
True or False : UC integrates with enterprise data catalogs?
True. It does not compete with or replace enterprise data catalogs
What to look for when pitching Unity Catalog?
1/Strong adoption of DBSQL across multiple workspaces
2/Use of table access control lists (table ACLs)
Be careful pitching Unity Catalog around…
Heavy presence of streaming and ML workloads
True or False : Unity Catalog works with other Enterprise Catalogs?
True. Customers can keep their external metastores, such as AWS Glue
What are the requirements and cost of Unity Catalog?
Requires the Premium tier. New pricing is coming in Q3: basic features will carry no additional charge, and premium features will be sold as add-ons.