Beginner Flashcards
Q: What is Data Vault 2.0?
A: Data Vault 2.0 is an agile, scalable, and flexible approach to data warehousing that focuses on capturing and integrating raw data from various sources for reporting and analytics.
Q: What are the main components of Data Vault?
A: The three core components are Hubs (business concepts), Links (relationships between Hubs), and Satellites (descriptive data about Hubs and Links).
Q: What is a Hub in Data Vault?
A: A Hub is a central table that stores the unique business keys identifying a core entity, such as customers or products, along with load metadata such as the load date and record source.
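To make the Hub idea concrete, here is a minimal in-memory sketch, not a production loader. It assumes the metadata is a load timestamp and a record source; the names `HubRow` and `load_hub` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Dict, Iterable, Optional


@dataclass(frozen=True)
class HubRow:
    business_key: str      # unique identifier from the source system
    load_date: datetime    # when the key was first seen (assumed metadata column)
    record_source: str     # which system delivered it (assumed metadata column)


def load_hub(hub: Dict[str, HubRow], business_keys: Iterable[str],
             record_source: str, load_date: Optional[datetime] = None) -> int:
    """Insert business keys not yet in the hub; return how many were added."""
    load_date = load_date or datetime.now(timezone.utc)
    added = 0
    for key in business_keys:
        if key not in hub:  # a hub holds each business key exactly once
            hub[key] = HubRow(key, load_date, record_source)
            added += 1
    return added
```

Loading the same key a second time, even from a different source, leaves the existing row untouched, so the Hub keeps the first load date and record source for each key.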
Q: What is a Link in Data Vault?
A: A Link is a table that represents the relationships between Hubs, allowing for the modeling of many-to-many relationships within the data warehouse.
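The many-to-many behaviour of a Link can be sketched as a set of key pairs. This is an illustrative sketch only: the customer and product Hubs, and the helper names `load_link`, `products_for`, and `customers_for`, are assumptions, and a real Link table would also carry load metadata.

```python
from typing import Iterable, Set, Tuple

# (customer_key, product_key): both are keys of hypothetical Hubs
LinkRow = Tuple[str, str]


def load_link(link: Set[LinkRow], pairs: Iterable[LinkRow]) -> int:
    """Record each relationship once; return how many new rows were added."""
    before = len(link)
    link.update(pairs)  # a set silently ignores pairs that already exist
    return len(link) - before


def products_for(link: Set[LinkRow], customer_key: str) -> Set[str]:
    """All products related to one customer."""
    return {product for customer, product in link if customer == customer_key}


def customers_for(link: Set[LinkRow], product_key: str) -> Set[str]:
    """All customers related to one product."""
    return {customer for customer, product in link if product == product_key}
```

Because the relationship lives in its own structure, one customer can relate to many products and one product to many customers without changing either Hub.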
Q: What is a Satellite in Data Vault?
A: A Satellite contains descriptive attributes and historical data related to a Hub or Link, capturing changes over time while providing context to the data.
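One common way to capture changes over time is to append a new Satellite row only when the descriptive attributes differ from the latest stored row. The sketch below assumes that insert-only pattern and uses a hash of the attributes for comparison; the row layout and the names `hash_diff` and `load_satellite` are hypothetical.

```python
import hashlib
import json
from datetime import datetime


def hash_diff(attributes: dict) -> str:
    """Fingerprint the attributes so two versions can be compared cheaply."""
    payload = json.dumps(attributes, sort_keys=True, default=str)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def load_satellite(satellite: list, parent_key: str,
                   attributes: dict, load_date: datetime) -> bool:
    """Append a row only if the attributes changed; return True if appended."""
    new_diff = hash_diff(attributes)
    latest = None
    for row in satellite:  # find the most recent row for this Hub or Link key
        if row["parent_key"] == parent_key and (
            latest is None or row["load_date"] > latest["load_date"]
        ):
            latest = row
    if latest is not None and latest["hash_diff"] == new_diff:
        return False  # nothing changed, so no new history row
    satellite.append({
        "parent_key": parent_key,
        "load_date": load_date,
        "hash_diff": new_diff,
        "attributes": dict(attributes),  # copy so later edits do not alter history
    })
    return True
```

Existing rows are never updated or deleted, so the Satellite accumulates one row per distinct version of the attributes.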
Q: Why is Data Vault considered agile?
A: Data Vault is considered agile because it supports iterative development: new data sources and changes can be added as new Hubs, Links, and Satellites without restructuring the tables that already exist.
Q: What is the role of Business Keys in Data Vault?
A: Business Keys uniquely identify records in Hubs and are essential for ensuring data consistency and integrity across different data sources.
Q: What advantages does Data Vault 2.0 offer over traditional data warehousing approaches?
A: Advantages include improved flexibility, scalability, better handling of changes, support for agile methodologies, and a focus on auditability and traceability.
Q: What is the purpose of Hash Keys in Data Vault 2.0?
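A: Hash Keys replace variable-length or composite business keys with a fixed-length value computed deterministically from them. This gives a consistent way to join and compare records, and because the same input always produces the same hash, reloading a record does not create a duplicate key.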
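A hash key can be sketched with Python's standard `hashlib`. The normalization rules (trim and uppercase), the `||` delimiter, and the choice of SHA-256 are assumptions made for illustration, and `hash_key` is a hypothetical helper name; real projects fix these conventions in their own standards.

```python
import hashlib


def hash_key(*business_key_parts: str, delimiter: str = "||") -> str:
    """Return a fixed-length hex digest for one or more business key parts."""
    # Normalize so the same key arriving from different sources hashes identically.
    normalized = [part.strip().upper() for part in business_key_parts]
    # The delimiter keeps ("A", "B") distinct from ("AB",).
    joined = delimiter.join(normalized)
    return hashlib.sha256(joined.encode("utf-8")).hexdigest()
```

Every call returns a 64-character hex string regardless of how long or how many the input parts are, and the same business key always yields the same value, so separate loaders can compute it independently.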
Q: How does Data Vault handle slowly changing dimensions?
A: Data Vault handles slowly changing dimensions through Satellites, which capture and store historical changes without altering the structure of existing Hubs and Links.
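Because a Satellite keeps every version of the attributes, a historical view can be read back with an "as of" lookup. The sketch below assumes each row carries a `load_date`; the sample data, the row layout, and the names `CUSTOMER_SAT` and `as_of` are hypothetical.

```python
from datetime import date

# Hypothetical Satellite rows for one customer: each change is a new row.
CUSTOMER_SAT = [
    {"parent_key": "C-1", "load_date": date(2023, 1, 10), "attributes": {"city": "Oslo"}},
    {"parent_key": "C-1", "load_date": date(2024, 6, 1), "attributes": {"city": "Bergen"}},
]


def as_of(satellite_rows, parent_key, when):
    """Return the attributes in effect for parent_key on the given date, or None."""
    candidates = [
        row for row in satellite_rows
        if row["parent_key"] == parent_key and row["load_date"] <= when
    ]
    if not candidates:
        return None  # the key had no recorded attributes yet on that date
    # The latest row loaded on or before the date is the version in effect.
    return max(candidates, key=lambda row: row["load_date"])["attributes"]
```

Querying a date between the two loads returns the older attributes, and a date after the second load returns the newer ones, all without touching the Hub or Link structure.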