Data Vault #
Introduction #
Data Vault is a hybrid modeling approach that combines aspects of third normal form (3NF) and the star schema. It is designed for large-scale data warehouses that need long-term historical storage of data arriving from multiple operational systems. Because business keys, relationships, and descriptive attributes are kept in separate tables, the model can grow and change without reworking what is already loaded.
Characteristics: #
- Robustness: Highly adaptable to changes in the environment, making it suitable for enterprises where change is constant.
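- Complexity: The model is built from three types of tables: Hubs (key business concepts, identified by their business keys), Links (associations between hubs), and Satellites (descriptive attributes and their historical versions). This separation typically produces more tables and joins than a star schema.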
- Scalability and Flexibility: Scales well and can integrate disparate systems over time. A new source usually adds new hubs, links, or satellites rather than forcing a significant redesign of existing tables.
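The three table types described above can be illustrated with a minimal sketch. It uses Python's built-in `sqlite3` module with an in-memory database. All table names, column names (`customer_hk`, `load_ts`, `record_source`), the sample data, and the `hash_key` helper are hypothetical choices made for illustration. Hashing the business key with SHA-256 is one possible convention, assumed here, not a requirement of the approach.

```python
import hashlib
import sqlite3

# Hypothetical schema: two hubs, one link, one satellite.
DDL = """
CREATE TABLE hub_customer (
    customer_hk   TEXT PRIMARY KEY,      -- surrogate key derived from the business key
    customer_id   TEXT NOT NULL UNIQUE,  -- business key from the source system
    load_ts       TEXT NOT NULL,
    record_source TEXT NOT NULL
);
CREATE TABLE hub_order (
    order_hk      TEXT PRIMARY KEY,
    order_id      TEXT NOT NULL UNIQUE,
    load_ts       TEXT NOT NULL,
    record_source TEXT NOT NULL
);
CREATE TABLE link_customer_order (
    customer_order_hk TEXT PRIMARY KEY,
    customer_hk   TEXT NOT NULL REFERENCES hub_customer(customer_hk),
    order_hk      TEXT NOT NULL REFERENCES hub_order(order_hk),
    load_ts       TEXT NOT NULL,
    record_source TEXT NOT NULL
);
CREATE TABLE sat_customer_details (
    customer_hk   TEXT NOT NULL REFERENCES hub_customer(customer_hk),
    load_ts       TEXT NOT NULL,         -- each change is stored as a new row
    name          TEXT,
    city          TEXT,
    record_source TEXT NOT NULL,
    PRIMARY KEY (customer_hk, load_ts)
);
"""


def hash_key(*business_keys):
    """Derive a deterministic key from one or more business keys (assumed convention)."""
    normalized = "||".join(k.strip().upper() for k in business_keys)
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def build_vault():
    """Create the example tables in an in-memory SQLite database."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(DDL)
    return conn


def load_sample(conn):
    """Load one customer, one order, their link, and two satellite versions."""
    cust_hk, order_hk, src = hash_key("C-1"), hash_key("O-1"), "crm"
    conn.execute("INSERT INTO hub_customer VALUES (?, ?, ?, ?)",
                 (cust_hk, "C-1", "2024-01-01", src))
    conn.execute("INSERT INTO hub_order VALUES (?, ?, ?, ?)",
                 (order_hk, "O-1", "2024-01-01", src))
    conn.execute("INSERT INTO link_customer_order VALUES (?, ?, ?, ?, ?)",
                 (hash_key("C-1", "O-1"), cust_hk, order_hk, "2024-01-01", src))
    # The customer moves: history is kept by inserting a row, not updating one.
    conn.executemany("INSERT INTO sat_customer_details VALUES (?, ?, ?, ?, ?)", [
        (cust_hk, "2024-01-01", "Ada", "Berlin", src),
        (cust_hk, "2024-06-01", "Ada", "Hamburg", src),
    ])
    conn.commit()
    return cust_hk


def current_customer(conn, cust_hk):
    """Return the most recent satellite version (name, city) for a customer."""
    return conn.execute(
        "SELECT name, city FROM sat_customer_details "
        "WHERE customer_hk = ? ORDER BY load_ts DESC LIMIT 1",
        (cust_hk,),
    ).fetchone()


if __name__ == "__main__":
    conn = build_vault()
    hk = load_sample(conn)
    print(current_customer(conn, hk))
```

In this sketch the hub row never changes once loaded. Descriptive changes only append rows to the satellite, and a second source system could attach its own satellite to `hub_customer` without altering the existing tables.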
Learning Resources #
Books #
- Modeling the Agile Data Warehouse with Data Vault
- Building a Scalable Data Warehouse with Data Vault 2.0
Courses #
- Data Vault vs Traditional Data Warehouse Architectures
- What is a Data Vault ? | 3NF vs Dimensional model vs Data Vault | Quick Starter Guide in 2023
- Comparing 3 Types of Data Modeling (Normalized vs Star Schema vs Data Vault)
- A brief introduction to Data Vault 2.0 - Part 1 of 7 - Introduction
- How to create a Data Vault Model from scratch
- Understand the Data Vault Architecture