Data Engineering Fundamentals - Data Validation and Profiling Flashcards
_________________ Ensures all required data is present and no essential parts are running.
a) Completeness
b) Consistency
c) Accuracy
d) Integrity
Completeness
Checks: missing values, nulls counts, percentage of populated fields.
__________ Ensures data values are consistent across datasets and do not contradict each other.
a) Completeness
b) Consistency
c) Accuracy
d) Integrity
a) Consistency
Cross Field validation, comparing data from different sources or periods.
Ensures data is correct, reliable and represents what is supposed to.
a) Completeness
b) Consistency
c) Accuracy
d) Integrity
Accuracy
Comparing with trusted sources, validation against known standards or rules.
______________ Ensures data maintains its correctness and consistency over its lifecycle and across systems.
a) Completeness
b) Consistency
c) Accuracy
d) Integrity
Integrity