Chapter 22 Flashcards

1
Q

What is quantify

A

Measurement of quantity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Does data quantify subjectivity or objectivity

A

On both

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are methods for data quantify objectively

A
  • Dependent

- Independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is independent objective data quantification

A

Data is independent. Data does not effect by organization rules

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is dependent objective data quantification

A

Data is dependent and effect by organization rules

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Please tell more data dimension

A
  • Believability
  • Appropriate amount of data
  • Timeliness
  • Accessibility
  • Objectivity
  • Interpretability
  • Uniqueness
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are data quality assessment techniques

A
  • Min-max
  • Simple ratio
  • Weighted average
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is positive simple ratio

A

It is the ratio of desirable records with reference of total number of records subtract from 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is negative simple ratio

A

It is the ratio of undesirable records with reference of total number of records subtract from 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What simple ratio is used in longitudinal analysis

A

positive simple ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is min-max data quality assessment technique

A

It relates to set of data and min or max of them. First we convert attributes of data normalize and then we take min max. When we take minimum value we are conservative and when we take maximum we are liberal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is free of error ratio:simple ratio

A

Negative simple ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is completeness of data:simple ratio

A

Completeness of data can be measured through 3 aspects.
1- Schema
2- Column completeness
3- Population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is consistency:simple ratio

A

There are 2 types of consistency
1- Variation (e.g. karachi, khi, KHI, Karachi)
2- Functional integrity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are min-max measurements

A
  • Believability
  • Appropriate amount of data
  • Timeliness
  • Accessiblity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Does believability a minimum (conservative) value

A

Yes

17
Q

What is the formula of timeliness

A

Max {0, 1-CM}
C = A + Dt - It
(Dt = Delivery time, It = Input time)

18
Q

What is formula of accessibility

A

Max {0, 1-trd/tru}

19
Q

What comes first assessment or validation

A

assessment

20
Q

What are methods for validation

A

1- Referential integrity validation (records without reference)
2- Attribute domain validation
3- Business rules
4- Analyze data

21
Q

What is data profiling

A

Data profiling is the process of examining the data available from an existing information source (e.g. a database or a file) and collecting statistics about that data.

22
Q

What are orphan records

A

Records without reference

23
Q

What are 3 steps for attribute domain validation

A

Step 1- Capture and quantify
Step 2- Compare
Step 3- Investigate

24
Q

What is histogram

A

History of data in DWH. It can be represented in graph form.