8 - Data quality Flashcards

1
Q

What is data quality?

A

A measure of how well data represents real-world phenomena for business purposes

The four dimensions of quality are accuracy, validity, accessibility, and timeliness.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the four dimensions of data quality?

A
  • Accurate
  • Valid
  • Accessible
  • Timely
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why is data quality important for businesses?

A

Improves effectiveness and trust in data, leading to better decision-making and opportunities

Poor data quality can result in significant financial losses, as illustrated by past business failures.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What can poor data quality lead to in a business?

A
  • Missed opportunities
  • Poor decision-making
  • Increased complaints
  • Regulatory issues
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What was one major consequence for British Gas due to poor data quality?

A

The company wrote off £200 million in 2008 due to customer complaints and lost a million customers

Complaints primarily involved billing issues.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a potential risk of poor GDPR compliance?

A

Having multiple records of the same customer, leading to incomplete data deletion requests.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does Principle (d) of GDPR state?

A

You should ensure personal data held is not incorrect or misleading

This principle emphasizes the importance of data accuracy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is one example of a failure due to poor data quality?

A

Marketing targeting the wrong customers, leading to low response rates

This can result in wasted resources and missed revenue opportunities.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the first step to improve data quality in a project?

A

Convincing the business that improving data quality is important and useful.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are some benefits of high-quality data?

A
  • Creates efficiency
  • Eliminates errors
  • Improves decision-making
  • Enhances security
  • Provides quality reporting
  • Facilitates linking and sharing
  • Allows honest appraisal
  • Meets legal obligations
  • Measures performance
  • Controls budgets
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does the availability of data mean in the context of data quality?

A

Data users need relevant data to make decisions, which should be accessible as soon as it becomes available.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is meant by the timeliness of data?

A

Data should be captured and available quickly enough to support effective performance management.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How can accuracy of data be achieved?

A

By capturing data as close to the point of service delivery as possible.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does ‘COUNT’ stand for in the context of data accuracy?

A

Collect Once, Use Numerous Times.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Fill in the blank: Poor-quality data means that a business will miss potential opportunities to _______.

A

[grow]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

True or False: Poor data quality can lead to prosecution.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is a common issue with small cohorts of customer data?

A

They may be unbalanced and not representative of the larger population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is a potential consequence of complex or irrelevant performance indicators?

A

They may be misunderstood or misreported.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What can a limited data quality audit provide?

A

A useful quick win that can lead to more strategic initiatives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What was discovered about a marketing list that had never been investigated?

A

20 percent of the customers were deceased, leading to wasted marketing resources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is essential for ensuring data quality across different business units?

A

Recognizing that information requirements vary, but the need for good-quality data does not.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the relationship between data consistency and real-world processes?

A

Data that is consistent is more likely to reflect the real-world process that generates it, and so can be used with higher confidence when you make decisions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What must be balanced with the importance of data uses?

A

The costs and effort of collecting it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Why is it important for users of data to know about compromises in data accuracy?

A

So they don’t assume that accuracy is greater than it is.

25
Q

What critical data is needed for businesses regarding customer age?

A

Date of birth to calculate age accurately.

26
Q

What does validity of data refer to?

A

Recording and reporting data in ways that comply with compliance requirements or match internal standards.

27
Q

How can organizations ensure data validity?

A

Through data governance policies.

28
Q

What practical consideration must be taken when capturing data?

A

The method of acquisition, such as the document or system used.

29
Q

What is a data quality strategy?

A

A plan to improve data quality by creating systems that ensure checks, validation, and automation.

30
Q

What should a data quality strategy account for?

A

Different opinions depending on geographical location or business unit.

31
Q

What might sales teams prioritize in data capture?

A

Only the data needed to make the sale.

32
Q

What can regulation, like GDPR, limit?

A

What data can be collected and stored.

33
Q

Why is it important for the entire business to understand data quality?

A

To meet targets and improve overall business performance.

34
Q

What is necessary to measure before improving data quality?

A

Establishing a baseline of current data quality.

35
Q

What critical data items should be focused on for customer data?

A
  • Name
  • Address
  • Email address
  • Telephone number
36
Q

What does a quality score of 3 indicate about data?

A

The data is of high quality and meets the needs of all users.

37
Q

What does a quality score of 1 indicate about data?

A

The data is widely known to be inaccurate and not trusted for decision-making.

38
Q

What is the role of a data quality leader?

A

To analyze data quality and make recommendations for improvements.

39
Q

What does the data quality improvement team do?

A

Identifies challenges damaging data quality and analyzes statistics on data quality.

40
Q

What is data cleansing?

A

Removing obvious typing or cut-and-paste errors from data.

41
Q

What is the purpose of a data quality audit?

A

To measure data quality using specific metrics.

42
Q

What is a useful device for communicating data quality success?

A

A dashboard.

43
Q

What traffic-light measure can be used for data fields?

A
  • Red for less than 40% correct data
  • Amber for 40% to 70%
  • Green for greater than 70%
44
Q

What should targets for data quality improvement be?

A

Stretch but achievable.

45
Q

What is the purpose of external data in customer communication?

A

To determine if customers are unresponsive due to address changes, lack of replies, or passing away

External data helps clarify customer engagement issues.

46
Q

What is data cleansing?

A

A quick fix to remove obvious errors, enhancing data quality

It involves correcting typing or cut-and-paste errors.

47
Q

What can be captured from customers to improve data quality?

A

Email addresses, communication preferences, and relevant details

Contacting customers can provide valuable data insights.

48
Q

What is the task for improving long-term data quality?

A

Address the processes that create poor-quality data

Continuous improvement is needed for data creation processes.

49
Q

What are common reasons employees may not capture high-quality data?

A

Lack of training and understanding of data usage

Employees may view data capture as an inconvenience.

50
Q

What is often not formalized as part of job descriptions regarding data?

A

Ensuring data quality

There is typically no recognition for data accuracy efforts.

51
Q

What is a consequence of poor communication about data quality?

A

Continued capture of poor-quality data

Employees may not understand the importance of accurate data.

52
Q

What is essential for machine learning and AI success?

A

High-quality data for training

Quality of data significantly impacts machine learning outcomes.

53
Q

What are simple fixes to improve data entry accuracy?

A
  • More data validation
  • Address lookups using postcodes

Encouraging customers to enter their own data can also enhance accuracy.

54
Q

What are the four dimensions of data quality?

A
  • Accuracy
  • Validity
  • Timeliness
  • Accessibility

Measuring these dimensions is crucial for assessing data quality.

55
Q

What are the costs associated with poor-quality data?

A

Underperformance and elevated risk

The impact of poor-quality data may not be immediately apparent.

56
Q

What can be a more effective approach to gain budget for data quality projects?

A

Selling the positive benefits of high-quality data

Demonstrating achievable goals with quality data can attract attention.

57
Q

Fill in the blank: Data quality improvement needs its own _______.

A

[project]

Governance improvements alone are insufficient.

58
Q

What is a significant data error example mentioned?

A

Deutsche Bank accidentally transferred $35 billion to an outside account in 2018

This highlights the risks of data errors.

59
Q

What should organizations aim for regarding data acquisition processes?

A

Higher-quality acquisition processes for long-term improvement

These processes should be sustainable and beneficial.