18. Data Flashcards

1
Q

Define ‘personal data’

A

Data where an individual could be identified when combined with other data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What allows the collection & storage of vast amounts of data

A

Improved technology

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How can data legislation vary by jurisdication?

A

Objectives & expected behaviour are similar but legislation varies. The US has much less stringent laws

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why must extra care be taken when transferring data between countries?

A

Data legislation can vary between jurisdiction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Name the 8 categories of ‘sensitive personal data’ (RREPPCST)

A
  • Race
  • Religion
  • Ethnicity
  • Political opinion
  • Physical/mental condition
  • Convictions
  • Sex life
  • Trade union membership
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Name the 3 qualities by which big data can be categorised

A
  • Very large datasets
  • Brought together from many sources
  • Can be analysed quickly
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How can big data be altered to provide data protection?

A

Anonymisation can remove any personal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the theory of data minimisation?

A

That big data is excessive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Complexity of big data is not an excuse for…

A

failure to comply

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define ‘data governance’ SIAU

A

The term used to describe overall management of availability, usability, integrity & security of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a data governance policy?

A

A documented set of guidelines for data management, detailing how data is captured, analysed & processed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What 6 things does a data governance policy detail? RUSCCM

A
  • Use for data
  • Roles/responsibilities of individuals
  • How data is captured, analysed & processed
  • Security/privacy issues
  • Details of controls to meet standards
  • How adequacy of controls is monitored
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Name 3 risks of poor data governance

A
  • Fines
  • Reputational damage
  • Inability to rely on data for use
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What should the data governance policy detail regarding a merger?

A

The risk of aggregating data & data systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Give an advantage of combining data & data systems in a merger

A

Adv: overhead savings
Disadv: cost of converting systems is high

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Name 5 risks around data & its suitability for use

A
  1. Errors/omissions
  2. Insufficient data
  3. Credibility
  4. Not reflective of future experience
  5. Form - not in required form for purpose
17
Q

Give 5 reasons why data may not be a reflection of future experience

A
  1. Random fluctuation
  2. Abnormal events in data
  3. Changes to data recording
  4. Change in balance of homogenous grouping
  5. Socio-economic change
18
Q

What is the result of a lack of confidence in data

A

A lack of confidence in conclusions

19
Q

What is the issue with very small homogenous groups

A

They can be too small to draw credible conclusions & if merged to form sufficiently sized groups, it may reduce homogeneity

20
Q

What is algorithmic decision making?

A

Automated trading to capitalise on price discrepancies across markets

21
Q

List the benefits of algorithmic decision making

A
  • Quicker, more consistent decision making
  • Lower dealing cost
22
Q

How can advancements in big data aid algorithmic decision making?

A

Allows for greater accuracy in setting parameters

23
Q

Name 6 risks of algorithmic decision making

A
  • Algorithm error
  • Data error
  • Creating instability in markets (plunge & rebound)
  • Turbulent conditions can cause market suspension
  • May not operate in turbulent markets
24
Q

Name 3 reasons why data for all tasks should be controlled through a single system

A
  • Audit trail
  • Easier access
  • Lower chance of data corruption
25
Q

Why might competitor data be limited in its usefullness?

A
  • Different benefits offered
  • Difference in target market
  • Difference in approach to valuation (prudence in CBE)
26
Q

Why does it take a long time to accumulate good data?

A

Data takes many years to accumulate so must have good systems in place

27
Q

What kind of questions are used on the proposal form & why?

A
  • Tick boxes to be easily entered
  • Unambiguous for accurate information
  • Rating factors used to translate qualitative to quantitative
28
Q

Who provides the data used in employee benefit schemes?

A

Sponsor (employer)

29
Q

Why can data be a particularly prevalent issue in employee benefit schemes?

A

Provided by sponsor (employer) who may not have sufficiently detailed or reliable data

30
Q

Name 3 good checks on data

A
  • Checks against data from past valuation date
  • Checks against accounting data
  • Assertations
31
Q

Name the 3 things to be attested to

A
  • Appropriate valuation date
  • Complete
  • Assets/liabilities exist on given date
32
Q

How can data be checked if it’s not possible to check an entire dataset?

A

Random spot checks

33
Q

What is important to note when using summarised data (summarised due to insufficient volume or detail)

A

Recognise the reliability of results will be impacted

34
Q

Give an example of an industry-wide data collection scheme

A

IFoA PPO working party

35
Q

Give 2 benefits of using industry-wide data collection scheme

A
  • Can compare experience with the industry
  • Can compare homogenous groupings
36
Q

Name 6 potential causes of heterogeneity distortion in industry-wide data collection schemes

A
  • Different policies sold
  • Different sales methods
  • Different underwriting process
  • Different risk factors
  • Different data systems
  • Different socio-economic conditions
37
Q

Give 3 other distortions in industry-wide data collection schemes

A
  • out-of-date
  • less detailed
  • not all firms participate so not fully market representative