Ch 19: Data Flashcards

Question 1

Q

Define personal data

Answer

A

Personal data is information that relates to an individual which would allow that individual to be identified, or where the data combined with other information could allow the individual to be identified

Question 2

Q

Eight principles which must be followed when processing personal data

Answer

A

Personal data must:
1. Be processed fairly and lawfully
2. Be obtained and processed for specified purposes
3. Be adequate, relevant and not excessive for the purposes concerned
4. Be accurate, and where necessary, kept up to date
5. Not be kept longer than necessary for the purposes concerned
6. Be processed in accordance with the individual’s rights
7. Be processed securely
8. Not be transferred to another company or country unless that party ensures an adequate level of protection

Question 3

Q

Examples of what might count as ‘sensitive personal data’

Answer

A

Sensitive personal data can include information related to:
1. Racial or ethnic origin
2. Political opinions
3. Religious or other beliefs
4. Membership of trade unions
5. Physical or mental health or condition
6. Sexual life
7. Convictions, proceedings and criminal acts

Question 4

Q

Give examples of circumstances when sensitive personal information may be legitimately processed

Answer

A

The data subject has given explicit consent
It is required by law for employment purposes
It is needed in order to protect the vital interests of the individual or another person
It is needed in connection with the administration of justice or legal proceedings

Question 5

Q

State three characteristics of ‘big data’

Answer

A

Big data can be characterised by:
1. The data sets are very large
2. Data is brought together from different sources
3. Data can be analyzed very quickly, for example in real time

Question 6

Q

State four risks to a company of not having adequate data governance procedures

Answer

A

Legal and regulatory non-compliance
Inability to rely on data for decision making
Reputational issues, leading to loss of business
Incurring additional costs such as fines and legal costs

Question 7

Q

Define ‘data governance’ and list the guidelines that a data governance policy may cover

Answer

A

Data governance – the overall management of the availability, usability, security and integrity of data employed in an organization

A data governance policy will set out guidelines with regards to:
1. The specific roles and responsibilities of individuals in the organization with regards to data
2. How an organization will capture, analyze and process data
3. Issues with respect to data security and privacy
4. The controls that will be put in place to ensure that the required data standards are applied
5. How the adequacy of controls will be monitored on an ongoing basis with respect to data usability, accessibility, integrity and security
6. Ensuring that the relevant legal and regulatory requirements in relation to data management are met by the organization

Question 8

Q

List the main sources of data

Answer

A

TRAINERS

Tables
Reinsurers
Abroad (data from overseas contracts)
Industry data
National statistics
Experience investigations on the existing contract
Regulatory reports and company accounts
Similar contracts

Question 9

Q

What is the overriding principle in relation to all the different uses of data?

Answer

A

There should be one single, integrated data system so that the data used for different applications is consistent

Question 10

Q

Define algorithmic trading

Answer

A

This is a form of automated trading that involves buying and selling financial securities electronically to capitalize on price discrepancies for the same stock or asset in different markets.

(The term can also refer to high-frequency trading)

Question 11

Q

Explain the risks of algorithmic trading

Answer

A

Errors in the algorithm or data used to parameterize the model, leading to losses
The algorithm may not operate properly in adverse conditions
In very turbulent conditions, trading in individual stocks or markets may be suspended before an algorithmic trade can be completed
Possible impacts on the financial system: failure of one market could impact other markets and asset classes

Question 12

Q

List the key risks associated with using data

Answer

A

Data are inaccurate or incomplete, leading to erroneous results or conclusions
Data are not credible due to insufficient volume, particularly for extreme outcomes
Data are not sufficiently relevant to the intended purpose
Historical data do not reflect what will happen in the future (abnormal events; significant random fluctuations; not up to date; homogeneous groups change)
Chosen data groups are not optimal
Data are not available in an appropriate form for the intended purpose
Lack of confidence in the data leads to a lack of confidence in the results obtained from using it

Question 13

Q

What two main factors cause data to be of poor quality and quantity?

Answer

A

Poor management control of data recording and checking
Poor design of data systems

Question 14

Q

How can good quality data be ensured from insurance proposal and claim forms?

Answer

A

Questions should be well designed and unambiguous so that full information is given and so that applications / claims can be easily processed
Use questions with quantitative or tick-box answers wherever possible
Questions should be in the same order as the input into the administration systems, for quick processing of applications / claims
Ask the policyholder to verify the key information
All rating factors should be readily identifiable so that the composition of the final premium can be determined
Underwriting results should be added to the proposal form
Forms should be designed so that information can be easily analyzed, and cross checks made between the two sources

Question 15

Q

Why is it important, at the time of the claim, to have access to the information given on the proposal form?

Answer

A

To check the validity of the claim
To update policy information

Question 16

Q

Why is it important that the insurance company retains a past history of policy and claims records?

Answer

A

When an insurance company analyses past experience in order to help set future assumptions, several years’ worth of data are often needed in order to give a sufficient volume of data, or to identify trends

Question 17

Q

What is the key problem with data for employee benefit schemes?

Answer

A

The actuary does not have full control over the data, as it is provided by the sponsor

The consequences of this may be poor quality or summarized data

NOTE: It is therefore particularly important to validate this type of data

Question 18

Q

What four sources of data are useful in order to conduct a valuation of a benefits scheme?

Answer

A

Membership data on individuals who are currently receiving benefits and those who will be entitled to benefits in the future
Data from the previous valuation for reconciliation with current data to help validate the current data
Accounting data for information on asset values, benefit outgo and contribution income to help check other sources of data or in setting assumptions
A full listing of the actual assets held to enable an accurate valuation of assets and to check whether they are permitted by regulation or subject to regulatory restrictions

Question 19

Q

Give examples of reconciliation checks that can be performed on data

Answer

A

Reconciling the total number of members / policies and changes in membership / policies using previous data and movement data
Reconciling the total benefit amounts and premiums and changes in them, using previous data and movement data
Where assets are held by a third party, reconciliation between the beneficial owner’s and custodian’s records
Reconciling shareholding at the start and end of the period, adjusted for sales and purchases, and bonus issues
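
The first check above can be sketched in a few lines of Python. This is only an illustrative sketch: the function name and the figures are hypothetical, and a real reconciliation would be broken down by movement type (new entrants, deaths, surrenders and so on).

```python
def reconcile_count(opening, joiners, exits, closing):
    """Return the unexplained difference between the reported closing
    count and the count implied by the opening count and movement data."""
    expected_closing = opening + joiners - exits
    return closing - expected_closing


# Hypothetical figures, for illustration only
difference = reconcile_count(opening=1000, joiners=120, exits=85, closing=1035)
print(difference)  # 0 means the movement data fully explains the change
```

A non-zero difference does not say which record is wrong, only that the previous data, movement data and current data disagree and need investigating.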

Question 20

Q

Give examples of cross-checks that can be performed on data

Answer

A

Checking movement data against accounting data, e.g. benefit payments
Checking membership data against accounting data, e.g. contributions
Checking asset data against accounting data, e.g. investment returns
Full deed audit, for example checking title deeds to large real property assets
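
As a rough illustration of checking membership data against accounting data, the sketch below compares the contributions implied by the membership records with the total shown in the accounts. The field names, the 2% tolerance and the figures are all assumptions made for the example.

```python
def cross_check_contributions(members, accounting_total, tolerance=0.02):
    """Compare contributions implied by membership data with the
    accounting figure; return the implied total and whether the two
    agree within the given relative tolerance."""
    implied_total = sum(m["salary"] * m["contribution_rate"] for m in members)
    relative_gap = abs(implied_total - accounting_total) / accounting_total
    return implied_total, relative_gap <= tolerance


# Hypothetical membership records and accounting figure
members = [
    {"salary": 40000, "contribution_rate": 0.05},
    {"salary": 60000, "contribution_rate": 0.05},
]
implied, within_tolerance = cross_check_contributions(members, accounting_total=5050)
print(implied, within_tolerance)
```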

Question 21

Q

Give examples of reasonableness checks that can be performed on data

Answer

A

Checking that the average sum assured or premium looks sensible for the class of business
Checking the average sum assured or premium against previous data
Checking for unusual values, impossible dates or missing records
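
A minimal sketch of how these checks might be automated is given below. The record layout (`policy_id`, `sum_assured`, `date_of_birth`, `start_date`), the 25% tolerance and the sample data are assumptions for illustration, not a prescribed set of tests.

```python
from datetime import date
from statistics import mean


def reasonableness_issues(records, previous_average, valuation_date, tolerance=0.25):
    """Return a list of messages describing records or averages that look unreasonable."""
    issues = []

    # Average sum assured compared with the previous data
    sums = [r["sum_assured"] for r in records if r["sum_assured"] is not None]
    if sums:
        average = mean(sums)
        if abs(average - previous_average) > tolerance * previous_average:
            issues.append(
                f"average sum assured {average:.0f} differs from previous {previous_average:.0f}"
            )

    for r in records:
        # Missing or unusual values
        if r["sum_assured"] is None:
            issues.append(f"{r['policy_id']}: missing sum assured")
        elif r["sum_assured"] <= 0:
            issues.append(f"{r['policy_id']}: non-positive sum assured")
        # Impossible dates
        if r["date_of_birth"] >= r["start_date"]:
            issues.append(f"{r['policy_id']}: date of birth not before start date")
        if r["start_date"] > valuation_date:
            issues.append(f"{r['policy_id']}: start date after valuation date")
    return issues


# Hypothetical records, for illustration only
records = [
    {"policy_id": "P1", "sum_assured": 100000,
     "date_of_birth": date(1980, 5, 1), "start_date": date(2015, 1, 1)},
    {"policy_id": "P2", "sum_assured": None,
     "date_of_birth": date(1975, 3, 9), "start_date": date(2018, 6, 1)},
    {"policy_id": "P3", "sum_assured": 120000,
     "date_of_birth": date(2020, 1, 1), "start_date": date(2010, 1, 1)},
]
issues = reasonableness_issues(records, previous_average=105000,
                               valuation_date=date(2023, 12, 31))
for message in issues:
    print(message)
```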

Question 22

Q

Give examples of spot checks that can be performed on data

Answer

A

Random checking of individual member or policy data
Checking individual assets or liabilities exist / are held on a given date
Checking that the correct value of an asset or liability has been recorded
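
Random selection of records for a spot check could be sketched as follows. The function name, the policy identifiers and the use of a fixed seed (so that the same sample can be re-drawn for audit purposes) are assumptions for the example.

```python
import random


def spot_check_sample(policy_ids, sample_size, seed=None):
    """Draw a random sample of policy identifiers, without replacement,
    for manual checking against source documents."""
    rng = random.Random(seed)
    k = min(sample_size, len(policy_ids))
    return rng.sample(policy_ids, k)


# Hypothetical policy identifiers
policy_ids = [f"P{n:04d}" for n in range(1, 201)]
sample = spot_check_sample(policy_ids, sample_size=5, seed=19)
print(sample)
```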

Question 23

Q

Outline three problems with using summarized data

Answer

A

The reliability of the valuation will be reduced, as full validation of the data is impossible
Summarized data may miss significant differences between the nature of the benefits that have been grouped together
Summarized data cannot be used to value options and guarantees that apply at an individual level

Question 24

Q

Reasons why industry data is not directly comparable

Answer

A

Different geographical and socio-economic markets
Different policies
Different sales methods
Different practices, e.g. underwriting and claims settlement processes
Different nature of data stored
Different coding of risk factors

Question 25

Q

Four other problems with industry data:

Answer

A

Less detailed and flexible than internal data
More out-of-date than internal data
Data quality depends on the quality of the data systems of all its contributors
Not all organizations contribute, and those that do may not be representative of the market

Question 26

Q

What is risk classification and what is its main aim?

Answer

A

Risk classification – a tool for analyzing a portfolio of prospective risks by their risk characteristics, such that each subgroup of risks represents a homogeneous body of risk.

The main aim of risk classification is to split data into homogeneous groups so that the experience of each group is more stable, and data can be more accurately used, for example to set premiums
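
As an illustration only, the sketch below groups policies by chosen risk characteristics and calculates the claim frequency for each group. The rating factors (`age_band`, `smoker`), the field names and the data are hypothetical; in practice the choice of groups must balance homogeneity against having enough data in each group to be credible.

```python
from collections import defaultdict


def claim_frequency_by_group(policies, keys):
    """Group policies by the given risk characteristics and return
    claims per year of exposure for each group."""
    exposure = defaultdict(float)
    claims = defaultdict(int)
    for p in policies:
        group = tuple(p[k] for k in keys)
        exposure[group] += p["exposure_years"]
        claims[group] += p["claims"]
    return {g: claims[g] / exposure[g] for g in exposure if exposure[g] > 0}


# Hypothetical policy records
policies = [
    {"age_band": "30-39", "smoker": False, "exposure_years": 1.0, "claims": 0},
    {"age_band": "30-39", "smoker": False, "exposure_years": 1.0, "claims": 1},
    {"age_band": "30-39", "smoker": True, "exposure_years": 0.5, "claims": 1},
]
frequencies = claim_frequency_by_group(policies, keys=["age_band", "smoker"])
for group, frequency in frequencies.items():
    print(group, frequency)
```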

Question 27

Q

When is data ‘Consistent’?

Answer

A

Consistent means that when comparing the experience of one group of policyholders with another, say, the data used as a basis for the calculations for each group should be:
- Similar
- Preferably extracted from the same source
- Grouped according to the same criteria
- Equal in terms of reliability