19 - Data filtered Flashcards
What is Personal data
Personal data is information which would allow an individual to be identified, either on its own or when combined with other information
Examples of personal data
Name Address Personal email address Occupation DOB Health status Race or ethnicity Criminal record
What is Sensitive personal dat?
Sensitive personal data is information which is more private to the individual
Its disclosure to others without consent could cause the individual a high level of distress or damage
Examples of sensitive personal data
o Racial or ethnic origin o Political opinions o Religious or other similar beliefs o Membership of trade unions o Physical or mental health condition o Sexual life o Convictions, proceeding and criminal acts
Under what conditions might Sensitive personal data be processed
- The data subject has given explicit consent
- It is required by law for employment purposes
- It is needed in order to protect the vital interests of the individual or another person
- It is needed in connection with the administration of justice or legal proceedings
What is the main concern of using / transferring data across international borders?
The legislation around data handling may be more stringent in one of the two countries and organisations need to take extra care to not breach local standards.
Aside from criminal action and fines, what is another damaging effect of data breaches occurring within a companyβs data bases?
Damage to reputation and the ability to retain and attract clients.
What is data governance?
Data governance is the overall management of the: availability, usability, integrity and security of data
Give the aspects that a data governance policy should aim to cover. (5)
- The specific roles and responsibilities of individuals in the organisation with regards to data.
- How an organisation will capture, analyse and process data.
- Issues with respect to data security and privacy
- The controls that will be put in place to ensure that the required data standards are applied
- How the adequacy of the controls will be monitored on an ongoing basis with respect to data usability, accessibility, integrity and security.
Give the data governance risks (4).
Failure to have adequate data governance policy can lead to?
- Legal and regulatory non-compliance
- Inability to rely on data for decision making
- Reputational issues
- Incurring additional costs
Give a data concern around mergers and acquisitions. (3)
- Should data be combined into one system
- Which companyβs system to use
- Data aggregation issues.
Give the main risks associated with data. (6)
- The data are inaccurate or incomplete
- The data are not credible due to being insufficient volume, particularly for the estimation of extreme outcomes.
- The data are not sufficiently relevant to the intended purpose
- Past data may not reflect what will happen in the future.
- Chosen data groups may not be optimal
- The data are not available in an appropriate form for the intended purpose.
Why may past data not be an accurate reflection of future experience.
- Past abnormal events
- Significant random fluctuations
- Future trends not being reflected sufficiently in past data
- Changes in the way in which past data was recorded
- Changes in the balance of any homogeneous groups underlying the data
- Heterogeneity with the group to which the assumptions are to relate
- The past data may not be sufficiently up to date
- Other changes
What is big data?
Big data comprises very large data sets, often brought together from different sources, and which can be analysed very quickly
State the data protection principle which may be difficult to meet when using big data
Personal data should be adequate, relevant and not excessive for the purposes concerned.
How can companies avoid big data being excessive and personal for the given purpose?
Anonymisation can be used to ensure that the data is not considered to be personal data.
List the main uses that actuaries make of data.
- Premium rating, product pricing and determining contributions
- Setting provisions
- Experience analysis
- Risk management - underwriting and reinsurance
- Investing
- Accounting
- Management information
- Marketing
- Administration
List the key data required for active members when valuing a pension scheme
- Membership ID / number
- Date of birth
- Date of joining employer
- Date of joining the scheme
- Date / age of retirement
- Current salary
- Salary scale / growth assumptions
- Category of membership
- Dependents - marital status
- Age of dependants
- Data from previous valuations for reconciliations
Outline the design features of a good proposal form.
- Collects data at an appropriate level - including data that are not currently used but may be used in the future
- Be clear and unambiguous - to capture the correct information
- Have inputs that are quantitative as far as possible
Give a design feature of the claims form in order to store good quality data.
Should be clear and unambiguous and link to the proposal form - to cross check information
Give features of data inputting processes that can ensure that good quality data is stored by a company. (5)
- Inputs should be in the same order as the proposal form
- Staff that are inputting data should be trained
- Financial incentives for accurate inputting
- Data systems should have data validation checks - blank entries and sensible entry values
- Send policyholders copies of the key information in order to check all values are captured correctly
Give the data system features that can help ensure that good quality data is stored by an insurance company.
- The system should be capable of storing information so that historical data is available for future pricing exercises
- System should be robust yet flexible
- System should be secure - restricting access of people who can manipulate data
- Regular checks of data movements and changes
- Single integrated systems can make data handling easier
List reasons why claims data might not be directly comparable between different general insurance companies.
- Organisations operating in different geographical or socio-economic sections of the market
- Different policies being sold - policy conditions or perils
- Products being sold by different sales methods or to different target markets
- Differing underwriting standards at the initial claims stage
- Different companies assessing risk differently - different rating factors
- Data being stored or recorded differently or relating to different time periods
Describe the issues with using industry data.
- Supplied data may be inaccurate or incomplete
- May be out of date
- May not be relevant to the intended purpose
- Data may not be available in intended format
- Chosen data groups within the industry-wide data may not be optimal
- The coding used for the factors by which the data is split may vary between pension schemes
- May have different definitions of benefits
- May not be detailed enough data
- Data may be less flexible than is required
- Data may not be credible at extreme conditions
- Data may not reflect what will happen in the future