19 - Data Flashcards
What is Personal data
Personal data is information which would allow an individual to be identified, either on its own or when combined with other information
Examples of personal data
Name Address Personal email address Occupation DOB Health status Race or ethnicity Criminal record
What is Sensitive personal dat?
Sensitive personal data is information which is more private to the individual
Its disclosure to others without consent could cause the individual a high level of distress or damage
Examples of sensitive personal data
o Racial or ethnic origin o Political opinions o Religious or other similar beliefs o Membership of trade unions o Physical or mental health condition o Sexual life o Convictions, proceeding and criminal acts
Under what conditions might Sensitive personal data be processed
- The data subject has given explicit consent
- It is required by law for employment purposes
- It is needed in order to protect the vital interests of the individual or another person
- It is needed in connection with the administration of justice or legal proceedings
What is the main concern of using / transferring data across international borders?
The legislation around data handling may be more stringent in one of the two countries and organisations need to take extra care to not breach local standards.
List the eight conditions of the POPI Act in South Africa.
- Accountability
- Processing limitation
- Purpose specification
- Further processing
- Information quality
- Openness
- Security safeguards
- Data subject participation
Describe the POPI Act condition of accountability.
The party responsible for processing the data is also responsible for compliance with POPI.
Describe the POPI Act condition of processing limitation.
Information must be processed in a fair, lawful and relevant manner, after consent is given by the data subject.
Describe the POPI Act condition of purpose specification.
Personal information must be collected for a specific purpose. Record keeping to be destroyed when personal data is no longer relevant or authorised to be held.
Describe the POPI Act condition of further processing limitation
Further processing must be compatible with the initial collection prupose.
Describe the POPI Act condition of information quality
Data completeness, accuracy and updates to be ensured by holder of the data.
Describe the POPI Act condition of openness
Documentation to be maintained on all processing operations and maintaining transparency on data use.
Describe the POPI Act condition of security safe-guards
Integrity and confidentiality of personal data must be secured and all processing done only by authorised operators. Notification to be done on security compromises.
Describe the POPI Act condition of data subject participation.
The data subject may request confirmation of personal data held and request correction or deletion of any inaccurate, misleading or outdated information held.
Aside from criminal action and fines, what is another damaging effect of data breaches occurring within a companyβs data bases?
Damage to reputation and the ability to retain and attract clients.
What is data governance?
Data governance is the overall management of the: availability, usability, integrity and security of data
Give the aspects that a data governance policy should aim to cover. (5)
- The specific roles and responsibilities of individuals in the organisation with regards to data.
- How an organisation will capture, analyse and process data.
- Issues with respect to data security and privacy
- The controls that will be put in place to ensure that the required data standards are applied
- How the adequacy of the controls will be monitored on an ongoing basis with respect to data usability, accessibility, integrity and security.
Give the data governance risks (4).
Failure to have adequate data governance policy can lead to?
- Legal and regulatory non-compliance
- Inability to rely on data for decision making
- Reputational issues
- Incurring additional costs
Give a data concern around mergers and acquisitions. (3)
- Should data be combined into one system
- Which companyβs system to use
- Data aggregation issues.
Give the main risks associated with data. (6)
- The data are inaccurate or incomplete
- The data are not credible due to being insufficient volume, particularly for the estimation of extreme outcomes.
- The data are not sufficiently relevant to the intended purpose
- Past data may not reflect what will happen in the future.
- Chosen data groups may not be optimal
- The data are not available in an appropriate form for the intended purpose.
Why may past data not be an accurate reflection of future experience.
- Past abnormal events
- Significant random fluctuations
- Future trends not being reflected sufficiently in past data
- Changes in the way in which past data was recorded
- Changes in the balance of any homogeneous groups underlying the data
- Heterogeneity with the group to which the assumptions are to relate
- The past data may not be sufficiently up to date
- Other changes
What is big data?
Big data comprises very large data sets, often brought together from different sources, and which can be analysed very quickly
State the data protection principle which may be difficult to meet when using big data
Personal data should be adequate, relevant and not excessive for the purposes concerned.
How can companies avoid big data being excessive and personal for the given purpose?
Anonymisation can be used to ensure that the data is not considered to be personal data.
List the main uses that actuaries make of data.
- Premium rating, product pricing and determining contributions
- Setting provisions
- Experience analysis
- Risk management - underwriting and reinsurance
- Investing
- Accounting
- Management information
- Marketing
- Administration
List the key data required for active members when valuing a pension scheme
- Membership ID / number
- Date of birth
- Date of joining employer
- Date of joining the scheme
- Date / age of retirement
- Current salary
- Salary scale / growth assumptions
- Category of membership
- Dependents - marital status
- Age of dependants
- Data from previous valuations for reconciliations
Outline the design features of a good proposal form.
- Collects data at an appropriate level - including data that are not currently used but may be used in the future
- Be clear and unambiguous - to capture the correct information
- Have inputs that are quantitative as far as possible
Give a design feature of the claims form in order to store good quality data.
Should be clear and unambiguous and link to the proposal form - to cross check information
Give features of data inputting processes that can ensure that good quality data is stored by a company. (5)
- Inputs should be in the same order as the proposal form
- Staff that are inputting data should be trained
- Financial incentives for accurate inputting
- Data systems should have data validation checks - blank entries and sensible entry values
- Send policyholders copies of the key information in order to check all values are captured correctly
Give the data system features that can help ensure that good quality data is stored by an insurance company.
- The system should be capable of storing information so that historical data is available for future pricing exercises
- System should be robust yet flexible
- System should be secure - restricting access of people who can manipulate data
- Regular checks of data movements and changes
- Single integrated systems can make data handling easier
List reasons why claims data might not be directly comparable between different general insurance companies.
- Organisations operating in different geographical or socio-economic sections of the market
- Different policies being sold - policy conditions or perils
- Products being sold by different sales methods or to different target markets
- Differing underwriting standards at the initial claims stage
- Different companies assessing risk differently - different rating factors
- Data being stored or recorded differently or relating to different time periods
Describe the issues with using industry data.
- Supplied data may be inaccurate or incomplete
- May be out of date
- May not be relevant to the intended purpose
- Data may not be available in intended format
- Chosen data groups within the industry-wide data may not be optimal
- The coding used for the factors by which the data is split may vary between pension schemes
- May have different definitions of benefits
- May not be detailed enough data
- Data may be less flexible than is required
- Data may not be credible at extreme conditions
- Data may not reflect what will happen in the future
Describe how industry data may not be relevant for a benefit scheme.
- Not originally collected for the intended purpose
- May not be accurately comparable
- Employer operates in a different geographical region
- Has a different socio-economic mix of members than the industry average
- The nature of the benefits is very different to industry averages
- The extent of voluntary vs compulsory membership differs from the industry average
List the reasons that past data may not reflect future experience.
- Past abnormal events
- Significant random fluctuations
- Future trends not reflected in historical data
- Changes in the way in which past data was recorded
- Changes in the balance of homogeneous groups underlying the data
- Other changes such as medical, social or economic.
What is algorithmic decision making?
Algorithmic decision making refers to investment trading decisions that are automated so that they take place without human interventions
Give the advantages of algorithmic decision making.
- Faster
- More efficient
- Possibly lower dealing costs on trades
- It is possible to implement complex trading strategies
Give the disadvantages of algorithmic decision making.
- Could be errors in the algorithm or the data or parameters - especially when a large number of trades could be implemented in a short period
- Algorithm may not operate in adverse conditions
- Trading may be suspended before algorithmic trading can be completed in turbulent markets
- Could have adverse impacts on the financial systems
Give the main sources of data for an insurance company.
- Publicly available data
- Internal data
- Industry-wide data collection
- Reinsurance data
Give the main issue with using internal data.
There may not be sufficient data for launching or pricing a new product or target markets.
Give some advantages of using industry-wide data
- Can compare experience with that of the industry as a whole
- Can provide an indication of the ways in which competitors attract policyholders through differences between the company and the industry-wide data.
How can individual assets data be checked:
- Check that a liability or asset exists on a given date
- Check that a liability is held or an asset is owned at a given date
- Check that when an event is recorded, the time of the event and the associated income or expenditure are allocated to the correct accounting period
- Check that data are complete β no unrecorded liabilities or assets
- Check the appropriate value of an asset or liability has been recorded
Describe the main data checks that actuaries can use.
RECONCILIATIONS
- Reconciliations of member / policy numbers
- Reconciliations of benefits and premiums
- Reconciliation of beneficial owner and custodian records where assets are owned by a third party
CONSISTENCY CHECKS
- Consistency of asset income data and accounts
- Consistency of salary related contribution and benefit levels with the accounts
- Consistency between average sum assured and premium for each class, and when compared with previous investigations
- Consistency between start and end period shareholdings adjusting for sales etc
- Consistency between actuary and custodianβs records
- Consistency between contributions and active members
- Consistency between pension benefits and number of pensioners
SPOT CHECKS
- Records picked at random for spot checks.
- Look for very high or very low values/ unusual values
- Full deed audit for certain assets, eg property
- Validity of dates
Give the main circumstances under which there may be a lack of ideal data.
- Insufficient volume to provide credible results
2. Data captured at insufficient details
Describe the main issue with using summarised data.
The records cannot be validated which makes error detection impossible
Outline the issues with provisions that are too large
- The funding level will appear to be lower than it actually is
- Capital may not be used efficiently
Outline the issues with provisions that are too small
- Over time it will become apparent that additional money is required
- Worst case is insolvency
3, Profits will be recognised earlier and the payment of tax will be accelerated4. Inappropriate business decisions can be made.