Handling Data Flashcards
What enables data sharing whilst preserving privacy?
Pseudonymisation and Anonymisation
What is anonymisation?
This is the removal of all personal identifiers (direct and indirect) which may lead to an individual being identified.
This means there is PERMANENT removal of all links.
What is pseudonymisation?
This the processing of data so that it can’t be linked too specific data subjects without ‘additional information’.
‘Additional information’ must be kept separately and protected.
How do you check data?
Data Verification
Data Validation.
What is Data Verification?
Questions whether the data inputted is accurate.
What is Data Validation?
Questions whether the data is reasonable when compared against a set of rules.
What is a normal distribution?
Creates a bell - shaped curve.
Mean and Median are equal + centred in the middle of the distribution.
Predictable data falls within 1,2 and 3 standard deviations of the mean.
What describes the normal distribution?
Described by the mean and standard deviation.
What is variance?
Where is the greatest variance?
How far a data set is spread out.
Greatest variance is on the outskirts of the bell - shaped curve.