theme 2 - data, figures and graphs Flashcards
what is a binomial distribution? what is it based on?
-a distribution of binary data where there is only 2 outcomes e.g. pass or fail
-based on a fixed number of trials characterised by the probability of success in a single trial and the number of trials
what is a poisson distribution?
(similar to binomial but) measures the number of events / distribution of binary data from an infinite sample so gives probability of getting r events in a population whereas binomial distribution is a fixed sample so measures probability of getting r events in a trial
how is normal distribution different to binomial and poisson?
its for continuous data instead of discrete e.g. measuring the weights if newborn babies
why does a normal distribution usually form a bell shaped curve?
because most biological events are the consequence of multiple variables
what can standard deviation be used to determine?
the between subject variability or the subject variability
how do you calculate standard deviation?
sd = square root of variance
to find the variance:
1. find the mean of the data
2.subtract the mean from each data point
3. square each deviation
4. add all the squared deviations together
5. divide the sum by the number of data points in the population
then the sd is just square root of that
(this is for population sd, if youre doing sample sd then at the end you have to divide the sum by the number of data points MINUS 1)
why use a table to present data?
-concise and effective way to present large amounts of data
-data with differing units can be displayed
-precise values can be displayed
why use a boxplot instead of histograms?
-although both show variables divided into groups, box plots can show the spread of continuous data and any outliers
what are figure legends, what do they do, what do they need to contain?
-self contained text associated with each figure, containing all the info necessary to understand the figure
-they tell the reader what any abbreviations/symbols/colours ect mean
-they need a short title as well as the figure number
what are figures?
they are collections of graphs, tables and images grouped together to serve a particular purpose
what is quantitative data and give an example
numerical data such as the va score of a patient or height/weight
what is qualitative data and give an example
data that has been split into categories and is typically more descriptive e.g. a patients feedback on how well they felt their optometry appointment went
what is continuous data?
data that can be measured so can take any value e.g. length
what is discrete data?
count data e.g. number of students in a class/ shoe size
what is ratio data, giving an example and what is its opposite?
differences between measurements where a true zero exists so there can be no negative value in ratio data e.g. height and weight
interval data
what is interval data, give an example? what is its opposite?
differences between measurements but there is no true, measured on a scale where each point is placed at equal distance from one another e.g. temperature
ratio data
what is ordinal data, give an example and its opposite
data in ordered categories e.g. school grades like A, B, C
nomial data
what is nomial data, give an example and its opposite
data in categories with no ordering or direction e.g. hair colour
ordinal data
what is binary data?
qualitative (categorical data) that has only 2 groups
how is qualitative data displayed?
-as a proportion in a pie chart/ bar chart
-in a table
what’s the point of mean median and mode (average) of continuous data?
it allows us to highlight the location of the dataset
what is the point of range, IQR and SD when summarising continuous data?
it acts as a measure of dispersion/ variability
what graphs are used to display continuous data?
histograms, scatter graphs, box plots
how do you convert continuous data into categorical data? Give an example
-by dividing it up and changing it so it becomes dichotomised
-e.g. taking eye pressure values and categorising it into hypertension and normotension
what is another word for relative frequency? what does it mean?
-proportion
-its how often something happens proportional to the total number of trials and is an estimate of probability
when can relative frequency = incidence?
if it is the number of new cases over a given period
when can relative frequency be referred to as prevalence?
when there is a number of existing cases at a specific time or over a given period
what variables does prevalence depend on?
the incidence as well as duration
what is the formula for risk?
risk = number of exposed people with the disease / total number of the exposed sample
what words could you use instead of risk to be more ethical towards certain data sets e.g. a down syndrome screening
sensitivity/ chance due to ‘risk’ having negative connotations
what is the formula for relative risk, what is another word for relative risk?
relative risk = risk in exposed group / risk in nonexposed group
relative risk = risk ratio
what does relative risk allow you to do?
compare the risk between different groups e.g. what’s the difference in risk of acquiring AMD between smokers and non-smokers?
what is the definition of attributable risk?
the incidence in the exposed group that can be attributed to the exposure
what is the formulae for attributable risk and attributable risk as a %?
-attributable risk = risk in exposed group - risk in non-exposed group
-attributable risk percent = attributable risk / incidence of disease in exposed group
What is the formula for NNH (number needed to harm)
NNH = 1/attributable risk
how do you calculate NNT (number needed to treat)
NNT = 1/ attributable risk reduction
what is NNH?
The number of individuals that need to be treated so that one individual presents an adverse reaction accountable to the treatment
what is NNT
the number of patients you need to treat to prevent one additional bad outcome
what is the formula to work out odds?
odds = number of exposed people with disease / number of exposed people without the disease
what does odds ratio do?
it quantifies the association between two events (calculated in a similar way to relative risk)
what is the formula for odds ratio?
odds ratio = odds in exposed group / odds in non exposed group
how do you know whether to use relative risk or odds ratio for a study?
-If its a cohort study, it asks if the outcome is occurring by looking ahead so use relative risk
-if its a case-control study, it asks the frequency of exposure by looking back and so use odds ratio only
how is quality of life measured/ quantified?
using
-time trade off assessment
or
-standard gamble assessment
why do we measure quality of life?
to understand how a disease affects someone’s life or how a treatment improves a patient’s quality of life
how is time trade of assessment done?
where the patient is asked to imagine they have 10 years left to live and decide how many of those years they’d give up - remaining time is expressed as a ratio between 0 and 1 and this is what is used to quantify QoL
how is standard gamble assessment calulated?
the patient is asked to imagine they have 10 years left to live and asked how much they would give up. This is taken as a percentage and then subtracted from 1 to give the quality of life