theme 2 - data, figures and graphs Flashcards

Question 1

Q

what is a binomial distribution? what is it based on?

Answer

A

-a distribution of binary data where there is only 2 outcomes e.g. pass or fail
-based on a fixed number of trials characterised by the probability of success in a single trial and the number of trials

Question 2

Q

what is a poisson distribution?

Answer

A

(similar to binomial but) measures the number of events / distribution of binary data from an infinite sample so gives probability of getting r events in a population whereas binomial distribution is a fixed sample so measures probability of getting r events in a trial

Question 3

Q

how is normal distribution different to binomial and poisson?

Answer

A

its for continuous data instead of discrete e.g. measuring the weights if newborn babies

Question 4

Q

why does a normal distribution usually form a bell shaped curve?

Answer

A

because most biological events are the consequence of multiple variables

Question 5

Q

what can standard deviation be used to determine?

Answer

A

the between subject variability or the subject variability

Question 6

Q

how do you calculate standard deviation?

Answer

A

sd = square root of variance

to find the variance:
1. find the mean of the data
2.subtract the mean from each data point
3. square each deviation
4. add all the squared deviations together
5. divide the sum by the number of data points in the population

then the sd is just square root of that

(this is for population sd, if youre doing sample sd then at the end you have to divide the sum by the number of data points MINUS 1)

Question 7

Q

why use a table to present data?

Answer

A

-concise and effective way to present large amounts of data
-data with differing units can be displayed
-precise values can be displayed

Question 8

Q

why use a boxplot instead of histograms?

Answer

A

-although both show variables divided into groups, box plots can show the spread of continuous data and any outliers

Question 9

Q

what are figure legends, what do they do, what do they need to contain?

Answer

A

-self contained text associated with each figure, containing all the info necessary to understand the figure
-they tell the reader what any abbreviations/symbols/colours ect mean
-they need a short title as well as the figure number

Question 10

Q

what are figures?

Answer

A

they are collections of graphs, tables and images grouped together to serve a particular purpose

Question 11

Q

what is quantitative data and give an example

Answer

A

numerical data such as the va score of a patient or height/weight

Question 12

Q

what is qualitative data and give an example

Answer

A

data that has been split into categories and is typically more descriptive e.g. a patients feedback on how well they felt their optometry appointment went

Question 13

Q

what is continuous data?

Answer

A

data that can be measured so can take any value e.g. length

Question 14

Q

what is discrete data?

Answer

A

count data e.g. number of students in a class/ shoe size

Question 15

Q

what is ratio data, giving an example and what is its opposite?

Answer

A

differences between measurements where a true zero exists so there can be no negative value in ratio data e.g. height and weight

interval data

Question 16

Q

what is interval data, give an example? what is its opposite?

Answer

A

differences between measurements but there is no true, measured on a scale where each point is placed at equal distance from one another e.g. temperature
ratio data

Question 17

Q

what is ordinal data, give an example and its opposite

Answer

A

data in ordered categories e.g. school grades like A, B, C
nomial data

Question 18

Q

what is nomial data, give an example and its opposite

Answer

A

data in categories with no ordering or direction e.g. hair colour
ordinal data

Question 19

Q

what is binary data?

Answer

A

qualitative (categorical data) that has only 2 groups

Question 20

Q

how is qualitative data displayed?

Answer

A

-as a proportion in a pie chart/ bar chart
-in a table

Question 21

Q

what’s the point of mean median and mode (average) of continuous data?

Answer

A

it allows us to highlight the location of the dataset

Question 22

Q

what is the point of range, IQR and SD when summarising continuous data?

Answer

A

it acts as a measure of dispersion/ variability

Question 23

Q

what graphs are used to display continuous data?

Answer

A

histograms, scatter graphs, box plots

Question 24

Q

how do you convert continuous data into categorical data? Give an example

Answer

A

-by dividing it up and changing it so it becomes dichotomised
-e.g. taking eye pressure values and categorising it into hypertension and normotension

Question 25

Q

what is another word for relative frequency? what does it mean?

Answer

A

-proportion
-its how often something happens proportional to the total number of trials and is an estimate of probability

Question 26

Q

when can relative frequency = incidence?

Answer

A

if it is the number of new cases over a given period

Question 27

Q

when can relative frequency be referred to as prevalence?

Answer

A

when there is a number of existing cases at a specific time or over a given period

Question 28

Q

what variables does prevalence depend on?

Answer

A

the incidence as well as duration

Question 29

Q

what is the formula for risk?

Answer

A

risk = number of exposed people with the disease / total number of the exposed sample

Question 30

Q

what words could you use instead of risk to be more ethical towards certain data sets e.g. a down syndrome screening

Answer

A

sensitivity/ chance due to ‘risk’ having negative connotations

Question 31

Q

what is the formula for relative risk, what is another word for relative risk?

Answer

A

relative risk = risk in exposed group / risk in nonexposed group

relative risk = risk ratio

Question 32

Q

what does relative risk allow you to do?

Answer

A

compare the risk between different groups e.g. what’s the difference in risk of acquiring AMD between smokers and non-smokers?

Question 33

Q

what is the definition of attributable risk?

Answer

A

the incidence in the exposed group that can be attributed to the exposure

Question 34

Q

what is the formulae for attributable risk and attributable risk as a %?

Answer

A

-attributable risk = risk in exposed group - risk in non-exposed group

-attributable risk percent = attributable risk / incidence of disease in exposed group

Question 35

Q

What is the formula for NNH (number needed to harm)

Answer

A

NNH = 1/attributable risk

Question 36

Q

how do you calculate NNT (number needed to treat)

Answer

A

NNT = 1/ attributable risk reduction

Question 37

Q

what is NNH?

Answer

A

The number of individuals that need to be treated so that one individual presents an adverse reaction accountable to the treatment

Question 38

Q

what is NNT

Answer

A

the number of patients you need to treat to prevent one additional bad outcome

Question 39

Q

what is the formula to work out odds?

Answer

A

odds = number of exposed people with disease / number of exposed people without the disease

Question 40

Q

what does odds ratio do?

Answer

A

it quantifies the association between two events (calculated in a similar way to relative risk)

Question 41

Q

what is the formula for odds ratio?

Answer

A

odds ratio = odds in exposed group / odds in non exposed group

Question 42

Q

how do you know whether to use relative risk or odds ratio for a study?

Answer

A

-If its a cohort study, it asks if the outcome is occurring by looking ahead so use relative risk
-if its a case-control study, it asks the frequency of exposure by looking back and so use odds ratio only

Question 43

Q

how is quality of life measured/ quantified?

Answer

A

using
-time trade off assessment
or
-standard gamble assessment

Question 44

Q

why do we measure quality of life?

Answer

A

to understand how a disease affects someone’s life or how a treatment improves a patient’s quality of life

Question 45

Q

how is time trade of assessment done?

Answer

A

where the patient is asked to imagine they have 10 years left to live and decide how many of those years they’d give up - remaining time is expressed as a ratio between 0 and 1 and this is what is used to quantify QoL

Question 46

Q

how is standard gamble assessment calulated?

Answer

A

the patient is asked to imagine they have 10 years left to live and asked how much they would give up. This is taken as a percentage and then subtracted from 1 to give the quality of life