Displaying data Flashcards

Question 1

Q

Types of data

Answer

A

Categorical:
Binary
Ordinal
Nominal

Numerical:
Discrete
Continuous

Question 2

Q

Summarising categorical data

Answer

A

Proportion
Percentage
Rate
Odds
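
A minimal sketch of the four summaries, using made-up counts (30 events among 120 patients, and an assumed 400 person-years of follow-up for the rate). The variable names are illustrative only.

```python
# Hypothetical counts: 30 of 120 patients have the outcome.
events = 30
total = 120

proportion = events / total        # events as a fraction of everyone
percentage = 100 * proportion      # the same fraction on a 0-100 scale
odds = events / (total - events)   # events relative to non-events

# A rate needs a time denominator; 400 person-years is an assumed follow-up.
person_years = 400
rate_per_1000_py = 1000 * events / person_years
```

Note that the odds (30 to 90) differ from the proportion (30 out of 120) because the denominator excludes the events themselves.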

Question 3

Q

Summarising numeric data

Answer

A

Normally distributed, symmetric data: mean and SD

Non-normal, skewed data: median and IQR
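
As a rough illustration, the two summaries can be computed with Python's standard `statistics` module. The data sets below are invented, and the quartiles use the module's default method, which may differ slightly from other software.

```python
import statistics

# Invented example data: one roughly symmetric set, one upward-skewed set.
symmetric = [4.8, 5.0, 5.1, 5.2, 4.9, 5.0, 5.3, 4.7]
skewed = [1, 1, 2, 2, 3, 3, 4, 25]

def mean_sd(values):
    """Return (mean, sample SD), the usual summary for symmetric data."""
    return statistics.mean(values), statistics.stdev(values)

def median_iqr(values):
    """Return (median, (Q1, Q3)), the usual summary for skewed data."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    return statistics.median(values), (q1, q3)
```

For the skewed set, the single large value pulls the mean well above the median, which is why the median and IQR describe it better.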

Question 4

Q

Minimisation steps

Answer

A

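Look at the confounders of patients already allocated and see which group the new patient would fit best in (the group that improves balance)

Choose a minimisation factor (e.g. 80 would mean the new patient has an 80% chance of being put in the best-fit group)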
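
One possible sketch of these steps for a two-arm trial. The function name `minimise`, the group labels "A" and "B", the unweighted count of matching patients, and the random choice on a tie are all assumptions made for illustration, not a prescribed method.

```python
import random

def minimise(new_patient, allocated, factors, p_best=0.8, rng=random):
    """Allocate one patient to group 'A' or 'B' by minimisation.

    allocated: list of (patient_dict, group) pairs already in the trial.
    factors:   names of the confounders to balance.
    p_best:    minimisation factor, the chance of using the best-fit group.
    """
    # For each group, count existing patients who share each of the
    # new patient's confounder values.
    totals = {"A": 0, "B": 0}
    for group in totals:
        for factor in factors:
            totals[group] += sum(
                1 for patient, g in allocated
                if g == group and patient[factor] == new_patient[factor]
            )
    if totals["A"] == totals["B"]:
        return rng.choice(["A", "B"])  # no best fit, so allocate at random
    # The best-fit group is the one with fewer similar patients so far.
    best = min(totals, key=totals.get)
    other = "B" if best == "A" else "A"
    return best if rng.random() < p_best else other
```

With `p_best=0.8` the patient goes to the best-fit group 80% of the time, which keeps some unpredictability in the allocation.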

Question 5

Q

Transforming data

Answer

A

Tukey’s ladder of transformations
To correct upward skew: x^(1/2), log(x), -1/x, -1/x^2
To correct downward skew: x^2, x^3, antilog(x)
Back-transform the mean and SD calculated on the transformed data to get back to the original scale
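
A small sketch of the log rung of the ladder, using invented upward-skewed values. Back-transforming the mean of the logs gives the geometric mean, not the arithmetic mean.

```python
import math
import statistics

# Invented upward-skewed values.
values = [1, 2, 4, 8, 16]

# Step down the ladder with a natural log transform.
logged = [math.log(x) for x in values]
mean_log = statistics.mean(logged)

# Back-transform (antilog) the mean to return to the original scale.
geometric_mean = math.exp(mean_log)
arithmetic_mean = statistics.mean(values)
```

Here the back-transformed mean is smaller than the arithmetic mean, because the log scale reduces the influence of the largest values.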

Question 6

Q

Quantify differences between groups

Answer

A

Difference between 2 means: only valid if both groups are normally distributed
Difference between 2 medians: always valid

Question 7

Q

Quantifying associations between groups

Answer

A

Pearson’s correlation coefficient is parametric and is used when the relationship is linear

Spearman’s correlation coefficient is non-parametric and doesn’t need the relationship to be linear

Question 8

Q

Standard Error

Answer

A

Measure of precision, spread of sample means

SE = SD / n^(1/2)
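
The formula can be checked with a couple of lines of Python. The function name and the example numbers (SD of 10, n of 25) are made up for illustration.

```python
import math

def standard_error(sd, n):
    """Standard error of a mean: SD divided by the square root of n."""
    return sd / math.sqrt(n)

# Hypothetical sample: SD of 10 from 25 observations.
se = standard_error(10, 25)
```

Because n sits under a square root, quadrupling the sample size only halves the standard error.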

Question 9

Q

Requirements to calculate standard error with single mean

Answer

A

Sample size >20

Sample normally distributed

Question 10

Q

Confidence intervals

Answer

A

Range of values for the population mean that the sample is compatible with, e.g. a 95% CI is sample mean ± 1.96 x SE; for 95% of samples, the CI will contain the population mean
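
A minimal sketch of a 95% CI for a single mean, assuming a normally distributed sample of more than 20. The function name and the example values (mean 50, SD 10, n 25) are invented.

```python
import math

def mean_ci(mean, sd, n, z=1.96):
    """Return (lower, upper) limits of the CI: mean +/- z x SE."""
    se = sd / math.sqrt(n)  # standard error of the mean
    return mean - z * se, mean + z * se

# Hypothetical sample: mean 50, SD 10, n = 25, so SE = 2.
lower, upper = mean_ci(50, 10, 25)
```

The multiplier is applied to the standard error rather than the SD, so the interval narrows as the sample size grows.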

Question 11

Q

Requirements to calculate standard error with diff between 2 means

Answer

A

Both groups normally distributed
Both groups have a sample size of more than 20
Similar SDs (neither more than 2x the other)

Question 12

Q

Standard error for a proportion

Answer

A

SE = (p(1-p)/n)^(1/2)
Assumes n>20
Assumes 0.1< p <0.9
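
A short sketch of the formula with its two assumptions enforced as checks. The function name and the example values (p of 0.2, n of 100) are illustrative.

```python
import math

def proportion_se(p, n):
    """Standard error of a proportion: square root of p(1-p)/n."""
    # The card's assumptions: n > 20 and 0.1 < p < 0.9.
    if n <= 20 or not 0.1 < p < 0.9:
        raise ValueError("formula assumes n > 20 and 0.1 < p < 0.9")
    return math.sqrt(p * (1 - p) / n)

# Hypothetical example: 20 events in 100 patients.
se_p = proportion_se(0.2, 100)
```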

Question 13

Q

SE and CIs for relative risk

Answer

A

RR/OR are transformed towards a normal distribution using the natural log

Confidence intervals are calculated on the log scale, then back-transformed at the very end
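
A sketch of the log-then-back-transform procedure for a relative risk. The card does not give a formula for the SE of ln(RR); the one used below is the commonly taught large-sample formula, and the function name and counts are invented.

```python
import math

def relative_risk_ci(a, n1, c, n2, z=1.96):
    """RR with CI for a events out of n1 (exposed) and c out of n2 (unexposed)."""
    rr = (a / n1) / (c / n2)
    log_rr = math.log(rr)  # work on the natural log scale
    # Assumed large-sample SE of ln(RR); not stated on the card.
    se_log = math.sqrt(1 / a - 1 / n1 + 1 / c - 1 / n2)
    # Back-transform the limits only at the very end.
    return rr, math.exp(log_rr - z * se_log), math.exp(log_rr + z * se_log)

# Hypothetical trial: 20/100 events versus 10/100 events.
rr, rr_lower, rr_upper = relative_risk_ci(20, 100, 10, 100)
```

The resulting interval is symmetric around ln(RR) on the log scale but asymmetric around the RR itself once back-transformed.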

Question 14

Q

SE and CI of Pearson correlation coefficient

Answer

A

Calculate the SE on the ln-transformed scale, then back-transform right at the end along with the CIs
Transformation (Fisher’s z): z = 0.5 x ln((1+r)/(1-r))
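
A sketch of how the transformation can be used. The SE of 1/sqrt(n-3) on the transformed scale is the standard result for this transformation but is not stated on the card, and the function name and example values are invented.

```python
import math

def pearson_ci(r, n, z_crit=1.96):
    """Approximate CI for a Pearson correlation via the ln transformation."""
    z = 0.5 * math.log((1 + r) / (1 - r))  # transformed correlation
    se = 1 / math.sqrt(n - 3)              # assumed SE on the transformed scale
    z_lower, z_upper = z - z_crit * se, z + z_crit * se
    # Back-transform at the very end; tanh is the inverse of the transform.
    return math.tanh(z_lower), math.tanh(z_upper)

# Hypothetical example: r = 0.5 from 28 pairs, so SE = 0.2.
r_lower, r_upper = pearson_ci(0.5, 28)
```

Back-transforming keeps both limits between -1 and 1, which a CI calculated directly on r would not guarantee.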

Question 15

Q

Non-parametric CI

Answer

A

Bootstrapping: resampling with replacement
The 95% CI runs from the 2.5th to the 97.5th centile of the bootstrap estimates
The median of the bootstrap estimates is the best estimate of the population average
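
A minimal percentile-bootstrap sketch using only the standard library. The function name, the default of 2000 resamples, the fixed seed and the simple index-based centiles are all choices made for illustration.

```python
import random
import statistics

def bootstrap_ci(sample, stat=statistics.median, n_boot=2000, seed=0):
    """Return (2.5th centile, median, 97.5th centile) of bootstrap estimates."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    estimates = sorted(
        # Resample with replacement, same size as the original sample.
        stat(rng.choices(sample, k=len(sample)))
        for _ in range(n_boot)
    )
    lower = estimates[int(0.025 * n_boot)]      # approximate 2.5th centile
    upper = estimates[int(0.975 * n_boot) - 1]  # approximate 97.5th centile
    return lower, statistics.median(estimates), upper

# Invented skewed sample.
sample = [3, 5, 7, 8, 9, 12, 13, 15, 21, 40]
boot_lower, boot_best, boot_upper = bootstrap_ci(sample)
```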

Question 16

Q

Bootstrapping for difference in means

Answer

A

Take multiple bootstrap samples from each group and find difference in means
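
The same idea can be sketched for two groups, resampling each group separately and recording the difference in means each time. The function name, groups, seed and number of resamples are invented for illustration.

```python
import random
import statistics

def bootstrap_mean_difference(group_a, group_b, n_boot=2000, seed=0):
    """Return an approximate 95% CI for mean(group_a) - mean(group_b)."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    diffs = []
    for _ in range(n_boot):
        # Resample each group with replacement, keeping its original size.
        resample_a = rng.choices(group_a, k=len(group_a))
        resample_b = rng.choices(group_b, k=len(group_b))
        diffs.append(statistics.mean(resample_a) - statistics.mean(resample_b))
    diffs.sort()
    # Approximate 2.5th and 97.5th centiles of the bootstrap differences.
    return diffs[int(0.025 * n_boot)], diffs[int(0.975 * n_boot) - 1]

# Invented groups with clearly different means.
diff_lower, diff_upper = bootstrap_mean_difference(
    [10, 11, 12, 13, 14], [1, 2, 3, 4, 5]
)
```

If the interval excludes zero, as it does for these invented groups, the data are not compatible with there being no difference in means.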

Question 17

Q

Bootstrap CI characteristics

Answer

A

Bootstrap samples must be the same size as the original sample

Always valid; should give a similar result to the CI calculated from the SE if the data are normally distributed

(17 cards)