Interpreting data Flashcards

1
Q

What are the main issues with sampling?

A
  • Target population can be difficult to determine and is subjective
  • Sample must be selected from the population at random (each member of the population must have the same chance of being included in the sample
  • A reasonably high response rate is needed (study sample must be high proportion of selected sample = representitive of population of interest)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is sample data used to calculate?

A
  1. Estimate of population value e.g. mean
  2. Precision of estimate
  3. Range of values can be confident true population value lies within
  4. P value

= statistica inference - applied to study sample to provide information about the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Distribution of sample means:

A
  • Approximately normally distributed if sample size <100
  • Mean of the distribution of sample means = population mean
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is accurate?

A

How close sample is to the true population value we are trying to estimate (cannot be assess without gold standard/repeated measurement to compare)

= centred on target

n.b. cannot be accurate if measuring the wrong people

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is precision?

A

Amount of variation in the estimate of the population (increased variation = decreased precision)

= all close together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is standard error of the mean?
And how is it calculated?

A

= measurement of precision of sample mean as estimate of population mean

SE = SD / √sample size

Larger sample size = smaller SE = more precise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is standard deviation?

A

Measurement of the amount of variability in the population (how far an individual observation is likely to be from the population mean

n.b. population SD is usually unknown so we have to use sample SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is not affected by the size of the sample?

A

Standard deviation

Standard error DOES as sample size increases, standard error decreases = more precise estimate of the sample mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are confidence intervals for means?

A

For large samples 95% of individual sample means are within 1.96 X SE of the population mean = 95% of the population mean will be within 1.96 X SE if an observed sample mean

Or… if several independent random samples are drawn from a population and sample measn are calculated for each, the confidence intervals will include the true population of the mean on 95% of occasions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How is a 95% confidence of 150.5 to 150.9 interpreted?

A

With 95% confidence, the mean height of 11 year olds in the general population is between 150.5 and 150.9 cm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the difference between reference ranges and confidence intervals?

A

RR = clinical practice -> using SD to inform if a patient is clinically normal or not

CI - statistical inference in research -> using SE and used in research when statistical inference is required (indication of how precise an estimate the sample mean is of the populations mean)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Which of the following statements is true?

  1. Data which are positively skewed can be used to calculate confidence intervals for mean values
  2. Standard deviations are used to calculate confidence intervals
  3. Confidence intervals provide a measure of precision e.g. of a smaple mean
A
  1. confidence intervals provide a measurement of precision e.g. of a sample mean
    n. b.

not 1. as data should be normally distrivuted otherwise mean could be misleading

not 2. standard errors are used

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How are different confidence intervals calculated?

A

Using different multipliers

95% = 1.96 X SE

99% = 2.58 X SE

90% = 1.64 X SE

these are based on standard normal distribution and obtained from statistical tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a P value?

A

Probability of a difference at least as big as that observed if null hypothesis is true (no real difference between exposure groups of the population)

NOT the probability that the null hyporthesis is true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does a smaller p-value show?

A

Stronger evidence against null hypothesis (lower chance of getting a difference as big as the on observed if the null hypothesis was true)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How are P values calculated?

A

Test statisitc = summary statistic/ SE e.g. difference in means between two groups

Compared with appripriate statistical distribution to derive p value (computer package tables -> compare with standard normal distribution/ t distribution)

17
Q

What type of test is used to get p value for standard normal distributions?

A

z test

for large samples

18
Q

What type of test is used to get p values from a t distribution?

A

t- test for large/small samples

= comparisons between means

19
Q

What can we infer from a p value of <0.001 when the difference in means is 110g?

A

Probability of getting a difference of at least 110g (in either direction) if the null hypothesis is true is less than 0.1%

i.e. very strong evidence against the null hypothesis

20
Q

When do we use two sided p-values?

A

Because we are interested to see if the absolute difference is >0

= assessment of probability that result due to chance is based on how extreme the size of the departure from the null hypothesis is and not its direction!

21
Q

Whether or not to reject a null hypothesis should be based on confidence intervals and…?

A

The strength of evidence against the null hypothesis as assessed by the p value = most relaible way of interpreting findings

22
Q

What are the 4 common errors made in interpreting data?

A
  1. Accepting a hypothesis if p value is large (should interpret it in terms of strength of evidence against the hypothesis)
  2. Interpreting p<0.05 as statistically significant
  3. Basing conclusions on p value only
  4. Ignoring the contect (if sample split into many subgroups and seperately tested we should take that in to consideration when comparing t values)
23
Q

Why can we not ‘accept’ the null hypothesis?

A

To prove hypothesis -> have to find every person >90 and check non-smoker

To disprove hypothesis -> find just 1 person >90 who smokes

= much easier to find evidence against hypothesis than prove it correct (absence of evidence is not evidence of absence)

= p value of 1 can be obtained even when the null hypothesis of false -> cannot reject the null hypothesis

24
Q

Why is p<0.05 not statistically significant?

A

preferable to consider exact p value /strength of evidence

Cutpoint of 0.05 is arbitury -> commly used but 1 in 20 get p < 0.05 purely by chance

p value depends on sample size (harder to get p value <0.05 in small samples)

25
Q

How do we interpret p values?

A
  1. 1 = weak evidence against the null hypothesis
  2. 01 = increasing evidence against the null hypothesis
  3. 001 strong evidence against the null hypothesis
26
Q

Why is it important to use both confidence intervals & p values?

A

Confidence interval = range of values within which we are reasonably confident the population difference lies

P values = strength of evidence against null hypothesis

STATISTICAL SIGNIFICANCE DOES NOT EQUATE TO CLINICAL IMPORTANCE -> even when p value shows strong evidence against the null hypothesis

27
Q

Why must you look at the context when applying many statistical tests using different subgroups within a sampls?

A

P values should be interpreted with caution (small p value for one test simply due to chance)