Comparing data Flashcards

1
Q

Parametric test for comparing paired, continuous data?

A

Paired t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the assumptions for the paired t-test

A

Differences are plausibly normally distributed

Differences are independent of each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Steps to calculate a p-value from a paired t-test

A
  1. Calculate the differences between the values in each group (d)
  2. Calculate the mean + standard deviation of the means
  3. Calculate the standard error of the mean difference
  4. Calculate the test statistic (t)
  5. Under the null hypothesis, t is distributed as Student’s t, with n-1 degrees of freedom
  6. Look up this value in tables to find p-value (gives two-tailed p value)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How is the test statistic (t) calculated for paired t-test?

A

dbar(-0)/SE(dbar)

dbar = mean of the differences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How is the 100% (1-α)% confidence interval for the mean difference in the population calculated?

A

d(bar)-[txSE(dbar)] to dbar+[txSE(dbar)]
dbar = mean difference
t taken from t distribution table with n-1 degrees of freedom

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the non-parametric equivalent of the paired t-test? (for assessing paired data)

A

Wilcoxon (matched pairs) signed rank test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does the Wilcoxon signed rank test test?

A

Test of null hypothesis that there is no tendency for the outcome under one set of conditions to be higher or lower than under the comparison set of outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the parametric test for independent, continuous data?

A

Independent samples t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the assumptions for the independent two sample t-test

A

Two independent groups
Continuous outcome
Outcome data in both groups is normally distributed
Outcome data in both groups have similar standard deviations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Steps to calculate a p value from the independent two sample t-test

A
  1. Calculate the difference between means of groups
  2. Calculate the pooled standard deviation of the means
  3. Calculate the standard error of the difference between two means
  4. Calculate the test statistic (t)
  5. Compare the test statistic with the t distribution with n1+n2-2 degrees of freedom
  6. Look up this value in tables to find p-value (gives two-tailed p value)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is pooled SD for two independent groups calculated?

A

√[(n1-1)SD1^2+(n2-1)SD2^2]/n1+n2-2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is standard error of difference between two means calculated for two independent groups?

A

pooled SDx√(1/n1)+(1/n2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How is the test statistic (t) calculated for independent samples t test?

A

d/SE
d = observed difference in means
SE = standard error of difference in means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How is the 100% (1-α)% confidence interval for the difference between means in the population calculated?

A

(xbar1-xbar2) +/- [txSE(difference)]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the non-parametric equivalent of the independent samples t-test?

A

Mann-Whitney U test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does the Mann-Whitney U test show?

A

Test of the null hypothesis that the distribution of the outcome variable in the two groups is the same

17
Q

Steps for Mann-Whitney U test

A
  1. Arrange all data in increasing order
  2. Choose one group: for each observation in that group, count how many observations in the other group lie below it
  3. All numbers added up = U-statistic
  4. Compare U test statistic with theoretical distribution under null hypothesis (that samples come from the same population)
18
Q

What two tests can be used when assessing more than two independent groups?

A

Analysis of variance technique (ANOVA) = parametric

Kruskal-Wallis test = non-parametric

19
Q

How can we compare binary outcome data?

A

Comparison of two proportions

20
Q

What are the requirements for a comparison of two proportions?

A

Only makes sense in 2x2 tables e.g. yes or no outcome
Two independent samples or groups
Large sample with all expected frequences >5
- np and n(1-p) should both exceed 5
- n = total number of individuals in both samples
- p = proportion of individuals with condition
Assumes a common proportion

21
Q

How is the common proportion calculated in a comparison of two proportions?

A

p=(n1p1 + n2p2)/(n1+n2)
n = sample size
p = proportion with condition

22
Q

How is the standard error for the difference in proportions calculated, where a common proportion is assumed?

A

SE(p1-p2) = √p(1-p)(1/n1+1/n2)

Where p is the common proportion calculated

23
Q

How is the test statistic (z) calculated in the comparison of two proportions

A

z=(p1-p2)(-0)/SE(p1-p2)

24
Q

How is the 95% confidence interval for the difference in proportions calculated?

A

(p1-p2)+/- [1.96xSE(p1-p2)]

25
Q

How is standard error of the difference in proportions calculated if no common proportion is assumed?

A

SE(p1-p2) = √(p1(1-p1))/n1 + (p2(1-p2))/n2

26
Q

What test is used when comparing unordered, categorical data?

A

Chi-squared test (χ^2)

27
Q

What are the requirements for chi-squared test?

A
Two independent groups
Unordered, categorical variables
At least 80% expected cell counts ≥5
All expected cell counts ≥1
Expected values calculated by total proportion healed used to predict proportion healed for each variable
28
Q

Steps for chi-squared test

A
  1. Expected frequency = row totalxcolumn total/N where N is total sample size
  2. Calculate difference between observed and expected value for each cell
  3. Square each difference
  4. Divide resultant quantity by expected value
  5. Sum all of these to get a single number test statistic
  6. Compare with table of chi-squared distribution
29
Q

How is the degrees of freedom calculated in the chi-squared test?

A

(no. rows -1)x(no. columns -1)

30
Q

What correction can be used on 2x2 tables to allow chi-squared calculations

A

Yates correction

Chi-squared = SUM ((|O-E|-0.5)^2)/E

31
Q

What test is done on 2x2 table when expected cell counts <5, or any cell count <1

A

Fisher’s exact test

Estimates probability of falsely rejecting the null hypothesis exactly

32
Q

When is the chi-squared test for trend used?

A

In a 2x3+ table

When the variable with 3+ categories is ordered

33
Q

How can risk be compared in a 2x2 table?

A

Relative risk
(a/a+c)/(b/b+d)
Under the null hypothesis, the expected value is 1

Odds ratio
ad/bc
Expected value is 1 under null hypothesis

34
Q

How is the standard error of the natural logarithm of RR calculated?

A

SE(logeRR) = √(1/a)-(1/a+c)+(1/b)-(1/b+d)

35
Q

How is the 95% CI for logeRR calculated?

A

logeRR +/- 1.96xSE(logeRR)

This can then be converted using the function of e

36
Q

How is the standard error of the natural logarithm of OR calculated?

A

√(1/a)+(1/b)+(1/c)+(1/d)

37
Q

How is the 95% CI for logeOR calculated?

A

logeOR +/- 1.96xSE(logeOR)

Converted using e function

38
Q

How is categorial data with paired outcomes compared?

A

McNemar’s test

e.g. patients treated with two different treatments + outcomes compared

39
Q

How is McNemar’s test calculated?

A

(|b-c|-1)^2/b+c