Module 3 Flashcards

1
Q

How do you summarise categorical data?

A
  • frequencies, proportions or percentages
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What test is used to compare the distribution of categorical variable to the hypothesised distribution?

A

chi squared goodness-of-fit test /one sample chi squared test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an example of a one-sample chi squared test hypothesis?

A

H0: is evenly distributed
H1: is not evenly distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why is the one sample chi-squared test (chi squared goodness-of-fit test) used?

A
  • to quantify the discrepancy between the expected and observed frequencies
    e. g. between sample and hypothesised value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the shape of the chi-squared distribution?

A
  • non-symmetric (always positive)

- changes with the df (degrees of freedom)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is degree of freedom?

A

= number of groups -1

- indicates how many of the data points are ‘flexible’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What test is used to look at association between 2 categorical variables?

A
  • chi-squared test of independence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How is the test statistic calculated for chi-squared test of independence?

A
  • same as normal except need to calculate for each cell in the table
    e. g. (column total x row total)/overall total
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do you calculate the df for a contingency table/cross-table?

A

df = (number of rows - 1) x (number of columns - 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does a small x^2 value mean?

A
  • when the observed value is approximately eyqla o the expected value in each cell
  • only vary due to sample variability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What causes a large x^2 value?

A
  • sample variability (given by p-value)

- Null hypothesis is not true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the chi-squared test of independence assumptions?

A
  • the observational units are independent

- the expected cell counts should be >5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the limitations of x^2 test of independence?

A
  • not informative about how variables are related

- only really be used for bivariate analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are other options for assessing associations in categorical variables?

A
  • relative risk

- odds ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Can chi-squared test of independence be used for before and after?

A
  • no because the measurement is on the same individual
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a McNemar’s test?

A
  • used for 2x2 tables to test repeated measurments on the same variable

SIMPLE CONCEPT:

  • if no change, participants stay on diagnoal
  • if change, participants move off the diagonal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What test is used for continuous data?

A

one sample t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is a t-test?

A
  • parametric test used for testing differences in means

- tests the hypothesis that the means of a sample is equal to a fixed value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What does the one sample t-test assume?

A
  • data is normal distributed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the test statistic equation for a one-sample t-test?

A

t = (sample mean - expected value)/ (sample sd/ square root of the sample size)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

what influence the fatness of the t-test distribution tail?

A
  • degrees of freedom
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What makes a t-distribution more normally distributed?

A
  • bigger sample size/more df
23
Q

What is the t-test distribution if n>30?

A
  • sampling distribution of means is approximately normally distributed
24
Q

When is a two sample/independent sample t-test used?

A

compare two groups

  • dependent is continuous
  • independent is categorical
25
Q

What are the assumptions for a two-sample t-test?

A
  • distribution is normally distributed or >30
  • results come from two independent samples
  • variances in the two groups are the same
26
Q

what test is used if it is unknown if the sample is normal?

A

non-parametric test (mann-whitney test)

27
Q

What does a mann-whitney test/Wilcoxon Rank sum test compare?

A
  • medians of two samples
28
Q

When is a paired t-test used?

A
  • before and after

- left and right arm

29
Q

What is a ANOVA t-test?

A
  • one way analysis of variance

- used when >2 groups

30
Q

What is the ANOVA hypotheses?

A
Null = means are the same
H1 = at least one mean differs
31
Q

What are the two types of variation within data for ANOVA?

A
  • between groups

- within group

32
Q

How can you tell if the variation is between groups?

A
  • distributions are at different levels of the x-axis
33
Q

How can you tell if the variation is within groups?

A
  • the distributions overlap but are very wide
34
Q

How do you calculate total variation?

A

sstotal (sum of squares) = sum of (mean - overall mean)^2

35
Q

What does the conversation of SS to MS (mean square) for?

A
  • account for different df in each calculation
36
Q

What is the total variance equation?

A

MStotal = SStotal/(N-1)

37
Q

What is the between group variance equation?

A

MSgroups = SSgroups/(k-1)

38
Q

What is the within group variance equation?

A

MSerror = SSerror/(N-k)

39
Q

What does the ratio of MSgroups/MSerror show?

A
  • how much bigger groups effect is compared to random noise
40
Q

What is the variance ratio?

A
  • ratio of two variances

- denoted F (f is the test statistic for ANOVA)

41
Q

When is a post-hoc test used?

A
  • if the H0 is rejected in an ANOVA to determine which means are different
42
Q

What is the most common post hoc test?

A
  • tukey
43
Q

What is the steps for interpreting ANOVA results?

A
  • check ANOVA assumptions
  • conduct ANOVA
  • if p-value>0.5 then do not reject H0)
  • if p-value<0.5 then reject and do post-hoc testing
44
Q

What test is used for normality assumption (ANOVA)?

A
  • non-parametric test such as kruskal-wallis test
45
Q

What test is used for equal variances assumption?

A
  • levene’s test to test the H0 that variances of groups is the same
  • if test is significant the variances are not equal
46
Q

what test statistics equation is used for one-sample chi squared test?

A

= observed - expected/precision

47
Q

What type of test is a t-test?

A
  • parametric test
48
Q

What is an assumption of a parametric test?

A
  • assumes the data follows a known distribution
49
Q

What does a one-sample t-test test?

A
  • the hypothesis that the mean of a sample is equal to a fixed value
50
Q

What t-test is used to decide if variances are equal?

A

Levene’s test

51
Q

What is an advantage of paired test?

A
  • takes out the variation between patients and only the effect of a drug
52
Q

What are the assumptions for ANOVA?

A
  • normally distributed or >30
  • equal variances
  • independence among observations
53
Q

What number of type 1 error is achieved after all post-hoc tests?

A
  • 0.05