PAS - Data Analysis Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

what is a t-test?

A

statistical test that compares means to determine if there’s a statistical difference between two groups

data is assumed to be parametric - follows a normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

statistically significant p value?

A

when the p value is smaller than the alpha value/ threshold

reject null hypothesis (Ho)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

when the p value is greater than the alpha value/ threshold (0.05)…

A

no stastically significant difference

accept null hypothesis (Ho)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

risk of smaller alpha value threshold?

A

higher risk for false negatives/ type 2 errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

risk of larger alpha value?

A

higher risk of false positives/ type 1 errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

paired t test?

A

comparing measurements taken from the same subjects at different timepoints

e.g. blood pressure measurements before and after treatment from the same person

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

unpaired t test?

A

comparing measurements taken from two independent samples/ subjects

e.g. blood pressure measurements forma control vs treatment group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

parametric data?

A

data assumed to follow a normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

non-parametric data?

A

skewed data - doesn’t follow a normal distribution

small sample sizes are often non-parametric data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

how to identify if data is parametric?

A
  1. evaluate data distribution by plotting the data - e.g. histogram plots
  2. test of normality - e.g. Shapiro-Wilk test - p < 0.05 = sufficient evidence to suggest non-parametric data
  3. assess sample size - small sample sizes will likely produce non-parametric data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

statistical test conducted with parametric & paired data?

A

paired t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

statistical test conducted with parametric & unpaired data?

A

unpaired t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

statistical test conducted with non-parametric & paired data?

A

Wilocon signed rank

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

statistical test conducted with non-parametric & unpaired data?

A

Mann-Whitney U test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

one-tailed test?

A

test assessing the possibility of effect in one direction between two groups

e.g. treatment causing only an increase in survival

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

two-tailed test?

A

test assessing the possibility of effect in both direction between two groups

e.g. treatment causing an increase or decrease (any change) in blood pressure between two groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

risk of conducting multiple statistical tests on data?

A

increases chance of false positives/ type 1 errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

what is ANOVA?

A

analysis of variance - statistical analysis test to analyse the differences between means of more than two groups, make multiple comparisons

assume data is parametric & independent/ unpaired samples

19
Q

one way ANOVA?

A

compares the difference between means of three or more independent groups on a single continuous variable

e.g. three independent groups & the effects of treatment on blood pressure

20
Q

two way ANOVA?

A

compares how the mean of two quantitative/categorical variables change with a single continuous variable

e.g. sex of subjects within sample (two categorical variables) & effects of treatment on their blood pressure

21
Q

statistical type of ANOVA test for unpaired, parametric data?

A

one way ANOVA

22
Q

statistical type of ANOVA test for paired, parametric data?

A

repeated measures ANOVA

(tests whether there are statistically significant differences in three or more dependent samples)

23
Q

statistical type of ANOVA test for unpaired, non-parametric data?

A

Kruskal-Wallis test

24
Q

statistical type of ANOVA test for paired, non-parametric data?

A

Friedman test

25
Q

what is a categorical variable?

A

variables with discrete options/states with no predictable quantitative scale/ relationship between them

e.g. success/failure, hair colour, race, sex

26
Q

what is a continuous variable?

A

variables measured on a quantitative scale

e.g. age, height, weight, temperature, blood pressure measurements

27
Q

what are post-tests?

A

evaluates specific pairwise comparisons following significant ANOVA tests results

28
Q

what are the three types of post-tests?

A
  1. Tukey’s test
  2. Punnett’s test
  3. Bonferroni correction
29
Q

Tukey’s test?

A

pairwise comparison post-test following ANOVA for EVERY possible combination of groups

e.g. A compared with B, C, D; B compared with C, D,; C compared with D

(all combinations of all groups = p-values for each combination are calculated)

30
Q

Punnett’s test?

A

pairwise comparison post-test following ANOVA for each group to a control group

e.g. A(control) compared with B, C, D
= p-values for each combination are calculated and compared

31
Q

Bonferroni’s test?

A

pairwise comparison post-test following ANOVA for any chosen combination (not adhering to a pattern)

e.g. p-values calculated and compared between B and D

32
Q

what are the three statistical tests for analysing categorical data?

A

Pearson’s Chi-squared test
Fisher’s exact test
McNemar’s test

33
Q

Pearson’s Chi-squared test?

A

for large sample sizes and 2x2 or larger contingency tables with unpaired/ independent data

tests the null hypothesis - no significant difference between the expected and observed frequencies

34
Q

Fisher’s exact test?

A

for small sample sizes (or when expected frequency for a category are less than 5)

ideal for 2x2 contingency tables with unpaired/ independent data

35
Q

McNemar’s test?

A

for categorical data in 2x2 contingency tables with paired data

determines if there’s a significant difference in proportions for matched pairs

36
Q

types of data for Chi-squared tests analysis?

A

categorical data
independent samples (unpaired)
not a 2x2 contingency table or small sample size - analyses data that doesn’t meet Fisher’s exact test requirements

37
Q

types of data for Fisher’s exact test analysis?

A

categorical data
independent samples (unpaired)
2x2 contingency table or small sample size (under 10 samples)

38
Q

types of data for McNemar test anaylsis?

A

categorical data
paired data (non-independent)
conforms to 2x2 contingency table

39
Q

ideal post-hoc tests for multiple comparisons of categorical data (i.e. with 2x3 or more contingency tables)?

A

chi-squared or Fisher’s exact tests with Bonferroni correction

adjusts p values to account for multiple comparisons by multiplying the raw p-value by the number of comparisons (p x n)

helps control type 1 errors/ false positives

40
Q

contingency table?

A

displays frequencies/outcomes for two categorical variables

41
Q

limitations of p-values?

A

p-values assess whether observed differences are likely due to random variation or actual differences between groups - don’t measure the practical significance or magnitude of the difference

e.g. a statistically significant p-value result might only represent a minor difference that isn’t practically meaningful

42
Q

confidence interval?

A

alternative to p-values

gives a range of values within which the true effect size is expected to lie with a certain probability

e.g. a 95% CL means the true mean difference between groups would lie within that mean range 95% of the time

43
Q

compare p-values and confidence intervals?

A

p-value = indicates the probability of observing the data if the null hypothesis is true
- p value smaller than the alpha threshold suggests there’s a greater chance of the observed data being statistically different to the null hypothesis/ expected data

confidence interval = provides a range in which the true (mean) difference between two groups is expected to lie within the given CL % value

44
Q

statistical significance, p-values confidence intervals? - based on 0

A

if a 95% CL doesn’t contain 0 within its range - the result is statistically significant at α = 0.05
- implies p < 0.05

if a 95% CI contains zero - the result is NOT statistically significant at α = 0.05
- implies p > 0.05

if 95% CL has 0 as a boundary - p = 0.05