Week 4 Flashcards

1
Q

What is Chi-squared ?

A

A test of difference between categorical variables (Nominal / ordinal)

Unlike binomial tests it isn’t limited to dichotomous variables (success/fail) and can test more than 2 categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two types of chi-squared tests?

A

1) Goodness of fit test

2) Test of association/independence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Benford’s law?

A

AKA First digit law
It states the frequency of first digits of naturally occurring numerical data such as prices or populations is likely to be small.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are paired samples? Example?

A

It is data across the same group before and after intervention or under two different conditions

Example - Usually tests before and after such as whether nicotine patches help quit smoking, the two variables are the nicotine patches and whether they smoked before the test and if they smoke after.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a student t-test?

A

Difference in means of a group of measures of continuous variables (interval/ratio)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

3 types of Student t-test? Which test do they correspond to?

A

Correspond to the tests for nominal/ordinal values

1) One sample t-test - binomial or Chi square goodness of fit
2) Independent/unpaired sample t-test - Chi-square test of association
3)Paired samples test - McNemar’s test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When is the One sample t-test used? Difference to Chi? Example?

A

When we want to test whether the mean of a single sample of continuous data (Interval/ratio) differs from known/hypothesised mean

It differs from chi-squared as it is not categorical data and measures means rather than frequencies/proportions

Example - Comparing 20 Male student Vo2 max tests to 20 published Vo2 max tests.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When is the Independent/unpaired samples t-test used? Difference to Chi? Example?

A

Compares the observed difference between the means of two independent groups

It differs as it measuring means not frequencies, of continuous (numeric) data.

Example - Comparing exam marks from Class A and Class B

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When is the paired t-test used? Difference to Chi? Example?

A

Compares the means of two related groups / a group measure on two different occasions and the difference is compared.

Data is continuous, measures means.

Example - Comparing weight of participants before and after lockdown

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does Normality mean in terms of t-tests?

A

It is the assumption of normality, it is when we assume the distribution of the test results are evenly/normally distributed and shows it is accurate, it is often common with a sufficiently large sample size as it is more forgiving

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the tests for sample t-tests?

A

Parametric tests - Statistical tests based on the normality assumption.

Test of normality - Should not assume normality, so use tests such as Shapiro-Will test to find a p-value to determine significance

Non Parametric tests - Don’t require the normality assumption, focuses more on order and ranking instead of face value data

Equality of variance - Using Levene’s test of EV, to test significance between two variances and whether they are equal, if not equal must do Welch’s test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are T-Statistics ?

A

What T-Tests are based off.
Similar to z-score but it is about Mean and SD of sample not population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Equation for T Value?

A

Degree of freedom (df) = sample size - number of groups

Greater T value means greater difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How is t-test data reported? Example

A

T value, Degree of freedom and p-value, usually with descriptive statistics such as Mean and SD.

Example - T(48) = -3.1, p = 0.003

T value with 48 degrees of freedom
-3.1 size of the difference
P = .003 = significant difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How to calculate final report ? With example

A

1) Formulate hypotheses - null and alternative = (no)significant difference between two groups (group a and b exam marks)

2) Calculate means and SD
Given -
A = 78 mean, 10 SD, 25 sample size
B = 85 mean, 12 SD, 25 sample size

3) Calculate standard error and t-stats =
SE = Mean A - Mean B / Square root of (SD A^2 / sample size) = SD B^2 / sample size)
SE = 78 - 85 / square root of 10^2/25 + 12^2/25 = 3.12
T = -7 / 3.12 = 2.24

4) Calculate DF
DF = Sample A + B - 2 (groups)

5) Determine p-value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

When is the chi-squared goodness of fit test used? Difference to TOA and Binomial? Example?

A

When analysing categorical data (nominal/ordinal), With more than 2 levels.
We want to see if observed/hypothesised data frequencies are the same expected.

Instead of finding relationship between two or more categories, binomial can only do 2 not more

Example - Rolling a dice to see if it is fair/even, what numbers they get for n-tosses

17
Q

When is the chi-squared test of association used? Difference to GOF? Example?

A

To determine a relationship between 2 or more categorical (ordinal/nominal) variables

Instead of testing to see if observed frequencies match expected

Example - Whether people in the UK in different age groups have different car preferences

18
Q

Difference between McNemar’s and paired sample t-test?

A

McNemar’s is categorical and paired sample is continuous