Week 4 Flashcards

Question 1

Q

What is the chi-squared test?

Answer

A

A test of differences in frequencies for categorical variables (nominal/ordinal).

Unlike the binomial test, it isn’t limited to dichotomous variables (success/fail) and can test more than 2 categories.

Question 2

Q

What are the two types of chi-squared tests?

Answer

A

1) Goodness of fit test

2) Test of association/independence

Question 3

Q

What is Benford’s law?

Answer

A

AKA the first-digit law.
It states that in naturally occurring numerical data such as prices or populations, the first digit is likely to be small: 1 is the leading digit about 30% of the time, and the frequency falls as the digit gets larger.
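
As a quick check on the first-digit law, the expected share of each leading digit can be computed from the standard Benford formula, P(d) = log10(1 + 1/d). This is only a sketch; the variable name `benford` is made up for illustration.

```python
import math

# Benford's law: probability that the first digit is d, for d = 1..9
benford = {d: math.log10(1 + 1 / d) for d in range(1, 10)}

for digit, prob in benford.items():
    # Smaller digits get larger probabilities
    print(f"{digit}: {prob:.3f}")
```

These expected proportions are what a chi-squared goodness of fit test would compare observed first-digit counts against.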

Question 4

Q

What are paired samples? Example?

Answer

A

Data from the same group measured before and after an intervention, or under two different conditions.

Example - Testing whether nicotine patches help people quit smoking: the same participants are recorded as smoking or not smoking before using the patches, and again after.

Question 5

Q

What is a student t-test?

Answer

A

A test of the difference in means for continuous variables (interval/ratio).

Question 6

Q

What are the 3 types of Student t-test? Which categorical tests do they correspond to?

Answer

A

Each one corresponds to a test for nominal/ordinal (categorical) data:

1) One sample t-test - binomial test or chi-squared goodness of fit
2) Independent/unpaired samples t-test - chi-squared test of association
3) Paired samples t-test - McNemar’s test

Question 7

Q

When is the One sample t-test used? Difference to Chi? Example?

Answer

A

When we want to test whether the mean of a single sample of continuous data (interval/ratio) differs from a known/hypothesised mean.

It differs from chi-squared in that the data are not categorical and it compares means rather than frequencies/proportions.

Example - Comparing the mean VO2 max of 20 male students to a published mean VO2 max value.
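
A minimal sketch of the one sample t-test calculation, assuming the usual formula t = (sample mean - hypothesised mean) / (SD / square root of n) with df = n - 1. The function name, the VO2 max values and the hypothesised mean of 45 are all made up for illustration.

```python
import math
import statistics

def one_sample_t(sample, hypothesised_mean):
    """Return (t, df) for a one sample t-test."""
    n = len(sample)
    mean = statistics.mean(sample)
    sd = statistics.stdev(sample)   # sample SD (divides by n - 1)
    se = sd / math.sqrt(n)          # standard error of the mean
    return (mean - hypothesised_mean) / se, n - 1

# Hypothetical VO2 max scores, compared to a hypothetical published mean
vo2 = [44.0, 46.0, 48.0, 50.0, 52.0]
t, df = one_sample_t(vo2, 45.0)
print(f"t({df}) = {t:.2f}")
```

The p-value would then be looked up for this t value and df.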

Question 8

Q

When is the Independent/unpaired samples t-test used? Difference to Chi? Example?

Answer

A

Compares the observed difference between the means of two independent groups

It differs in that it compares means, not frequencies, and uses continuous (numeric) data.

Example - Comparing exam marks from Class A and Class B

Question 9

Q

When is the paired t-test used? Difference to Chi? Example?

Answer

A

Compares the means of two related groups, or of one group measured on two different occasions; the test is run on the difference between each pair of measurements.

It differs from chi-squared in that the data are continuous and it compares means.

Example - Comparing weight of participants before and after lockdown
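
A minimal sketch of the paired calculation, assuming the usual approach of running a one sample t-test on the per-person differences (after minus before). The function name and the weights are made up for illustration.

```python
import math
import statistics

def paired_t(before, after):
    """Return (t, df) for a paired samples t-test."""
    # One difference per participant: after minus before
    diffs = [a - b for b, a in zip(before, after)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    se = statistics.stdev(diffs) / math.sqrt(n)  # SE of the mean difference
    return mean_diff / se, n - 1

# Hypothetical weights (kg) of the same 5 people before and after lockdown
before = [70.0, 82.0, 65.0, 90.0, 77.0]
after = [72.0, 85.0, 66.0, 94.0, 78.0]
t, df = paired_t(before, after)
print(f"t({df}) = {t:.2f}")
```

Because each person acts as their own control, only the spread of the differences matters, not the spread of the weights themselves.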

Question 10

Q

What does Normality mean in terms of t-tests?

Answer

A

It refers to the assumption of normality: we assume the data are normally distributed, which the t-test relies on to be accurate. The test is more forgiving of departures from normality when the sample size is sufficiently large.

Question 11

Q

What assumption checks and related tests go with t-tests?

Answer

A

Parametric tests - Statistical tests based on the normality assumption.

Test of normality - Normality should not simply be assumed, so use a test such as the Shapiro-Wilk test; a significant p-value suggests the data are not normally distributed.

Non-parametric tests - Don’t require the normality assumption; they work on the order and ranking of the data rather than the raw values.

Equality of variance - Use Levene’s test of equality of variances to check whether the two group variances are equal; if it is significant (variances not equal), use Welch’s t-test instead.

Question 12

Q

What are t-statistics?

Answer

A

What t-tests are based on.
Similar to a z-score, but calculated from the mean and SD of the sample rather than the population.

Question 13

Q

Equation for T Value?

Answer

A

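T = difference between means / standard error of the difference

Degrees of freedom (df) = sample size - number of groups

A larger absolute T value means a greater difference relative to the variability in the data.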

Question 14

Q

How is t-test data reported? Example

Answer

A

The t value, degrees of freedom and p-value, usually with descriptive statistics such as mean and SD.

Example - t(48) = -3.1, p = 0.003

48 = degrees of freedom
-3.1 = t value (size of the difference relative to the standard error)
p = .003 = significant difference
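
The reporting format can be sketched as a small helper. The function name `report_t_test` and the 0.05 significance threshold are assumptions for illustration, not part of the original card.

```python
def report_t_test(t, df, p, alpha=0.05):
    """Format a t-test result as t(df) = value, p = value."""
    # alpha is the assumed significance threshold
    verdict = "significant" if p < alpha else "not significant"
    return f"t({df}) = {t:.1f}, p = {p:.3f} ({verdict} at alpha = {alpha})"

line = report_t_test(-3.1, 48, 0.003)
print(line)
```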

Question 15

Q

How do you work through a t-test for the final report? With example

Answer

A

1) Formulate hypotheses - null = no significant difference between the two groups; alternative = a significant difference (Group A and Group B exam marks)

2) Calculate means and SD
Given -
A = 78 mean, 10 SD, 25 sample size
B = 85 mean, 12 SD, 25 sample size

3) Calculate standard error and t-statistic
SE = square root of (SD A^2 / sample size A + SD B^2 / sample size B)
SE = square root of (10^2/25 + 12^2/25) = square root of 9.76 = 3.12
T = (Mean A - Mean B) / SE = (78 - 85) / 3.12 = -7 / 3.12 = -2.24

4) Calculate DF
DF = Sample A + Sample B - 2 (groups) = 25 + 25 - 2 = 48

5) Determine p-value
Look up t = -2.24 with 48 df: the two-tailed p-value is below .05, so the difference is significant.
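
The worked example can be checked with a short sketch that uses only the summary statistics given in step 2. The function name is made up for illustration. Note that t comes out negative because Group A's mean is lower than Group B's.

```python
import math

def t_from_summary(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Return (SE, t, df) for an independent samples t-test from summary stats."""
    # Standard error of the difference between the two means
    se = math.sqrt(sd_a**2 / n_a + sd_b**2 / n_b)
    t = (mean_a - mean_b) / se
    df = n_a + n_b - 2  # total sample size minus number of groups
    return se, t, df

se, t, df = t_from_summary(78, 10, 25, 85, 12, 25)
print(f"SE = {se:.2f}, t({df}) = {t:.2f}")  # SE = 3.12, t(48) = -2.24
```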

Question 16

Q

When is the chi-squared goodness of fit test used? Difference to TOA and Binomial? Example?

Answer

A

When analysing a single categorical variable (nominal/ordinal) with more than 2 levels.
We want to see whether the observed frequencies match the expected (hypothesised) frequencies.

Unlike the test of association, it does not look for a relationship between two or more categorical variables. Unlike the binomial test, it can handle more than 2 categories.

Example - Rolling a die n times and counting how often each number comes up, to see whether the die is fair.
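
A minimal sketch of the goodness of fit statistic for the die example, assuming the standard formula: the sum of (observed - expected)^2 / expected, with df = number of categories - 1. The function name and the roll counts are made up for illustration.

```python
def chi_square_gof(observed, expected):
    """Return (chi-squared statistic, df) for a goodness of fit test."""
    stat = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return stat, len(observed) - 1

# Hypothetical counts of each face from 60 rolls; a fair die expects 10 each
observed = [8, 12, 9, 11, 6, 14]
expected = [10] * 6
stat, df = chi_square_gof(observed, expected)
print(f"chi-squared({df}) = {stat:.2f}")  # chi-squared(5) = 4.20
```

With 5 df the critical value at the .05 level is 11.07, so a statistic of 4.20 gives no evidence that this hypothetical die is unfair.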

Question 17

Q

When is the chi-squared test of association used? Difference to GOF? Example?

Answer

A

To determine whether there is a relationship between 2 or more categorical (ordinal/nominal) variables.

It differs from goodness of fit, which tests whether the observed frequencies of one variable match the expected frequencies.

Example - Whether people in different age groups in the UK have different car preferences
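
A minimal sketch of the test of association, assuming the standard approach: each cell's expected count is row total x column total / grand total, and df = (rows - 1) x (columns - 1). The function name and the table of counts are made up for illustration.

```python
def chi_square_association(table):
    """Return (chi-squared statistic, df) for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, obs in enumerate(row):
            # Expected count if the two variables were independent
            exp = row_totals[i] * col_totals[j] / grand_total
            stat += (obs - exp) ** 2 / exp
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df

# Hypothetical counts: rows = age group (under 40, 40+),
# columns = car preference (hatchback, SUV)
table = [[30, 10], [20, 40]]
stat, df = chi_square_association(table)
print(f"chi-squared({df}) = {stat:.2f}")
```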

Question 18

Q

Difference between McNemar’s and paired sample t-test?

Answer

A

McNemar’s test is for paired categorical data; the paired samples t-test is for paired continuous data.
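
To show the categorical side, here is a minimal sketch of McNemar’s statistic, assuming the standard form without continuity correction: (b - c)^2 / (b + c) with 1 df, where b and c count the pairs that changed category in each direction. The function name and counts are made up for illustration.

```python
def mcnemar_statistic(b, c):
    """Return McNemar's chi-squared statistic (1 df, no continuity correction).

    b = pairs that changed from 'yes' to 'no'
    c = pairs that changed from 'no' to 'yes'
    """
    return (b - c) ** 2 / (b + c)

# Hypothetical: 15 people smoked before the patches but not after,
# 5 did not smoke before but did after
stat = mcnemar_statistic(15, 5)
print(f"chi-squared(1) = {stat:.2f}")  # chi-squared(1) = 5.00
```

Only the pairs that changed enter the calculation, whereas the paired samples t-test uses the size of every pair's numeric difference.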