Test of association Flashcards

1
Q

What is the Poisson Distribution used for? What are its assumptions?

A

A discrete probability distribution, often used to describe the number of rare events. tells you if events are random

Assumptions: Events are random and independent
Parameter is λ
Mean and variance are equal: μ = σ²= λ

Can make hypothesis about a process and calculate an expected value and work out whether it is close to the true value or not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does a goodness-of-fit test look for (Chi-squared)?

What are the different notations used?

A

Evidence that observed data is consistent with some claimed distribution

Looks for associations between categorical predictor variables= Not continuous

Notation:
n total number of trials
k number of possible outcomes for each trial (how many categories there are in total)
O Observed frequencies (counts) [# trials associated with the outcome]
E Expected frequencies (counts) [# trials associated with the outcome]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When do you reject the null hypothesis?

What does that tell you?

A

If χ^2 > χ^2crit
The larger the test statistic, the bigger the difference and therefore the more significant it will be

Know that there is some association but the test does not tell us which categories within the variables differ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When should a Chi squared test be used?

A

If the expected number of counts in all categories is 5 or more

If expected number of counts in one or more categories is less than 5, should merge one or more categories to create new categories so that each has an expectation of 5 or more

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does a contingency table show?

A

Describes frequencies corresponding to paired variables

Tests for any association between two categorical variables- does knowing one variable help predict the other?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do you calculate expected values?

A

Using the observed table, take the totals from the row and the column, multiply them together and divide them by the overall total

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do you interpret P-values near 0.05?

A

Think about what might have happened if one or a few of the sample data were slightly different- could a small change make you change from retaining or rejecting the null hypothesis?

Do not make a strong claim about the results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly