Chi Square- Goodness of fit Flashcards

1
Q

Define nominal, ordinal, interval, and ratio scales of measurement with examples.

A
  • Nominal (or categorical): these are categories with no rank. e.g. hair colour, religion.
  • Ordinal: there are categories that are ranked or ordered. E.g. grades, likert scale. Only >< apply to these.
  • Interval: Quantitative measures without a true zero. They are usually scales constructed to quantify a property. e.g. temp, IQ, ph balance. Can perform +,-
  • Ratio: These are quantitative measure with a true zero. E.g. age, length, time, weight. Can perform +,-,x,/
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two different ways of classifying variables that determines the type of statistical test we use?

A
  1. Discrete (Nominal and Ordinal)- these can only have certain values within a range.
  2. Continuous (Interval and Ratio)- These can have any value in a range.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Refer back to first powerpoint for inferential test decision

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the difference between discrete and continuous variables?

A
  • Discrete variables have distinct and separate values, often represented by integers, and there are gaps between possible values
  • Continuous variables for a continuum with no gaps between values, can take any value within a range, and are often represented by real numbers.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are classifications and when are they most useful to a researcher?

A
  • Classifications are a form of measurement.
  • They are of interest to a researcher when they are exhaustive and mutually exclusive.
  • Statistical analyses require that data be classified into four scales of measurement (nominal, ordinal,interval, and ratio)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What type of data is the chi-square test primarily used for?

A

Nominal data. Although also ordinal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does it mean for events to be exhaustive and mutually exclusive?

A
  • Exhaustive means that the data encompasses all the members of a population.
  • Mutually exclusive means that no member of the population can belong to more than one category.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a contingency table?

A

A table used to present data classified with respect to two or more categorical variables.
- They also help us understand the two types of chi-square test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the two kind sof chi-square test?

A
  • Goodness of fit (unidimensional- one variable) Used to test a single dimension of data.
  • Test of contingency- multidimensional (two variables) Useful to test whether two variables are associated (are they contingent on each other?)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the rule about values in the cells of a contingency table?

A

The values should be absolute frequencies and not proportions or percentages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the main purpose of the chi-square test?

A

The chi-square test analyses frequency data to determine if there is a significant difference between observed and expected frequencies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the three assumptions underlying the chi-square test? (MEI)

A
  1. Mutually exclusive classification.
  2. Exhaustive categories
  3. Independence of observations (each count should be independent of another)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What three questions does the chi-square test ask?

A
  1. Do the observed values differ significantly from the expected values?
  2. Are the data (O) a good fit to the model (E)
  3. Does the data fit the expected pattern?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do you find the expected values in a contingency table?

A
  • Usually a 50/50 split but there is a formula.
    Expected frequency= (total of cell rows x total of cell columns)/grant total of all subjects.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the 6 steps for decision making in inferential statistics?

A
  1. Set up the research hypothesis (H1)
  2. Set up the null hypothesis (H0)
  3. Choose a significance level (a)
  4. Calculate the sample statistic
  5. Calculate the probability from hypothetical sampling distribution (p-value)
  6. Decide if result is representative of hypothetical distribution. If unlikely (p<a) reject the null hypothesis. It is reasonable to assume research hypothesis.
    *Note if you’re using one those table where you find the critical x^2 value using df and a, then you can reject the null hypothesis if the chi-squared statistic is greater than the critical value.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the initial steps involved in the goodness of fit chi-square test?

A
  1. Data: observed distribution on a single dimension
  2. Assumptions: Mutually exclusive and exhaustive classification, independent observations
  3. The theoretically predicted population distribution: H0 is observed distribution=expected distribution.
  4. Compare O and E: Calculate chi-square, df=k-1 where k is the number of groups, calculate p.
  5. Decide: if p<0.05 reject H0 therefore the observed distribution is different from the population distribution.
17
Q

What does it mean if the chi-squared statistic is greater than the critical value?

A

The null hypothesis is rejected. Therefore the observed frequencies are unlikely to have occurred by chance.

18
Q

What is the formula for calculating the chi-squre statistic?

A

sum all (o-e)^2/E

19
Q

What is the null hypothesis for a goodness of fit test?

A

THe observed distribution does not differ from the expected distribution.

20
Q

What are the two ways of determining significance?

A
  • Calculate the p-value
  • Find the critical value.