Reading Quiz 14 Flashcards

1
Q

three chi-square procedures

A
  1. chi-square goodness of fit test
  2. chi-square test for homogeneity of populations
  3. chi-square test of association/independence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

chi-square test for goodness of fit

A

tests the null hypothesis that a categorical variable has a specific distribution
aka X^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

chi-square test for homogeneity of populations

A

tests the null hypothesis that the distribution of a particular categorical variable is the same for all of the populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

chi-square test of association/independence

A

tests the null hypothesis that there is no relationship between two categorical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

expected count

A

the expected count for any variable category is obtained by multiplying the hypothesized proportion of the distribution for each category times the sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

chi-square statistic

A

X^2 = ∑ ((observed count - expected count)^2) / expected count aka ∑((O-E)^2)/E
where sum is over k variable categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

chi-square test compares the value of the statistic

A

X^2 with critical values from the chi-square distribution with k-1 degrees of freedom, where k = the number of categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Ho and Ha for chi-square GOF

A

Ho: the population proportions equal the hypothesized values (provide them)
Ha: at least one of the population proportions differs from its hypothesized value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

p-value is the

A

area under the density curve to the right of X^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

large values of X^2 are evidence

A

against Ho

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

the chi-square distribution is an approximation to the distribution of

A

the statistic X^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

can safely use the approximate (aka conditions) when sample is

A

an SRS from the population and when all expected counts are at least 1 and no more than 20% of all expected counts are less than 5 (state the expected counts!)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

if the chi-square test finds a statistically significant p-value, you are technically supposed to do a

A

follow-up analysis that compares the observed counts with the expected counts and that looks for the largest components of the chi-square statistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

two-way tables

A

first compute percents or proportions that describe the relationship of interest
then turn to formal inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

two different methods of generating data for two-way tables lead to the

A

chi-square test for homogeneity of populations and the chi-square test of association/independence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

chi-square test for homogeneity of populations

A

independent SRSs are drawn from each of several populations
each observation is classified according to a categorical variable of interest
null hypothesis is that distribution of categorical variable is same for all of the populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

one common use of the chi-square test for homogeneity of populations is to compare several

A

population proportions
the null hypothesis is that all of the population proportions are equal
the alternative hypothesis is that they are not all equal but allows any other relationship among the population proportions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

chi-square test of association/independence

A

a single SRS is drawn from a single population
observations are classified according to two categorical variables
null hypothesis is that there is no relationship between the row variable and the column variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

expected count

A

the expected count in any cell of a two-way table when Ho is true is
expected count = (row total * column total) / n
where n = sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

chi-square statistic

A

X^2 = ∑(O-E)^2 /E

where sum is over all r*c cells

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

the chi-square test compares the value of the statistic X6@

A

with critical values from the chi-square distribution with (r-1)(c-1) degrees of freedom
r = the number of rows
c= number of columns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

p-value is the

A

area under the density curve to the right of X^2

larger values of X^2 are evidence against Ho

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

chi-square distribution approximation to the distribution of

A

the statistic X^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

can safely use this approximation aka the conditions when all expected cell counts

A

are at least 1 and no more than 20% of all expected cell counts are less than 5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

for an independence/association test the sample must be gathered

A

by an SRS from the population

26
Q

for homogeneity all of the samples

A

must be independent SRSs from their respective populations

27
Q

Suppose that you are dealing with a situation where there are several possible outcomes, not just 2 (success and failure). You are interested in seeing whether the proportion of outcomes falling into each of a certain set of categories is consistent with a certain hypothesized population distribution. What is the name of the test you use?

A

chi-square test for goodness of fit

28
Q

Suppose that your hypothesized population distribution for the percent of objects that are certain colors is 20% black, 50% white, and 30% green. Suppose you draw a sample of 200, to test this hypothesis. What are the “expected” values that you use when you do the chi-square goodness of fit test?

A

40, 100, 60

29
Q

In testing the hypothesis mentioned in Q2, suppose your observed counts are 45, 90, and 65. What does chi-square equal for this goodness of fit test? Please write a numerical expression without calculating the result.

A

A. chi-square = (45-40)^2/40 + (90-100)^2/100 + (65-60)^2/60

30
Q

Is there just one chi-square distribution, or a family of distributions, with one distribution for each number of degrees of freedom?

A

A. A family, with one distribution for each number of degrees of freedom.

31
Q

How do you find the number of degrees of freedom for a chi-square goodness of fit test? For example, how many degrees of freedom would there be if you were looking at the proportion of blacks, whites, and greens as in Q2?

A

A. The degrees of freedom is one less than the number of categories in the distribution; for example, when there are blacks, whites, and greens, the number of degrees of freedom is 3-1=2.

32
Q

When you look up in a table or a calculator the P-value associated with a certain chi-square, what is that the probability of?

A

A. The probability of obtaining results as extreme as, or more extreme than, the ones you obtained, if the hypothesized distribution is true. (Extreme means deviant from what is expected.)

33
Q

Is the chi-square distribution symmetrical? If not, in which direction is it skewed?

A

skewed to the right

34
Q

When you are doing a chi-square test for goodness of fit, what are the hypothesis H0 and the alternative hypothesis Ha?

A

A. The H0 is that the population percents are equal to the set of hypothesized percents. The Ha is that the population percents do not equal that set of hypothesized percents.

35
Q

What are the rule of thumb conditions for the use of the chi-square goodness of fit test?

A

A. All individual expected counts are at least 1 and no more than 20% of the expected counts are
less than 5.

36
Q

If a chi-square goodness of fit test yields a significant result, what should you inspect before you interpret the results?

A

A. You see which observed counts deviated the most from the expected ones – in other words, you see which cells contributed the most to the chi-square that was calculated. You take these observations into account when interpreting your results.

37
Q

Two-way tables describe relationships between two categorical or continuous variables?

A

categorical

38
Q

When there are multiple comparisons that can be made, what two steps are often carried out?

A

A. First an overall test for evidence of any differences among the parameters being compared, and then a follow-up analysis to decide which parameters differ and to estimate how large the differences are.

39
Q

When doing a chi-square test to compare several proportions, the first step is to set up the table with the numbers in it being (proportions of success and number of trials, or counts of the number of cases falling into each category).

A

A. Counts of the number of cases falling into each category.

40
Q

When there are two categorical variables being displayed in an r by c table (with r rows and c columns), each of the r x c possible categories into which the observations may fall is called a _____ of the table.

A

cell

41
Q

When we are comparing the proportion of successes for three treatment conditions, what null hypothesis would we use?

A

A. That the proportion of successes is the same among all three conditions, i.e. that p1 = p2 = p3.

42
Q

When comparing the proportion of successes for three treatment conditions, what would be the alternative hypothesis?

A

A. That not all the proportions are equal.

43
Q

In testing Ho via chi-square with a two-way table, we compare the observed counts with the expected counts. Evidence against Ho consists of observed and expected counts that are far from each other or close to each other?

A

far from each other

44
Q

How do you compute the expected count in a certain cell of a two-way table?

A

A. The expected count is the (row total * column total)/table total.

45
Q

The calculation of the expected value for a cell of a two-way table assumes what relationship between the row and column variables is (disjoint or independent).

A

independent

46
Q

When you want to test the statistical significance of the deviation of observed from expected counts, in a two-way table, using chi-square, how do you compute the chi-square statistic?

A

A. chi-square is the summation of the (observed count - expected count)^2/expected count. The summation is over all r * c cells of the table.

47
Q

Large values of chi-square are evidence for, or against Ho? Why?

A

A. Against. This is because chi-square will be bigger, the bigger are the deviations of observed
counts from those that would be expected under Ho.

48
Q

How many degrees of freedom do you have in a chi-square test with an r * c two-way table?

A

A. (r-1)(c-1)

49
Q

True or False: when doing chi-square tests, the p-value is always the area under the distribution curve that is to the right of the observed chi-square, and never the area to the left.

A

A. True. For the chi-square distribution, the farther you go to the right, the more you have deviated from the null hypothesis. The value most consistent with the null hypothesis is 0, which is the left end of the domain for the function. To get the probability of results as deviant as, or more deviant than, the obtained results, you look at the probability under the curve to the right of the obtained results. (This includes the probability exactly at the obtained results, but since chi-square is a continuous function, the distinction between “above” and “at or above” is not meaningful.)

50
Q

What cell counts are required for doing a chi-square test for homogeneity of populations?

A

A. The same as for tests of goodness of fit: all expected counts are 1 or greater, and no more than
20% of the expected counts are less than 5.

51
Q

In the special case of a two-by-two table (r=2 and c=2), how many cell counts need to be 5 or greater in order to do a chi-square?

A

all four of them

52
Q

How many degrees of freedom would be used for a 3 by 2 table?

A

A. (3-1)*(2-1) = 2

53
Q

After having done an overall test rejecting the hypothesis that all the proportions are equal, what should be done?

A

A. A follow-up analysis that asks which cells most contribute to the deviation from expectations under the null hypothesis. You can do this informally by observation; there are more formal methods that do significance tests and confidence intervals for the individual proportions.

54
Q

True or False: the chi-square tests the hypothesis that “the row and column variables are not related to each other,” even when it is difficult to conceive of the groups defined by the rows and columns as different populations, i.e. when you are dealing with the relation of some variables in one population.

A

true

55
Q

True or False: for a chi-square test of association/independence of variables, you compute the expected counts just as in the other situations: the row total * column total/ table total.

A

true

56
Q

True or False: converting table entries to percents is not necessary for the computation of chi-square, but it does help to shed light on the association among the variables.

A

true

57
Q

For a chi-square test of association/independence of variables, what is the null hypothesis?

A

A. That the variables are independent, or that there is no association between them.

58
Q

True or False: the distinction between tests of homogeneity of populations and tests of association/independence is that in the first, there is a sample from each of two or more populations, and in the second, there is a single sample from a single population.

A

A. True. (However, distinguishing whether there is one or more than one population involved in a study can be a debatable procedure. If you collect a sample of people, some of whom are wealthy and some of whom are poor, can you argue that you have sampled some individuals from the population of poor people and some from the population of rich people? Or have you drawn from one population of people, who simply differ in one variable? Fortunately, the chi-square test is done in the same way regardless of the outcome of such a debate.)

59
Q

When there is a two-by-two table, and you wish to compare two proportions, how will a two-sided z test for equality of proportions and a chi-square test compare with respect to the p values that result?

A

A. The same p values will result.

60
Q

If there is a two-by-two table and you wish to compare two proportions, which test is usually recommended, between a z test and a chi-square, and why?

A

A. The z test has the advantages that it is related to a confidence interval for the difference in proportions, plus you can do a one-sided test if desired.