Chapter 12 Inference on Categorical Data Flashcards
1. Use the conditional distribution to identify association among categorical data. 2.Identify independent events. 3.Compute the mean and standard deviation of a binomial random variable. 4. Test hypotheses regarding two proportions from independent samples. 5. Perform a test for independence. 6. Perform a test for homogeneity of proportions.
Marginal Distribution of a Variable
A frequency or relative frequency distribution of either the row or column variable in the contingency table.
A marginal distribution removes the effect of either the row variable or the column variable in the contingency table.
Conditional Distribution
A list of the relative frequency of each category of the response variable, given a specific value of the explanatory variable in a contingency table.
Contingency Table
A table that relates two categories of data. Also called a two-way table.
Lurking given by E=μ=np.
An explanatory variable that was not considered in the observational study, but that affects the value of the response variable. In addition, lurking variables are typically related to explanatory variables considered in the study.
Simpson’s Paradox
Describes a situation in which an association between two variables inverts or goes away when a third variable is introduced to the analysis.
Multiplication Rule for Independent Events
If two events E and F are independent, then:
P(EandF) = P(E) ⋅ P(F)
The expected value of a binomial random variable for n independent trials of a binomial experiment with probability of success p
It is given by E = μ = np.
Chi-Square Test for Independence
Used to determine whether there is an association between a row variable and a column variable in a contingency table constructed from sample data.
The null hypothesis is that the variables are not associated, or independent.
The alternative hypothesis is that the variables are associated, or dependent.
Finding the Expected Frequencies in a Chi-Square Test for Independence
Multiply the cell’s row total by its column total and divide this result by the table total. That is,
Chi-Square Test for Independence Using the TI-84 Calculator
- Access the MATRIX menu. Highlight the EDIT menu, and select 1: [A].
- Enter the number of rows and columns of the contingency table (matrix).
- Enter the cell entries for the observed matrix, and press 2nd QUIT.
- Press STAT, highlight TESTS, and select C: $\chi ^2$-Test.
- With the cursor after Observed:, enter matrix [A] by accessing the MATRIX menu, highlighting NAMES, and selecting 1:[A].
- With the cursor after Expected:, enter matrix [B] by accessing the MATRIX menu, highlighting NAMES, and selecting 2:[B].
- Highlight Calculate or Draw, and press ENTER.
Results will be the test statistic and P-value.
Chi-Square Test for Independence or Homogeneity of Proportions
- Enter the given contingency table into a worksheet, including column and row labels.
- Select XLSTAT > Correlation/Association tests > Tests on contingency tables (Chi-square…)
- General Tab: Fill in the box as follows:
Contingency table: Highlight the contingency table cell range (such as A1:D4)
Data format: Select the contingency table option.
Range/Sheet/Workbook: Select the Sheet option.
Labels included: Check this box. - Options Tab: Check the box for Chi-square test
- Outputs tab: Check the boxes for Theoretical frequencies, Proportions/Row, and Proportions/Column. Also select Proportions. Click OK.
Chi-Square Test for Homogeneity of Proportions
A test of whether different populations have the same proportion of individuals with some characteristic.
Chi-Square Test for Homogeneity of Proportions Using TI-84 Calculator
- Access the MATRIX menu. Highlight the EDIT menu, and select 1: [A].
- Enter the number of rows and columns of the contingency table (matrix).
- Enter the cell entries for the observed matrix, and press 2nd QUIT.
- Press STAT, highlight TESTS, and select C: χ2-Test.
- With the cursor after Observed:, enter matrix [A] by accessing the MATRIX menu, highlighting NAMES, and selecting 1:[A].
- With the cursor after Expected:, enter matrix [B] by accessing the MATRIX menu, highlighting NAMES, and selecting 2:[B].
- Highlight Calculate or Draw, and press ENTER.
What if the requirements for performing a chi-square test are not satisfied?
The researcher has one of two options:
(1) combine two or more columns (or rows) to increase the expected frequencies or
(2) increase the sample size.