AP Final Flashcards
Inference for Proportions (Conditions)
Random: Data from a random sample(s) or randomized experiment
Normal: At least 10 successes/failures (in both groups, for a two sample problem)
Independent: Independent observations and independent samples/groups; 10% condition if sampling without replacement
Interpret ‘r-squared’
___ % of the variation in y (context) is accounted for by the least squares regression line of y (context) on x (context)
Inference for Means (Conditions)
Random: Data from a random sample(s) or randomized experiment
Normal: Population distribution is normal or large (greater than/equal to 30)
Two Sample t-test (Conclusion)
Fail to reject/reject the null hypothesis (state). We do/do not have enough evidence at the 0.05 level to conclude that the difference between the mean ____ for all ___ and the mean ___ for all ___ is ___.
Chi-Square Tests (df and Expected Counts)
- Goodness of Fit: df = # of categories - 1
Expected Counts: Sample size times hypothesized proportion in each category - Homogeneity: df = (#rows-1)(#columns-1)
Expected Counts: (row total)(column total)/table total
What is Goodness of Fit Chi-Square Test used for?
Use to test the distribution of one group or sample as compared to a hypothesized distribution
What is a Homogeneity Chi-Square Test used for?
Use when you have a sample from 2 or more independent population or 2 or more groups in an experiment. Each individual must be classified based upon a single categorical variable.
What is an Association/Independence Chi-Square Test used for?
Use when you have a single sample from a single population. Individuals in the sample are classified by two categorical variables.
Factors that Affect Power
- Sample size: To increase power, increase sample size
- Increase alpha: A 5% test of significance will have a greater chance of rejecting the null than 1% test.
- Consider an alternative that is garter away from mu naught: Values of mu that are in mu alternative, but lie close to the hypothesized value are harder to detect than values of mu that are far away from mu naught.
Type 1 Error
Rejecting mu naught when mu naught is actually true
Type 2 Error
Failing to reject mu naught when mu naught should be rejected
What is power?
The probability of rejecting mu naught when mu naught should be rejected (rejecting correctly).
Inference for Regression (Conditions)
L-I-N-E-R
Linear: The relationship between the variables is linear
Independent: Observations, 10% condition if sampling without replacement
Normal: Responses vary normally around the regression line for all x-values
Equal Variance around regression line for all x-values
Random: Data from a random sample or randomized experiment
Chi-Square Tests (Conditions)
Random: Data from a random sample(s) or randomized experiment
Large Sample Size: All expected counts are at least 5
Independent: Independent observations and independent sample/groups; 10% condition if sampling without replacement
Can we generalize results to the population of interest?
Yes, if a large random sample was taken from the sample population we hope to draw conclusions about.