Midterm Flashcards
What is the total area under the density curve?
100% or 1
What percent of observations will fall within 1 standard deviation of the mean of an approx. Normal distribution?
68%
What percent of observations will fall within 2 standard deviation of the mean of an approx. Normal distribution?
95%
What percent of observations will fall within 3 standard deviation of the mean of an approx. Normal distribution?
99.7%
What is “r”?
the correlation r measures the strength of a linear relationship
What values can r take? What does it mean when r is less than 0?
r can take any value from -1 to 1. If it’s less than zero, it describes a negative correlation.
What does it mean when r = 1? r = -1?
perfect correlation/points on a scatterplot lie exactly on a straight line; r = -1 means perfect negative correlation
What is the slope b of a regression line y-hat = a + bx?
the predicted change in y-hat when x increases by 1 unit
What does the standard deviation of the residuals measure?
typical size of prediction errors when using the regression line
What does the coefficient of determination r^2 measure?
fraction of the variation in the response variable that is accounted for by the least-squares regression on the explanatory variable
Define influential observations
Individual points that substantially change the correlation or the regression line; outliers are often influential for the regression line
The least squares regression line of Y on X is the line with slope B= ? and intercept A = ?
r(Sy / Sx)
YMean - bXMean
What are the 4 basic principles of experimental design?
- Comparison: use a design that compares 2 or more treatments
- Random assignment: use chance to assign experimental units to treatments. This helps create roughly equivalent groups before treatments are imposed.
- Control: keep as many other variables as possible the same for all groups. Control helps avoid confounding and reduces the variation in responses, making it easier to decide whether a treatment is effective.
- Replication: impose each treatment on enough experimental units so that the effects of the treatments can be distinguished from chance differences between the groups.
Describe a randomized block design
A randomized block design forms groups of experimental units that are similar with respect to a variable that is expected to affect the response. Treatments are assigned at random with in each block. Responses are then compared with in each block and combined with the responses of other blocks after accounting for the differences between the blocks.
Describe a matched pairs design
A matched pairs design is a common form of blocking for comparing just two treatments. And some matched pairs designs each subject receives both treatments in a random order. And others to very similar subjects are paired and the two treatments are randomly assigned within each pair