Chapter 4 Flashcards

1
Q

data in which two variables are measured on each individual

A

bivariate data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

graph that shows the relationship between two quantitative variables measured on the same individual - represented by points

A

scatter diagram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

when above-average values of one variable are associated with above-average values of the other variable (or below/below)

A

positive association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

measure of the strength and direction of the linear relation between two quantitative variables - not resistant to outliers

A

linear correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

relation that creates a U shape on the scatter diagram

A

quadratic relation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

makes residuals as small as possible

A

least-squares regression line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

difference between predicted value and actual value

A

residual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

point where the line intersects the vertical axis

A

y-intercept

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

predicting outside the range of the x-values in sample data - can result in bad predictions, not recommended

A

extrapolation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

proportion of variation in the response variable that is explained by the least-squares regression line - number between 1 and 0 (0 means no explanatory value, 1 means 100% explanation)

A

coefficient of determination (R2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

differences between predicted value and actual value as a result of other incidental factors and random error

A

deviations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

deviation due to difference between observed value and mean value of response variable (unexplained + explained)

A

total deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

deviation due to difference between predicted value and mean value of response variable (below line)

A

explained deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

deviation due to difference between observed value and predicted value (above line)

A

unexplained deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

scatter plot where the explanatory variable is plotted on the horizontal axis and the corresponding residual is on the vertical axis - if it shows a discernable pattern, variables may not be linearly related

A

residual plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

constant error variance - if a residual plot shows residuals increasing/decreasing as the explanatory variable increases, linear model is violated

A

homoscedasticity

17
Q

an observation that significantly affects the least-squares regression line’s slope / y-intercept and the correlation coefficient

A

influential observation

18
Q

AKA two-way table - shows relationship between two qualitative variables (row variable and column variable)

A

contingency table

19
Q

boxes in a contingency table that represent the intersections of the row variable and column variable values

A

cell

20
Q

frequency / relative frequency distribution of either the row or column variable in the contingency table

A

marginal distribution

21
Q

lists the relative frequency of each category of the response variable, given a specific value of the explanatory variable in the contingency table

A

conditional distribution

22
Q

when an association between two variables inverts or goes away when a third variable is introduced to the analysis

A

Simpson’s paradox