Unit 2 Flashcards

1
Q

bivariate data

A

data that records two variables for each individual, so the relationship between the two can be studied

2
Q

explanatory variable

A

the independent variable or the variable that impacts the other variable; resides on the x axis

3
Q

response variable

A

the dependent variable or the variable that is impacted by the other variable; resides on the y axis

4
Q

correlation coefficient

A

a number between -1 and 1 that tells us the strength and direction of a linear relationship. It can be computed as r = Σ(z_x · z_y) / (n - 1), the sum of the products of the z-scores divided by n - 1
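
A minimal sketch of the z-score formula in Python, assuming sample standard deviations (dividing by n - 1); the function name `correlation` and the data are made up for illustration.

```python
import math

def correlation(xs, ys):
    """Correlation r as the sum of z-score products divided by n - 1."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sample standard deviations (assumption: divide by n - 1).
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / (n - 1))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / (n - 1))
    # Multiply each pair of z-scores, add them up, divide by n - 1.
    total = sum(((x - mean_x) / sd_x) * ((y - mean_y) / sd_y)
                for x, y in zip(xs, ys))
    return total / (n - 1)

# Illustrative data: a perfectly increasing line and a perfectly decreasing one.
r_up = correlation([1, 2, 3, 4], [2, 4, 6, 8])
r_down = correlation([1, 2, 3], [3, 2, 1])
print(r_up, r_down)
```

Points that fall exactly on an increasing line give r close to 1, and points exactly on a decreasing line give r close to -1.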

5
Q

Least Squares Regression Line

A

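the best-fit line for a scatterplot, chosen to make the sum of the squared residuals as small as possible; written ŷ = mx + b, where m is the slope and b is the y-intercept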
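
A hedged sketch of how m and b might be computed by hand, using the standard least-squares formulas (slope = Sxy / Sxx, intercept = ȳ - m·x̄); the function name `lsrl` and the data are hypothetical.

```python
def lsrl(xs, ys):
    """Return (m, b) for the least squares line y-hat = m*x + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sxy: how x and y vary together; Sxx: how x varies on its own.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    # The least squares line always passes through (mean_x, mean_y).
    b = mean_y - m * mean_x
    return m, b

# Illustrative data only.
m, b = lsrl([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
print(m, b)  # approximately 0.6 and 2.2
```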

6
Q

sum of squared errors

A

the total of the squared residuals; it measures how far the points are from the line. The smaller the sum, the better the line fits

7
Q

residual or error

A

observed y - predicted y (y - ŷ)
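
A small sketch tying residuals to the sum of squared errors, assuming a line ŷ = 0.6x + 2.2 has already been fitted; the function names, the line, and the data are illustrative assumptions.

```python
def residuals(xs, ys, m, b):
    """Residual for each point: observed y minus predicted y."""
    return [y - (m * x + b) for x, y in zip(xs, ys)]

def sum_squared_errors(xs, ys, m, b):
    """Square every residual and add them up."""
    return sum(e ** 2 for e in residuals(xs, ys, m, b))

# Illustrative data and an assumed fitted line y-hat = 0.6x + 2.2.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
errors = residuals(xs, ys, 0.6, 2.2)
sse = sum_squared_errors(xs, ys, 0.6, 2.2)
print(errors)  # roughly -0.8, 0.6, 1.0, -0.6, -0.2
print(sse)     # roughly 2.4
```

A positive residual means the point sits above the line; a negative one means it sits below.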

8
Q

extrapolation

A

using the regression line to predict y for x-values outside the range of the observed data; these predictions are unreliable

9
Q

lurking variable

A

a hidden variable that could be impacting both the explanatory and response variables

10
Q

leverage

A

a point has high leverage when its x-value is unusually far from the rest of the x-values

11
Q

influential point

A

an outlier or high-leverage point whose removal would noticeably change the slope or position of the LSRL
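
One way to check for influence is to fit the line with and without the suspect point and compare slopes. The sketch below assumes the usual least-squares slope formula; the helper `slope` and the data (including the extra point at x = 20) are invented for illustration.

```python
def slope(xs, ys):
    """Least squares slope: Sxy / Sxx."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx

# Illustrative data; the last point (20, 0) is far out in x (high leverage).
xs = [1, 2, 3, 4, 5, 20]
ys = [2, 4, 5, 4, 5, 0]

slope_with = slope(xs, ys)
slope_without = slope(xs[:-1], ys[:-1])  # refit after dropping the last point
print(slope_with, slope_without)
```

In this made-up data the slope is negative with the point and positive without it, so the point would be called influential.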

12
Q

coefficient of determination (R^2)

A

the percent of the variability in the y variable that can be explained by the linear relationship with the x variable
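
A hedged sketch of computing R^2 as 1 - SSE / SST, where SST is the total squared variation of y around its mean; the function name `r_squared` and the data are assumptions for illustration.

```python
def r_squared(xs, ys):
    """R^2 = 1 - (sum of squared residuals) / (total variation in y)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Fit the least squares line first.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    b = mean_y - m * mean_x
    # Variation left over after using the line.
    sse = sum((y - (m * x + b)) ** 2 for x, y in zip(xs, ys))
    # Variation in y before using x at all.
    sst = sum((y - mean_y) ** 2 for y in ys)
    return 1 - sse / sst

# Illustrative data only.
r2 = r_squared([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
print(r2)  # roughly 0.6, i.e. about 60% of the variability in y
```

For a least squares line, this value equals the square of the correlation coefficient.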

13
Q

regression to the mean

A

the tendency for extreme occurrences to be followed by more average ones

14
Q

scatterplot

A

a graph that shows the relationship between two quantitative variables by plotting each pair of values as a point

15
Q

DOFS

A

the four features to describe in a scatterplot: Direction, Outliers, Form, Strength

16
Q

Residual Standard Deviation

A

measures how much the points spread around the LSRL
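
The card gives no formula; a sketch using the common textbook definition s = sqrt(SSE / (n - 2)) is shown below. The divisor n - 2, the function name, and the residual values are assumptions for illustration.

```python
import math

def residual_sd(residual_list):
    """Residual standard deviation: sqrt(sum of squared residuals / (n - 2))."""
    n = len(residual_list)
    sse = sum(e ** 2 for e in residual_list)
    # Assumption: n - 2 in the denominator, as in most intro stats texts.
    return math.sqrt(sse / (n - 2))

# Illustrative residuals from some fitted line.
s = residual_sd([-0.8, 0.6, 1.0, -0.6, -0.2])
print(s)  # roughly 0.89
```

The result can be read as the typical distance between an observed y and the value the line predicts.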

17
Q

Re-Expressing

A

transforming the data to make the form of a scatterplot more nearly linear, usually by taking the square root or log of y (sometimes of the x variable too)
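
A minimal sketch of re-expressing with a log, assuming y grows by doubling; the helper `correlation` and the data are made up for illustration. Comparing r before and after shows the form becoming more linear.

```python
import math

def correlation(xs, ys):
    """Correlation r, computed as Sxy / sqrt(Sxx * Syy)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Illustrative curved data: y doubles each time x goes up by 1.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 8, 16, 32]

# Re-express y by taking its log, then compare the two correlations.
log_ys = [math.log(y) for y in ys]
r_raw = correlation(xs, ys)
r_log = correlation(xs, log_ys)
print(r_raw, r_log)
```

Here the log of y is exactly linear in x, so r after re-expressing is essentially 1, while r for the raw data is lower.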

18
Q
A