Stats Test 3 / Final Flashcards

1
Q

What category of relationships is needed for Linear Regression?

A

Q -> Q

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does Linear Regression not represent?

A

Steepness of relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are x and y generally?

A

x = explanatory variable
y = response variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is regression notation?

A

y-hat = a + bx

a = intercept
b = slope
y-hat = predicted y-value (mean of y for given x)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Words for regression

A
  • best-fitting line
  • least squares line
  • least squares regression line
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Linear regression minimizes what?

A

vertical residual (distance between line and point up and down)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the equation for residual error or prediction error

A

y - y-hat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

“Simple” formulas for slope and intercept

A

b = r (sd-y / sd-x)
a = Y-bar - b*X-bar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is extrapolation?

A

use of regression line to estimate mean of y for x far outside x-range of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Simpson’s Paradox

A

It is just a bias introduced by failing to account for the lurking variable—an arithmetic phenomenon in the calculus of proportions:
(a + b / c + d) > or < a / c and b / d

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Formula for population regression line

A

µy = α + βx

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

CONDITIONS OF REGRESSION MODEL:

A

L I N E:
Linearity: scatterplot should have a linear form
Independence: data come from random samples or a randomized experiment
Normality: no outliers in histogram of residuals
Equal Population Stan. Dev.: no megaphone pattern in scatterplot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Sampling distribution of b

A

(b - β) / SEb
(T- statistic) with df = n - 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

inference for β

A

Confidence interval
b ± t * SEb

Test of significance:
(b - β) / SEb

Hypotheses:
H0: β = β0
Ha: β =/= β0
β < β0
β > β0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Properties of r

A
  • Ranges between -1 and 1
  • Doesn’t relate to slope, only correlation
  • Does not have units of measurement
  • Does not change when units of measurement of either one of the variables change
  • Makes no distinction between explanatory and response variables
  • Heavily influenced by outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Guestimations of r

A

Football shape: 0.6
Hotdog: 0.8

17
Q

What measures the “badness” of a line?

A

sum of square errors