Exam Three Flashcards

1
Q

Linear equation for one independent variable

A

Y= b0 + b1x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

First step to analysis

A

Constructing a graph of the data (scatterplot)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Deterministic model

A

An exact relationship where there is 1 value of y for every x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Probabilistic model

A

Allows for variability in y at each x value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Least squares criterion

A

When multiple lines can fit scatterplot, the line with better fit is the one where the sum of the squared errors is smaller

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Calculating error (e)

A

1: find vertical distance between a line and a data point

2:sum of squared errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

S(XX)

A

Sum of x(I) - mean (x) ^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

S(xy)

A

Sum (xi-x bar)(yi-y-bar)^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

S(yy)

A

Sum (yi-y bar)^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

B1=

A

B1= sxy/sxx

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

B0=

A

Mean y - b1 * mean x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What do these equations help us do?

A

Find the regression equation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How to examine the utility of a regression

A

Determine percentage of the variation in observed values of y that is explained by x (define both variation and amount of variation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

SST

A

SST: sum of yi-y bar squared
This is a measure of total variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

SSR

A

Sum of y hat - y bar squared
Measure of the amount of variation in the dependent variable explained by regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

SSE

A

Yi- y hat i ^2
Variation in y NOT explained by the regression

17
Q

Relation between SST, SSR and SSE

A

SST = SSE + SSR
(Total = unexplained + explained)

18
Q

R^2 (coefficient of determination)

A

Percentage of variation in dependent explained by fitted regression

Between 0 and 1, near 0 means means it not useful, near 1 means it is

19
Q

R^2 equation

A

SSR/SST

20
Q

Coefficient of determination

A

Measure of how well outcomes are replicated by the model (r^2)

21
Q

MSE

A

Mean squared error. Estimates the average of the squares of errors

MSE= (1/n) sum of (yi - y hat)^2

22
Q

R

A

Aka: correlation coefficient

Measures how strong the linear relationship between x and y is. Ranges between -1 and 1. Strongest on either end, weakest close to 0

R= (sxy)/ sqrt (sxx * syy)
excel: = CORREL (column x, column y)

23
Q

Caution about correlation

A

Correlation can’t prove causation: strong correlation can be produced by chance, effect of 3rd variable, etc.
Near 0 correlation may just mean the variables don’t have a linear relationship.
Outliers can also affect correlation

24
Q

Purpose of sample regression

A

To predict a population regression