Regression Line Test Flashcards

1
Q

When iare correlation and linear regression models used for analysis

A

When comparing 2 continuous (interval and ratio).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When are linear regression models used for EDA

A

when the y-variable is continuos

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Pearson’s Correlation Coefficent (R) outline

A

2 continuos variables. Shows how linearlly 2 variables co-vary. The bigger the spread of points the lower the correlation. Note: only shows linear correlation not correlation overall (needs EDA, night be U shaped)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Covariation Outline

A

The squared difference between the observed value of x or y and the mean value of x bar or y bar(horizontal/verical lines on graph). explains how y changes with respect to x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

EDA Test for Association

A

Scatter plot and observe spread

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Correaltion outline

A

(x - xbar) X (y - ybar). Area of square

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Deterministic outline

A

Spread around line is minimal (most points on line). Strong association, low probability of results occuring by chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Probabilistic Outline

A

Spread around line is significant (most points aren’t on lome). Low association, high probability results occured by chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Linearity Outline

A

Constant change in y for every change in x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How is variability around line expressed

A

Probability Distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Regression Line Outline

A

Line of points of all predicted values of y for every value of x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How to evaluate line of best fit for data set

A

Least Square Distance Criterion. Minimise the area of the square formed from the expected and observed and expected y values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

y = mx + c

A

yi = Beta1(xi) + Beta0 + epsilon1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Beta 0

A

y-intercept. value of y when x = 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Beta1 Outline

A

Slope of line. How much y changes for every increase of x. Scale dependent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Epsilon Outline

A

Error. distance from expected to observed values

17
Q

Explained Variance (Signal)

A

Changes undergone by y once x is changed. Beta 0, Regression

18
Q

Unexplained Variance (noise)

A

Difference between observed and expected y. Epsilon, Residual

19
Q

Why you would choose regression line model over correlation

A

Regression line equation provides more information

20
Q

Linear Regression Assumptions

A

Linear relationship between x and y, error (epsilon) is normally distributed, for each x value error variance is the same, errors are independent of eachother