L17 - Regression 1 Flashcards

1
Q

what is a model?

A

model is a mathematical, physical or otherwise logical representation of a system that is used to convey real world events or phenomenon.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is the main difference between correlation and regression?

A

correlation just looks at how two variables are related, Regression looks at dependent and independent variables (X,Y) and explains the relationship between them.
regression attempts to make a causal inferance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

how do you form a Hypothesise a regression model between two variables?

A

determine which variable may be dependent on another (which one is X and Y).
then hypothesize the nature of the relationship, positive or negative?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is the equation for the linear regression model?

A

Y = B0 + B1X + E
where:
B0 = y intercept
B1 = lope of the line
X = indipendente variable
E = Error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what are the assumptions for a simple linear regression (4)

A
  1. Data is interval or ratio
  2. the relationship between x and Y is linear
  3. we can identify one variable
  4. the x variables are measured without error
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what does a residual/error plot do?

A

it plots variables residual scores on a graph to see if a trend is visible. if a trend is visible then the data variables understudy may be linked through another variable not considered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

in order for the regression model to be accepted there are four requirements that mus be satisfied

A
  1. the errors must be normally distributed
  2. the errors have a mean of zero
  3. the standard deviation of error is constant for all values of X (homoscedasity)
  4. The set of errors associated with different values of Y are all independent.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is error in a regression model?

A

error is the difference between the observed and predicted value of Y. for each value X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

draw out the sum of squares diagram including Total, Unexplained (error) and total

A

refer to slides

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

why do we find the line of best fit? what does it tell us?

A

the line of best fit is a form of model that is used to predict values of Y for given X that may be outside the dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is the Coefficient of determination?

A

R^2 tells us how much of the variation in Y can be explained by X.

          Explained variation  R^2 =\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_
             Total variation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what does R^2 = 0.67 mean?

A

it means that 67% of the variation in Y can be explained by X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is the error term within a residual plot

A

it is the amount of error within a regression model. depending on the distribution of the error there will be different

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is the error term within a residual plot

A

it is the amount of error within a regression model. depending on the distribution of the error there will be different

How well did you know this?
1
Not at all
2
3
4
5
Perfectly