Week 9 Flashcards

1
Q

What is Regression Analysis

A

Study of the relationship between variables, how a dependent variables depends on the independent variable(s)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Causal Relationship?

A

An increase in variable X will cause variable Y to change, use random experiment/instruments to prove

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What to do with outliers data?

A

If it’s clearly not a part of the population interest, delete them
If unsure, run the analysis with and without the outliers and present both

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is correlations?

A

Measures that indicate the strength of linear relationships between pairs of variables; usually between -1 and 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the limitation of correlations?

A

Only for linear relationships, if correl is close to zero, could be because of non-linear relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a fitted value?

A

Predicted value of the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is “e”, Y, and Y hat?

A

Y = Actual Value
Y hat = Fitted Value
e = Residual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is standard error of estimate?

A

Indicates the level of accuracy of predictions, the smaller the better

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is R Square?

A

Fraction of variation of the dependent variable, value is 0-1, the closer to 1, the more explainable it is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How the interpretation of the slope coefficient differs between simple regression and multiple regression?

A

Simple regression: If the independent variable change by 1 unit, the dependent change by … unit
Multiple regression: If independent variable X changes by 1 unit, EVERYTHING ELSE CONSTANT, the dependent variable increase by b1

Formula for multi: Predicted Y = a + b1X1 + b2X2 + …

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

One flaw of R Square and how to fix it

A

R Square always increase as independent variables increase, thus use Adjusted R Square

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do we know if we successfully adding variables to the regression model?

A

Check adjusted R Square (Increase)
Check standard error of estimates (decrease)

If R Square decreases and/or standard error of estimates increases, this means that the newly added variable is BAD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do we navigate categorical variables (male/female)?

A

Use Dummy Variable, 0/1, not or yes
If more than 2 categories, make different dummy variables (but must be number of category -1)
e.g. Q1, Q2, Q3, Q4 -> Q1 1/0, Q2 1/0, Q3 1/0 ONLY

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a general linear equation look like?

A

Predicted Y = a + b1X1 + b2X2 + …
Note: Can be: LogX2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is nonlinear transformation?

A

It is when there is dependent or independent variables that is not an original data from the dataset
E.g. When they are interaction variables which are variables that resulted from products of or transformed from the original variables (?)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is and what are some typical nonlinear transformations?

A

When you can transform the dependent variable Y or any independent/explanatory variables
E.g. Natural logarithm (ln), square root, reciprocal, square
Purpose is to straighten out points in scatterplot

17
Q

What is instrument variables?

A

Variable that should be dependent to the variable but not related to other variables
e.g. number of neighbor country with number of export & import; but not with the GDP