Exercise insights Flashcards

1
Q

How do you calculate correlation in libre office calc

A

Mark the columns you want to correlate. In header click Data -> Statistics -> correlation. Otherwise you can use the correl function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is the correlation coefficient not the slope of an estimated line of equation for the relationship between two variables

A

Because the correlation coefficient is a measure of the strength of their relationship aka how well you can make a line of the scattered dots not how that line points, that is the job of the regression line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Is the concept of dependent and independent variables related to correlation or regression

A

Regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Is correlation a measure of how large the covariance is between two variables

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do you get the estimated slope coefficient of two variables in libre office calc

A

From the header you click Data -> Statistics -> Regression, you then mark the dependent and independent variable and where you want the result wereafter you get the slope coefficient under Coefficients next to X1 if you only have two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does it mean when we say that the estimated slope coefficient is statistically significant.

A

That there is strong evidence that the relationship between the independent variable and the dependent variable in a regression model is not due to random chance. A relationship is statistically significance if the p-value is low and there is no third factor that influences both

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Can a regression model as a whole be statistically insignificant

A

Yes if the independent variables do not collectively explain a significant portion of the variation in the dependent variable. This can be due to a small sample size, noise in the data, a strong correlation between the independent variables or simply a weak relationship between the independent variables and the dependent ones.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What happens when the slope coefficient is equal to zero

A

It means there is no relationship between the variables detectable with a regression model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

If the p-value is high and the t-value is low is the relationship statistically significant

A

No if the t-value is high it indicates a strong relationship between the variables and when the p-value is low it means that the chance of observing such a t-value under the null hypothesis is very small. So if the t-value is high and the p-value is small it means that the null hypothesis of b being 0 is probably false and the model is statistically significant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do you insert a trend line and function in a scatter plot in libre office calc

A

You create a scatter plot by marking the related columns and clicking the chart icon were-after you choose scatter-plot. Then after you have made your scatter-plot you can right click on the dots you want a trend-line of and choose trend line and function in the drop-down menu.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The residual is the diference between the average dependent variable and the declared dependent varaible

A

No, thats the variance of that instance. The residual is the diference between the prediction of the estimator and the actual value at that point.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

In a quadratic regression model the deriviative can be interpreted as the increase when x changes by one unit

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

In a linear probability model one can count on the model being restricted between 0 and 1

A

False, lines continue to infinity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In which regression models does B1/100 represent a UNIT change in y at a 1% change in x. Linear, log-log, exponential or logarithmic

A

Only logarithmic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

In the log log model the coeficient is…

A

The expected percentage change in y when x changes by 1%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

If the correlation is -0.87 and one variable increases by 1 the expected change in the other variable is -0.87

A

False becouse correlation does mesure their relationship but the strength of their covariance. The only thing we get is that the relationship is a strong negative one.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

If the given regression model is y = b1x + b2x + b3x + b0 + e how do you make a restricted function for a restriction test

A

y = b1(x1+x2) +b3x + b0 + e

18
Q

If you answer that one is 2.5 percent likelyer for each year in education to be employed in a linear regression question why are you automatically wrong

A

Becouse percentage changes implies a logarithmic relationship. Name percentage points instead.

19
Q

How do you write a hypothesis test of joint significance

A

H0: B1 = B2… Bn = 0
HA: any B value is not 0

20
Q

How does the logit model look, what if the beta is 0.5

A

the logit model looks like an s and if the beta is 0.5 that is what multiplies the exponentiation both in the top and the bottom

21
Q

What assumptions of the OLS estimator is required for it to be unbiased

A

It needs to be linear in the parameters, it needs to fufill E(u|x) = 0 and it needs to lack perfect collinearity.

22
Q

What assumptions of the OLS estimator is required for the standard error and thus T and F tests to be valid.

A

That the samples shuld be independent from an identical distribution i.i.d and that there should be no large outliers.

23
Q

What assumptions is required for the OLS estimator to be BLUE the best linear unbiased estimator

A

it needs the errors to be homoskedastic for the Gaus Markov theorem to hold true and it needs the errors to be normally distributed

24
Q

Is the law of large numbers about consistency

A

Yes, and consistency means that the probability of the expected value of the estimator being that of the population approaches 1 as the sample size increases

25
Q

The central limit theorem is about consistency

A

No its about the distribution aproaching normal as the sample size increases.

26
Q

i.i.d leads to consistency

A

True

27
Q

What makes an estimator good

A

It is unbiased E(sample) = E(population, it is consitents E(E(sample) = E(population)) -n-> 1, an dit is efficient. var = small

28
Q

What may cause E(u|X) = 0 no endogineity to not hold

A

If there is correlation, or non linear relation between x and the residual or if there is a third factor that influences them both

29
Q

If you want your estimator to be efficient should you pick a sample with a larger or smaller variance

A

A larger variance becouse the variance of b decreases with the variance of X. It also decreases with the variance of n

30
Q

The requirement for an omitted variable to cause bias is

A

Correlation with model variables and an effect on the response variable

31
Q

What is an idea randomized controlled experiment

A

Ideal as subjects follow protocol perfectly. Randomized as i.i.d sampling. Controlled as having a control group for an effect without a cause. And lastly an experiment where people are not vaulentearing which would cause selction bias and reverse causality.

32
Q

What will large but imperfect multicolinearity lead to

A

Large standard errors for one or more of the regressors

33
Q

How can you make the OLS estimator be as good as possible. Make it artificially BLUE

A

You can use robust stadard errors for the gauss markov theorem to hold by nullifying the advers effect of heteroskedacity and you can rely on the central limit theorem to make the errors normally distributed when n is large.

34
Q

are the coefficients in a multiple regression model generally independently distributed

A

No they are not as they often are somewhat correlated their effect on y becomes muddled.

35
Q

F test assumes homoskedasity

A

True

36
Q

How do you make a hypothesis test to check if a relation is polynomial of a higher order than one.

A

You make an F test on the higher order variables.

37
Q

Can adding a non-linear regressor introduce multicolinearity

A

Yes although it is often not perfect as the relation is not linear although an F-test might be needed.

38
Q

What is probit

A

Using the cumulative normal distibution function to check the likelyhood of Y being zero dependent on x. Pr(Y=1|x) = C(b0+b1x). b0+b1x is the z-value

39
Q

logit gives similar results to probit

A

True

40
Q

Is R² menaingful in probability models

A

No

41
Q
A