Econometrics Flashcards
What would the hypothesis test be for a positive correlation?
H0: ß ≤ 0
H1: ß > 0
What does TSS stand for?
Total Sum of Squares. The total sum of squares is a variation of the values of a dependent variable from the sample mean of the dependent variable.
What does RSS stand for?
In statistics, the residual sum of squares (RSS), also known as the sum of squared estimate of errors (SSE), is the sum of the squares of residuals (deviations predicted from actual empirical values of data). It is a measure of the discrepancy between the data and an estimation model, such as a linear regression.
What does ESS stand for?
Explained Sum of Squares. Explained sum of square (ESS) or Regression sum of squares (RSS) or Model sum of squares is a statistical quantity used in modeling of a process. ESS gives an estimate of how well a model explains the observed data for the process.
It tells how much of the variation between observed data and predicted data is being explained by the model proposed. Mathematically, it is the sum of the squares of the difference between the predicted data and mean data.
What is the formula that links R-Squared with ESS and TSS?
R-Squared = ESS/TSS
What is the formula that links RSS, TSS, and ESS?
RSS = TSS - ESS
TSS = RSS + ESS
ESS = TSS - RSS
What are the 4 steps to perform a hypothesis test?
1) Hypothesis - State the hypotheses:
Null hypothesis (H0): This is the default assumption or claim that you want to test. It represents no effect or no difference.
Alternative hypothesis (Ha): This is the claim or effect you want to support if there is sufficient evidence against the null hypothesis. It represents the direction of the effect or difference.
2) Test Statistic - Calculate the test statistic by first calculating the standard error
3) Critical Value - Find the critical region by calculating the critical value using the significance level, type of tail test, and degrees of freedom
4) Decision - Compare the test statistic to the critical value or p-value.
If the test statistic falls in the critical region (beyond the critical value) or the p-value is smaller than the significance level (α), reject the null hypothesis.
If the test statistic falls in the non-critical region (within the critical value) or the p-value is larger than the significance level (α), fail to reject the null hypothesis.
Which test would one use to assess the statistical significance of the regression coefficients?
T-Test
Which test would one use to assess if the the overall regression is statistically significant?
F Test
What are the 6 steps to conduct a T-Test?
1) State the null hypothesis (H0) and the alternative hypothesis (Ha):
2) Calculate the test statistic:
Calculate the sample means (x̄1 and x̄2) for group 1 and group 2.
Calculate the sample standard deviations (s1 and s2) for group 1 and group 2.
Calculate the standard error of the difference between the means using the formula:
SE = sqrt[(s1^2 / n1) + (s2^2 / n2)], where n1 and n2 are the sample sizes of group 1 and group 2, respectively.
3) Calculate the t-value using the formula:
t = (x̄1 - x̄2) / SE
4) Determine the degrees of freedom (df):
The degrees of freedom can be calculated using the formula:
df = n1 + n2 - 2, where n1 and n2 are the sample sizes of group 1 and group 2, respectively.
5) Determine the critical value or p-value:
Look up the critical value from the t-distribution table based on the chosen significance level and degrees of freedom.
6) Decision
If the absolute value of the t-value is greater than the critical value, reject the null hypothesis.
If the p-value is smaller than the significance level (α), reject the null hypothesis.
What are the 6 steps to conduct an F Test?
1) State the null hypothesis (H0) and the alternative hypothesis (Ha):
2) Determine the sample sizes (n1, n2, …, nk) and the sample variances (s1^2, s2^2, …, sk^2).
3) Calculate the test statistic:
Calculate the ratio of the largest sample variance to the smallest sample variance:
F = (s1^2 / s2^2) or (s2^2 / s1^2), depending on which variance is larger.
Note: The order of calculation for the ratio is important, as F is always a positive value.
4) Determine the degrees of freedom:
The degrees of freedom for the numerator (df1) is equal to the number of groups minus 1 (k - 1).
The degrees of freedom for the denominator (df2) is equal to the total sample size minus the number of groups (n - k), where n is the total sample size.
5) Determine the critical value or p-value:
Look up the critical value from the F-distribution table based on the chosen significance level, df1, and df2.
6) Decision
If the F-value is greater than the critical value, reject the null hypothesis.
If the p-value is smaller than the significance level (α), reject the null hypothesis.
Define the F Test
An F-test is used to test whether two population variances are equal. The null and alternative hypotheses for the test are as follows:
H0: σ12 = σ22 (the population variances are equal)
H1: σ12 ≠ σ22 (the population variances are not equal)
The F test statistic is calculated as s12 / s22.
If the p-value of the test statistic is less than some significance level (common choices are 0.10, 0.05, and 0.01), then the null hypothesis is rejected.
Define the T Test
A two sample t-test is used to test whether or not the means of two populations are equal.
A two-sample t-test always uses the following null hypothesis:
H0: μ1 = μ2 (the two population means are equal)
The alternative hypothesis can be either two-tailed, left-tailed, or right-tailed:
H1 (two-tailed): μ1 ≠ μ2 (the two population means are not equal)
H1 (left-tailed): μ1 < μ2 (population 1 mean is less than population 2 mean)
H1 (right-tailed): μ1> μ2 (population 1 mean is greater than population 2 mean)
The test statistic is calculated as:
Test statistic: (x1 – x2) / sp(√1/n1 + 1/n2)
where x1 and x2 are the sample means, n1 and n2 are the sample sizes, and where sp is calculated as:
sp = √ (n1-1)s12 + (n2-1)s22 / (n1+n2-2)
where s12 and s22 are the sample variances.
If the p-value that corresponds to the test statistic t with (n1+n2-1) degrees of freedom is less than your chosen significance level (common choices are 0.10, 0.05, and 0.01) then you can reject the null hypothesis.
What is the formula for TSS?
∑y^2 -n(ȳ)^2
Where:
Σ denotes the sum of
y represents each observed value of the dependent variable
ȳ represents the mean of the dependent variable
What is the formula for standard error?
se(β ̂ ) = √(σ ̂^2/(∑(X-X ̅ )^2 )) = √(σ ̂^2/(∑X^2 -nX ̅^2 ))