module 11 regression analysis Flashcards
Adjusted R2
- Penalises excessive use of unimportant independent variables
- example: Adjusted R2 of 0.8720 means 87.2% of the variation is explained by the variation in X1, X2, X3….Xk(all X values) considered together, taking into account the sample size and number of independent variables
standard error
can be compared to the average y value
Hypothesis
a supposition or proposed explanation made on the basis of limited evidence as a starting point for further investigation
Null Hypothesis
– a type of hypothesis used in statistics that proposes that no statistical significance exists in a set of given observations. The null hypothesis attempts to show that no variation exists between variables or that a single variable is no different than its mean
-It is presumed to be true until statistical evidence nullifies it for an alternative hypothesis.
Alternative Hypothesis
– the hypothesis used in hypothesis testing that is contrary to the null hypothesis. It is usually taken to be that the observations are the result of a real effect
Dummy variable
– also known as an indicator variable or quantitative variable is one that takes the value of 0 or 1 to represent a non-numerical variable e.g. smoker/non-smoker