Lecture 13 - Regression Analysis Flashcards
What kinds of mathematical models can be used in regression analysis?
Models ranging from very simple (few parameters) to very complex (many parameters)
What is regression analysis?
A technique to fit a mathematical model to data
What is the simplest non-trivial model that can be used in regression analysis?
The linear model - y = a + bx
What are the regression coefficients in the linear model?
a and b
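As a rough illustration of where a and b come from, the sketch below fits y = a + bx by ordinary least squares, the usual fitting criterion for a linear regression. The function name fit_line and the data are made up for this example and are not from the lecture.

```python
def fit_line(xs, ys):
    """Return (a, b) for the least-squares line y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sum of squares about the means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx              # slope
    a = mean_y - b * mean_x    # intercept
    return a, b


# Made-up data lying exactly on y = 1 + 2x
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]
a, b = fit_line(xs, ys)
print(a, b)
```

With real, noisy data the points will not sit exactly on the line, but a and b are obtained in the same way.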
What can regression analysis be used for?
Predicting the value of y for a given value of x - extrapolation/interpolation
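A minimal sketch of prediction with a fitted line, assuming hypothetical coefficients a = 1 and b = 2 and an observed x range of 0 to 3. The helper predict is an illustrative name only. It labels a prediction as interpolation when x lies inside the observed range and as extrapolation when it lies outside.

```python
def predict(a, b, x, x_min, x_max):
    """Predict y = a + b*x and say whether x is inside the observed x range."""
    y = a + b * x
    kind = "interpolation" if x_min <= x <= x_max else "extrapolation"
    return y, kind


# Hypothetical coefficients and observed x range (not from the lecture)
a, b = 1.0, 2.0
y_in, kind_in = predict(a, b, 2.5, 0.0, 3.0)     # x inside the data range
y_out, kind_out = predict(a, b, 10.0, 0.0, 3.0)  # x outside the data range
print(y_in, kind_in)
print(y_out, kind_out)
```

Extrapolated values rely on the linear model still holding outside the range of the data, so they deserve more caution than interpolated ones.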
How do you carry out a regression analysis on SPSS?
Analyse|Regression|Linear… Then, under Statistics, tick ‘estimates’ and ‘confidence intervals’.
You need to know which way the dependency goes, i.e. which variable is the DV and which is the IV.
In a regression analysis on SPSS, what are the different outputs and of those, which are useful?
variables entered * model summary (r and r^2) ANOVA *** coefficients (a and b) residuals
In a regression analysis SPSS output, what does the R Square mean?
It is the proportion of the variance in the DV that is explained by the IV, usually read as a percentage. E.g. .496 = 49.6%
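To make the meaning of R Square concrete, the sketch below computes it by hand as 1 minus the ratio of the residual sum of squares to the total sum of squares, using a least-squares line. The function name r_squared and the data are made up for illustration.

```python
def r_squared(xs, ys):
    """Proportion of the variance in y explained by the line y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx
    a = mean_y - b * mean_x
    # Variation left over after fitting the line
    ss_res = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    # Total variation of y about its mean
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Made-up data
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
r2 = r_squared(xs, ys)
print(f"{r2:.1%}")  # prints 60.0%
```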
Using the coefficients regression analysis output box on SPSS, how would you predict values of y for x?
y = a + bx
In the unstandardised coefficients (B) column, the top value (the constant) is a and the bottom value (the IV) is b. The confidence intervals for a and b can also be used to give the likely range.
What is the difference between correlations and regression analysis?
A correlation shows whether there is a relationship and how much of the data is noise, but not what the relationship is. A regression analysis tells you what the relationship is, but not whether it is real: it will always yield coefficients, even if they are meaningless.
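The point that a regression always yields coefficients can be seen in a small sketch. The data below are made up and have almost no linear relationship, yet the fit still returns values for a and b. Only the correlation (here r squared) reveals how little they mean. The function name fit_and_correlate is illustrative.

```python
import math


def fit_and_correlate(xs, ys):
    """Return (a, b, r): least-squares intercept and slope, plus Pearson's r."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    b = sxy / sxx
    a = mean_y - b * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return a, b, r


# Made-up data with almost no linear relationship
xs = [1, 2, 3, 4, 5, 6]
ys = [3, 1, 4, 1, 5, 2]
a, b, r = fit_and_correlate(xs, ys)
print(f"a = {a:.3f}, b = {b:.3f}, r^2 = {r * r:.3f}")
```

The regression reports a slope of about 0.11, but r squared is under 2%, so the line explains almost none of the variation.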
What does a regression analysis with multiple IVs look at?
Each IV’s contribution to the DV and the plane of best fit.
What is the regression analysis equation for two IVs?
z = ax + by + c
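As a sketch of how the plane of best fit could be found, the code below solves the least-squares equations for z = ax + by + c directly, using sums of squares and cross-products about the means. The function name fit_plane and the data are made up, and the approach assumes the two IVs are not perfectly correlated (otherwise the denominator is zero).

```python
def fit_plane(xs, ys, zs):
    """Return (a, b, c) for the least-squares plane z = a*x + b*y + c."""
    n = len(xs)
    mx, my, mz = sum(xs) / n, sum(ys) / n, sum(zs) / n
    # Sums of squares and cross-products about the means
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxz = sum((x - mx) * (z - mz) for x, z in zip(xs, zs))
    syz = sum((y - my) * (z - mz) for y, z in zip(ys, zs))
    det = sxx * syy - sxy ** 2  # zero if x and y are perfectly correlated
    a = (sxz * syy - syz * sxy) / det
    b = (syz * sxx - sxz * sxy) / det
    c = mz - a * mx - b * my
    return a, b, c


# Made-up points lying exactly on z = 2x + 3y + 1
xs = [0, 1, 0, 1, 2]
ys = [0, 0, 1, 1, 1]
zs = [1, 3, 4, 6, 8]
a, b, c = fit_plane(xs, ys, zs)
print(a, b, c)
```

Each of a and b is the contribution of its own IV to the DV with the other IV held fixed.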
How do you carry out a multiple IV regression analysis in SPSS?
Analyse|Regression|Linear
How is the output for a multiple IV regression analysis in SPSS different to the output of a single IV analysis?
There are additional coefficients, still in alphabetical order though!
What is the problem with using more than one or two IVs in a regression analysis?
As the number of variables increases, it becomes harder to portray the data. 3-D graphs are confusing.