Non-linear regression Flashcards
Linear regression. Describe the relationship among variables
- There is a linear relationship between the dependent and independent variables
- A unit increase in the IV results in a constant change in the DV
- You can draw a straight line through the data
- If you have multiple IVs in your model, then each of these will have a linear relationship with the DV. The effects should add up, so you have an additive model (see the sketch below)
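Since the deck only describes SPSS menus, here is a minimal Python sketch of what "additive" means; the variable names (iv1, iv2, dv) and the simulated data are made up for illustration.

```python
# A minimal sketch of an additive linear model with two IVs (illustrative data).
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 200
iv1 = rng.normal(size=n)
iv2 = rng.normal(size=n)
# Each IV has its own linear effect on the DV, and the effects simply add up.
dv = 2.0 + 1.5 * iv1 + 0.8 * iv2 + rng.normal(size=n)

X = sm.add_constant(np.column_stack([iv1, iv2]))  # intercept + the two predictors
model = sm.OLS(dv, X).fit()
print(model.params)     # estimates close to [2.0, 1.5, 0.8]
print(model.rsquared)
```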
Non-linear regression. Describe the relationship between variables
- The "unit increase in IV → constant change in DV" rule doesn't apply here
- You can't draw a straight line through the data
- Your goal is to find the most appropriate non-linear relationship
When should you consider using non-linear regression?
- (1) Look at the data - if the relationship between the IV and DV looks non-linear, then entertain non-linear regression
- (2) Look at residual plots - more efficient than looking at raw data if you have more than one variable
- Residuals should be scattered randomly
- If the residuals form a cone shape, or show a non-linear distribution,
- then you might want to consider a non-linear model (see the sketch below)
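A hedged sketch of the residual-plot check in Python (not the SPSS route described in the deck); the data are simulated so the residuals show an obvious curved pattern, whereas in practice you would plot the residuals from your own model.

```python
# Fit a straight line to data that is actually quadratic, then look at the residuals.
import numpy as np
import matplotlib.pyplot as plt
import statsmodels.api as sm

rng = np.random.default_rng(1)
x = np.linspace(0, 10, 200)
y = 3 + 0.5 * x + 0.4 * x**2 + rng.normal(scale=2.0, size=x.size)  # truly non-linear DV

linear_fit = sm.OLS(y, sm.add_constant(x)).fit()   # fit a straight line anyway

plt.scatter(linear_fit.fittedvalues, linear_fit.resid)
plt.axhline(0, color="grey")
plt.xlabel("Predicted value")
plt.ylabel("Residual")
plt.title("Curved band of residuals -> consider a non-linear model")
plt.show()
```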
What kind of shape is in this residual plot?
cone shape
systematic increase in the spread of residuals the higher/lower the value of the predictor.
an indicator of heteroscedasticity
What kind of shape is in this residual plot?
non-linear distribution of residuals
- lower levels of the predicted value = over-prediction
- middle levels = under-prediction
- higher levels = over-prediction
- a systematic pattern in the predictions
What different ways can you run non-linear regression in SPSS?
- could calculate non-linear predictors by hand
- use non-linear regression routines provided by SPSS
- use curve estimation routines provided by SPSS
these methods can be combined
What does R2 tell us? If it's significant, what does this tell us?
R2 tells us how much of the variance in Y we can explain using a linear model of X (where X is our linear predictor).
If it's significant, then using the linear model of X we explain more variance than we would have using the mean prediction (see the sketch below).
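A small Python sketch of what R2 means: compare the error of the linear model against the error of simply predicting the mean of Y for everyone. The data are simulated for illustration.

```python
# Manual R2 = 1 - SS_residual / SS_total, compared against the library's R2.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
x = rng.normal(size=100)
y = 1.0 + 2.0 * x + rng.normal(size=100)

fit = sm.OLS(y, sm.add_constant(x)).fit()

ss_total = np.sum((y - y.mean()) ** 2)   # error of the mean prediction
ss_resid = np.sum(fit.resid ** 2)        # error of the linear model
print(1 - ss_resid / ss_total)           # manual R2
print(fit.rsquared)                      # matches statsmodels' R2
```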
Well, I ran the linear regression and the p values and stats for the predictor variable looked fine! Should I be reassured I picked the right method?
Nope!! While it can look fine at the statistics level, it might not be at the visual level. Check the residual plots!! That will tell you all you need to know.
Linear regression equation (if we did choose to fit this). What would it be using this data?
Y = c + bX
DV = intercept + (IV * coefficient)
Y = -13.456 + (X * 11.105)
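A tiny sketch of using that fitted equation to make a prediction; the intercept and coefficient are the ones from the card above, but the value of X plugged in is hypothetical.

```python
# Predict Y from the fitted line Y = -13.456 + 11.105 * X (X value is illustrative).
intercept = -13.456
coefficient = 11.105

x = 10.0                          # hypothetical value of the IV
y_pred = intercept + coefficient * x
print(y_pred)                     # -13.456 + 11.105 * 10 = 97.594
```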
When trying to fit a non-linear model, what do you need?
- an idea of what the relationship should be
- you should know some of the equations we may want to fit
- bear in mind that many models could work - they could have a reasonable R2 with a reasonable distribution of residuals. There is not always a definitive CORRECT model
If multiple non-linear models work fine, how can we decide which is best?
- go for the simplest model - the model with the fewest predictors/easiest terms
- use cross-validation - check how well the model you fit to one data set fits data in a novel data set
How would you change the equation from linear to non-linear?
linear regression equation
Y = c + bX
To change it to a non-linear equation, add an X2 term
Y = c + b1X + b2X2
can do this using transform → compute
Non-linear regression equation
We could try this function on our non-linear dataset:
Y = c - b1X + b2X2
- b1X = the linear term (b1 is its coefficient)
- b2X2 = the non-linear term, because we are squaring our IV
How do you run non-linear regression in SPSS by hand?
Transform → Compute
- create a new predictor, X2 (the IV squared)
- then run the (normal linear) regression including both the linear (X) and non-linear (X2) terms (see the sketch below)
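The same "by hand" idea sketched in Python rather than the SPSS menus: compute X2 yourself, then run an ordinary regression with both X and X2 as predictors. The data and names are illustrative.

```python
# Hand-made squared predictor, then a normal linear regression with X and X2.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(3)
x = np.linspace(-3, 3, 150)
y = 1.0 + 2.0 * x + 1.5 * x**2 + rng.normal(size=x.size)

x2 = x ** 2                                  # the new predictor ("Transform -> Compute")
design = sm.add_constant(np.column_stack([x, x2]))

quadratic_fit = sm.OLS(y, design).fit()
print(quadratic_fit.params)                  # c, b1, b2
```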
What do you check after adding the non-linear predictor into the model?
First, check R2: does the non-linear model explain more of the variance in the DV than the linear model did?
Secondly, check whether X2 is significant in the coefficients table. This tells us that the non-linear term explains a significant amount of variance in the DV. Depending on the p values of the linear and non-linear terms, the non-linear one may even explain MORE of the variance.
Thirdly, check the residual plot (see the sketch below).
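A sketch of those three checks in Python: compare R2 across the two models, look at the p value of the X2 term, and inspect the residual plot of the improved model. The simulated data are illustrative.

```python
# (1) R2 comparison, (2) significance of X2, (3) residual plot of the quadratic model.
import numpy as np
import matplotlib.pyplot as plt
import statsmodels.api as sm

rng = np.random.default_rng(4)
x = np.linspace(-3, 3, 150)
y = 1.0 + 2.0 * x + 1.5 * x**2 + rng.normal(size=x.size)

linear = sm.OLS(y, sm.add_constant(x)).fit()
quadratic = sm.OLS(y, sm.add_constant(np.column_stack([x, x**2]))).fit()

print(linear.rsquared, quadratic.rsquared)   # (1) does R2 improve?
print(quadratic.pvalues[2])                  # (2) is the X2 coefficient significant?

plt.scatter(quadratic.fittedvalues, quadratic.resid)   # (3) should now look random
plt.axhline(0, color="grey")
plt.xlabel("Predicted value")
plt.ylabel("Residual")
plt.show()
```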
Why is it better to have the residuals random?
Because of the assumptions of the general linear model:
- you fit the equation by minimizing the sum of squares of the residuals
- that minimization assumes the residuals are normally distributed and there is no bias
- if that assumption is violated, then any values generated are biased
- in GLM regression you want the residuals to be random and ideally normally distributed
- remember what the p value actually means - the probability of having observed this data under the assumption that the null is true
- that null hypothesis includes assumptions about the residuals - that when you fit the model the residuals follow a normal distribution, with no heteroscedasticity and no bias
- whenever that assumption is violated, your p values, even though you have them, are biased (see the sketch below)
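One hedged way of checking that assumption, sketched in Python: a Shapiro-Wilk test on the residuals plus a quick scatter of residuals against predicted values. This is only an illustration, not the deck's SPSS procedure or the only valid check.

```python
# Check residual normality (Shapiro-Wilk) and eyeball the residual cloud.
import numpy as np
import matplotlib.pyplot as plt
import statsmodels.api as sm
from scipy import stats

rng = np.random.default_rng(5)
x = rng.normal(size=120)
y = 0.5 + 1.2 * x + rng.normal(size=120)

fit = sm.OLS(y, sm.add_constant(x)).fit()

print(stats.shapiro(fit.resid))              # non-significant p -> no evidence against normality

plt.scatter(fit.fittedvalues, fit.resid)     # should look like a random, even cloud
plt.axhline(0, color="grey")
plt.show()
```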
polynomial - general form of this equation
Y = c + b1X + b2X2 + b3X3 + ...
- y is the sum of
- intercept/constant
- plus some coefficient times X
- plus some coefficient times X2
- plus some coefficient times X3
- etc
This is a polynomial. This is the general form of the equation we would use to fit this dataset (see the sketch below).
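A short Python sketch of fitting that general polynomial form, here with degree 3 (Y = c + b1X + b2X2 + b3X3); the data are simulated for illustration.

```python
# Fit a cubic polynomial to simulated data with numpy.
import numpy as np

rng = np.random.default_rng(6)
x = np.linspace(-2, 2, 100)
y = 1.0 + 0.5 * x - 2.0 * x**2 + 0.8 * x**3 + rng.normal(scale=0.5, size=x.size)

coeffs = np.polyfit(x, y, deg=3)   # returns [b3, b2, b1, c] (highest power first)
print(coeffs)
```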
quadratic equation.
two positive coefficients.
what would the curve look like?
Y = 2X + X2
Then we get a curve like this:
- U-shaped/parabolic
quadratic equation.
if one coefficient positive (B1) and other negative (b2)
what would the curve look like?
Y = 2X - X2
Then we get a curve like this:
- inverted U shape
General form of quadratic equation.
if one coefficient negative (B1) and other positive (b2)
what would the curve look like?
Y = -2X + X2
Then we get a curve like this:
- U-shaped, but tipped to the other side
- different from the U shape we get when both coefficients were positive
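A quick Python sketch plotting the three quadratic shapes from the cards above (both coefficients positive, b2 negative, and b1 negative), so you can see the U, the inverted U, and the tipped U side by side.

```python
# Plot the three quadratic curves discussed above.
import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(-4, 4, 200)

plt.plot(x, 2 * x + x**2, label="Y = 2X + X2 (U-shaped)")
plt.plot(x, 2 * x - x**2, label="Y = 2X - X2 (inverted U)")
plt.plot(x, -2 * x + x**2, label="Y = -2X + X2 (U tipped the other way)")
plt.axhline(0, color="grey", linewidth=0.5)
plt.legend()
plt.show()
```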