Logistic regression Flashcards
what makes logistic regression different from all the other types of regression: linear regression, multiple linear regression, non-linear regression?
For the others
they model ratio/scale data - the DV must be ratio/scale
necessary because we use the sum of squared residuals as a means to fit the model - a parametric approach
logistic regression
used if the DV has a limited range - e.g., either 1 or 0, or between 0 and 100
- could be like marks in a test
- accuracy scores
- etc
what is the typical linear regression equation?
Y = c + bX - here we assume a linear relationship between the IV and DV
why can we not use linear regression if our DV is limited in range e.g., pass (1) or fail (0)?
because certain IV values will map onto 0 (e.g., 40%) and other values onto 1 (e.g., 90%)
Problem: below 40 the model will predict values lower than 0, and above 90 it predicts values higher than 1. Also, anything in between will equate to something between 0 and 1.
this doesn't make sense
- can't have values between 0 and 1 - we want to predict ONLY 0 and ONLY 1
- and can't have values exceeding 1 / less than 0, but the regression equation predicts values outside the 0-to-1 range
serious problem. This is because it creates really large residuals, which will distort or bias our regression fit.
Residuals will also violate the assumption of homoscedasticity because the data range is limited to 0 and 1.
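To see the problem concretely, here is a minimal sketch (the attendance/pass data are invented for illustration) that fits ordinary least squares by hand and shows the straight line predicting values outside the 0-to-1 range:

```python
# Invented data: attendance (%) as the IV, pass (1) / fail (0) as the DV
xs = [30, 40, 50, 60, 70, 80, 90]
ys = [0, 0, 0, 1, 1, 1, 1]

# Closed-form OLS fit: b = cov(x, y) / var(x), c = mean(y) - b * mean(x)
n = len(xs)
mx, my = sum(xs) / n, sum(ys) / n
b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
c = my - b * mx

def predict(x):
    return c + b * x

# The straight line happily predicts impossible values
print(predict(100))  # greater than 1: "more than certain" to pass
print(predict(20))   # less than 0: "less than impossible" to pass
```

The residuals for those extreme predictions are exactly the large, range-violating residuals the card describes.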
what is logistic regression
a special case of non linear regression.
why is logistic regression a special case of non linear regression
because it deals with this limitation in range
different types of logistic regression
Logistic regression
if you have a limited range in the DV, e.g., proportion of correct answers on a test. This gives a continuous prediction
Binary logistic regression
type of logistic regression where the DV is binary e.g., 0 or 1. this just ensures we get a binary outcome of either 0 or 1.
both cases deal with this limitation in range of the DV
if I asked people to respond on a 7-point Likert scale and then averaged the scores, would I use logistic regression, because technically there is a limited range of answers?
No, because while the scale is limited in range you are analysing the average, which, according to the central limit theorem, is approximately normally distributed.
what's the big problem of using linear regression with data limited in range?
the linear equation will fit the 0/1 values at certain points, but everywhere else the residual is large! the big problem is that it will predict values larger than 1 and smaller than 0.
we have a real problem with the residuals, because whenever we fit linear regression models the residuals are what we use to do the fitting
this will bias any result we get - a real problem
can't we just fit a non-linear curve to the binary/limited-range DV?
one that nicely levels off at 0 and at 1
Let's say we invent and fit a logistic curve to the binary data - it seems to do quite ok. Can we be satisfied with this?
no - while it fits ok, we want to find the best-fitting logistic curve. that's what logistic regression does:
finding the best-fitting curve that has an S shape
What is the equation for the non-linear curve we fit in logistic regression?
p = e^(c + bX) / (1 + e^(c + bX))
what is e?
it's a constant called Euler's number (approximately 2.718)
what is the OLS regression equation?
Y = c + bX
what is the rationale for using the logistic regression equation?
- deals with the limitation of range - e.g., 0 to 100
- its functional form is very flexible - fits a wide range of data
- there are analytical solutions for it - just raising Euler's number to the power of c + bX
- easier to compute than other non-linear regression problems
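A small sketch of the logistic curve (the constant c and coefficient b here are made up for illustration) showing why it deals with the range limitation: whatever X you feed in, the prediction stays strictly between 0 and 1, levelling off at both ends:

```python
import math

c, b = -12.0, 0.2  # hypothetical constant and coefficient

def logistic(x):
    # p = e^(c + bX) / (1 + e^(c + bX)), with e being Euler's number
    z = math.exp(c + b * x)
    return z / (1 + z)

# Predictions never escape the 0-to-1 range, unlike a straight line
print(logistic(0))    # close to 0
print(logistic(60))   # middle of the S
print(logistic(120))  # close to 1
```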
just like in linear regression, the form of the equation we are fitting is _____?
fixed
thus when fitting the model we are just finding the best-fitting numerical values for the coefficients in the equation (c and b)
what is logistic regression doing?
modelling/predicting data between 0 and 1
mathematically, what is a prediction, and what do we call it in statistics?
mathematically a prediction is the probability that a case has a value of 0 or 1
how do we get the probability/prediction?
plug your value of X into the logistic regression equation
what can we use the probability (prediction) to compute?
the odds
the equation: the probability of an event happening divided by the probability of it not happening
essentially the odds are Euler's number raised to the power of the regression equation (c + bX) with our best-fitting coefficients. so if you know the logistic regression equation you can compute the probability directly, but you can also compute the odds
what do we use to measure effect size in logistic regression
the odds ratio
how do we compute the log odds
the natural logarithm of the odds
taking the natural logarithm is the inverse of raising e to a power
what kind of relationship does the logistic regression have with the log odds
any logistic regression is linear with respect to the log odds (just like with OLS regression)
so by taking the natural logarithm of the odds you are creating a new unit (or DV, if you will) that is linear in terms of the independent variable X
the log odds vary from negative infinity to infinity as the probability moves from 0 to 1
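A quick numerical check (with hypothetical coefficients) that the log odds really are linear in X: every 1-unit step in X shifts the logit by exactly the coefficient b, while the logit itself can run from very negative to very positive as the probability approaches 0 or 1:

```python
import math

c, b = -12.0, 0.2  # hypothetical coefficients

def prob(x):
    z = math.exp(c + b * x)
    return z / (1 + z)

def logit(p):
    # natural logarithm of the odds
    return math.log(p / (1 - p))

# The logit recovered from the probabilities is linear in X:
steps = [logit(prob(x + 1)) - logit(prob(x)) for x in range(40, 81)]
print(steps[0])       # every step equals b

print(logit(0.0001))  # large negative as p -> 0
print(logit(0.9999))  # large positive as p -> 1
```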
what is another word for log odds
logit
logit regression equation
logit = c + bX
- the result of this is your logit
- or logistic probability unit
here the logit for a yes vs no answer is -213.056
so we have the normal regression equation with the constant and coefficient. to get the logit we just plug X (-6 in this particular case) in
when would you use Euler's number?
if you wanted to compute the odds or probability for a data point that is not in your dataset.
how do you go from the logit to the odds?
exp(...) is just a different notation for Euler's number raised to the power of whatever is in the parentheses. this is how you would enter it (universal - this is how it's done in R, SPSS, MATLAB).
describe the relationship between the logit and odds
raising e to the power of the logit, and taking the natural logarithm, are inverse operations of one another
So you take the natural log of the odds to get the logit (orange arrow). To go from the logit to the odds you raise e to the power of the logit (green arrow)
To get the odds you literally just type:
“exp(logit value, -213.056 here)”
what would 2.9577E-93 be?
just a really small number. A negative sign after the E means you shift the decimal place that many places (93 here) to the left.
if it were E+93, you would shift the decimal place 93 places to the right
how do you go from the odds to the probability/prediction ?
odds/(1+odds)
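Chaining the conversions from the last few cards together as a sketch, using the logit value -213.056 from the example (exp and the natural log really are inverses, so the chain comes back to where it started):

```python
import math

logit = -213.056                       # logit from the example
odds = math.exp(logit)                 # logit -> odds: e raised to the logit
p = odds / (1 + odds)                  # odds -> probability
back_to_logit = math.log(p / (1 - p))  # probability -> odds -> logit again

print(odds)  # the "really small number" from the slides, about 2.9577E-93
```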
Why has Lore added the column on the end, "rounded p"?
Because the number is soooo small (look at the e- on the end). And remember we are doing logistic regression - so the outcome has to be either 0 or 1.
here, has the logistic regression done a good job matching the outcome?
in this example, you can see the prediction/probability matches the outcome very well (yes/no column). so in this case the logistic regression equation does a really good job.
what relationship is there between the measure and the logit?
if you plotted the relationship between the measure and the logit it would be perfectly linear
what is the relationship between the logit and the measure?
linear
how does this compare to the relationship between the measure and the data itself (the yes/no responses)?
that relationship has a limitation in range, and an S-shaped curve is the best fit
what the hell is a case vs not a case?
when the prediction of the logistic regression is a "yes" or 1 it's a case, and "no" or 0 is not a case.
is it always right?
sometimes your model might predict something to be a case when it's actually not. this is where the residuals come in (this is what we're trying to minimise when we fit the regression equation)
(we try to minimise the sum of the squared residuals)
(residuals of the logit)
how do we turn the probability spat out by the logistic regression into a binary outcome?
you have a cut-off, typically .5
what do the odds range between?
0 and positive infinity
what range for the probability?
probability can only range between 0 and 1
what's the relationship between the two?
they are related such that an increase in the probability means an increase in the odds
Which do we report: the odds or the probabilty?
up to you!
how do you write up whether something was a case or not?
how can we find the point of 50/50 split in the dataset
take the negative of the constant and divide it by the coefficient (X = -c/b)
this tells you where the prediction is exactly .5
this data point might not exist in your dataset, but it tells you which level of the IV would mark the 50/50 split in prediction
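A sketch using the constant and coefficient from the attendance example later in these cards (-12.259 and 0.198): the 50/50 point is -c/b, and plugging it back into the logistic equation returns a probability of exactly .5:

```python
import math

c, b = -12.259, 0.198  # constant and coefficient from the attendance example

x50 = -c / b           # level of the IV giving a predicted probability of .5
print(x50)             # around 62% attendance

def prob(x):
    z = math.exp(c + b * x)
    return z / (1 + z)

print(prob(x50))       # exactly .5 at the 50/50 point
```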
what is a classification table?
a table showing how many cases were correctly labelled as either a fail or a pass
when might you consider changing the cut-off?
look at the classification table. Maybe you care more about correctly predicting X rather than Y. this might depend on the specific research question - e.g., if it's about predicting whether someone has a disease then of course you want to be extra careful not to miss a positive case
by lowering the probability cut-off we are more likely to get a case, and can then do a follow-up test to make sure. a more liberal approach - more likely to catch the real cases
what's the difference between odds and probability?
the IV, for example, can be "attendance"
____ is linearly related to the _IV___
what 2 things are not linearly related to the IV?
the relationship between the IV and the logit is linear. means for every unit increase in your IV you have a constant increase in your logit.
e.g., if the coefficient was 0.1980 then a 1-unit increase in the IV would increase the logit by 0.1980 - regardless of whether attendance (the IV) increased from 56-to-57% or from 63-to-64%
the probability and odds are not linearly related to the IV. that's the whole point - we're fitting a logistic regression equation because we don't have a linear relationship.
for each successive pair of odds, for a 1-unit increase in the IV…
the ratio will always be the same
the ratio of successive odds is a constant (consistently the same)
this is what we call the odds ratio (1.2190 here)
why is the odds ratio often used as the effect size in logistic regression?
it tells you how the odds change for a one-unit increase in your IV
for each successive pair of probability values, for a 1-unit increase in the IV…
the difference will not be the same across all pairs
At 55% attendance the odds of passing were 0.2551. what are the odds of someone with 56% attendance passing?
- 55% (odds = 0.2551)
- odds ratio of 1.2190
0.2551 * 1.2190 = 0.3110
basically use the odds ratio as a multiplier on the odds at 55% to get each subsequent odds
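A numeric check of the multiplier idea, using the attendance example's coefficients (the exact odds differ slightly from the rounded slide values): the ratio of odds at successive attendance levels is always exp(b), the odds ratio:

```python
import math

c, b = -12.259, 0.198     # attendance example coefficients

def odds(x):
    return math.exp(c + b * x)

odds_ratio = math.exp(b)  # Exp(B)
print(odds_ratio)         # about 1.219

# successive odds always differ by the same multiplier...
print(odds(56) / odds(55))  # equals the odds ratio
print(odds(64) / odds(63))  # same again

# ...so the slide's calculation: odds at 55% times the odds ratio gives odds at 56%
print(0.2551 * odds_ratio)  # about 0.311
```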
how does SPSS fit a non-linear regression equation? what method does it use?
it uses an iterative procedure. basically it takes a guess at what the parameters should be and keeps changing them until the difference between successive solutions is less than some critical value. this is more efficient if you have a large dataset with a complex equation
it does this using the maximum likelihood method to get the estimates for your parameters
meaning it selects/finds the coefficients that make the observed data most likely
to determine what's more likely it uses the sum of the squared residuals of the logit
why might my logistic regression give me and Ray slightly different outcomes?
because it uses an iterative procedure, potentially starting at different points, it might give you a slightly different outcome to someone else
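A toy sketch of what an iterative maximum-likelihood fit is doing (the data are invented and the plain gradient-ascent update is a simplification - SPSS uses a more sophisticated algorithm - but the idea of adjusting the coefficients until successive solutions barely change is the same):

```python
import math

# invented scores (IV) and pass/fail outcomes (DV)
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [0, 0, 0, 1, 0, 1, 1, 1]

def prob(x, c, b):
    return 1 / (1 + math.exp(-(c + b * x)))

def log_likelihood(c, b):
    # higher = the observed outcomes are more likely under these coefficients
    return sum(math.log(prob(x, c, b) if y else 1 - prob(x, c, b))
               for x, y in zip(xs, ys))

c, b = 0.0, 0.0   # starting guess
rate = 0.01       # step size
for step in range(500000):
    ps = [prob(x, c, b) for x in xs]
    # gradient of the log likelihood with respect to c and b
    gc = sum(y - p for y, p in zip(ys, ps))
    gb = sum((y - p) * x for x, y, p in zip(xs, ys, ps))
    c += rate * gc
    b += rate * gb
    # stop once successive solutions change less than some critical value
    if max(abs(rate * gc), abs(rate * gb)) < 1e-12:
        break

print(c, b)  # fitted coefficients
```

Different starting guesses or critical values can leave the procedure at very slightly different final coefficients, which is the point the card makes.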
what is the first table we consider in the logistic regression output?
the dependent variable encoding table
have a close look and make sure what you consider a case is coded as 1
e.g., this should be pass
this is important not only because the nature of the equation would change, but so would our interpretations
What is the block 0 model - what information does it give you?
SPSS making a prediction without using any IVs. basically a best guess
-2 log likelihood - a goodness-of-fit measure
classification table
- gives us the specificity and sensitivity - how well the model predicts the real outcomes
- specificity - % it correctly predicts what isn't a case (0)
- sensitivity - % it correctly predicts what is a case (1)
block 0 model
if specificity is 0 and sensitivity is 100% with an overall accuracy of 50% - what does the model tell us?
- specificity - % it correctly predicts what isn't a case (0)
- sensitivity - % it correctly predicts what is a case (1)
just guessing everyone will pass gives an overall accuracy of 50%, with a sensitivity of 100% and a specificity of 0%.
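The block 0 "best guess" in miniature (the outcomes are invented - half passes, half fails): predicting the most frequent outcome for everyone yields exactly this pattern of accuracy, sensitivity and specificity:

```python
# invented outcomes: 4 fails (0) and 4 passes (1)
ys = [0, 0, 0, 0, 1, 1, 1, 1]

guess = 1                 # predict "pass" for everyone
preds = [guess] * len(ys)

accuracy = sum(p == y for p, y in zip(preds, ys)) / len(ys)
sensitivity = sum(p == 1 and y == 1 for p, y in zip(preds, ys)) / ys.count(1)
specificity = sum(p == 0 and y == 0 for p, y in zip(preds, ys)) / ys.count(0)

print(accuracy, sensitivity, specificity)  # 0.5 1.0 0.0
```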
what variables are in the equation for our block 0 model?
constant only
what is the logit of this?
the logit equation (c + bX) reduces to just the constant; here the constant has a value of 0, so the logit is 0
then what are the odds using the logit?
the odds are e raised to the power of the logit: exp(0) = 1
Why is the block 0 model useful?
We use block 0 as a benchmark - how well would you do if you just predicted the most frequent outcome?
any subsequent model that uses your IV is assessed against that benchmark. If everyone had the same outcome we wouldn't need to fit any model
what is block 1
Step where the actual IV(s) are being fitted
what does this tell us?
SPSS took six iterations to arrive at the final solution. The -2 log likelihood has decreased from the block 0 model
What tells us whether the jump from block 0 to the block 1 model is a significant improvement in prediction?
Omnibus tests of model coefficients
Chi squared.
Tells us the difference in the -2 log likelihood between block 0 and block 1, p value tells us whether this is significant
what do Cox and Snell R2 & Nagelkerke R2 tell us?
they are goodness-of-fit measures
NOT like R2 in linear regression - they do not tell us about the variance explained in the DV. Don't explain them in those terms; it makes no sense in the context of logistic regression
Referred to as pseudo R-squares
Nagelkerke pseudo R2 is the preferred one in the literature because it's normalised - it ranges between 0 and 1
A value of 1 is as good as it gets
You SHOULD always report the value, but it's most useful when comparing different models, as higher values indicate better fit/performance
what does the Hosmer and Lemeshow test tell us?
another goodness-of-fit test. tells us how well the model fits the data.
if it's not significant, there is no significant difference between the model's predictions and the data - good
if it is significant, the prediction deviates significantly from the data - the model does not do a good job
what does the variables in the equation table tell us?
- constant
- IV coefficients
- odds ratio for each predictor - Exp(B)
use these to write the logit equation:
log odds = -12.259 + (0.198 * attendance)
interpret what the last right column means
Exp(B) is the odds ratio. here it is 1.219, which means that for every 1% increase in attendance the odds of passing are multiplied by 1.219.
what is the Wald statistic - what does it tell us here?
the significance of the coefficients here is determined using the Wald statistic. it is a bit conservative.
using it, we can see neither the coefficient for the constant nor for attendance is significant (p values are larger than .05).
looking at this statistic, we might conclude that including attendance doesn't help us at all - not significant.
if the Wald statistic tells us the IV is not significant, shall we just delete our whole SPSS account and go to sleep?
No! the test itself is quite conservative. Even if it says something is not significant, it's best to look at your overall model (omnibus test of coefficients)
remember this compares the -2 log likelihoods and decides whether block 1 is a significant improvement or not. this is a better indicator of significance - it tells us at an overall level whether the inclusion of IVs significantly improved the model.
the Wald statistic is applied to each coefficient independently + is more conservative.
what can we do to inspect the misclassified items?
look at the casewise list - this is helpful for smaller datasets. if you have thousands of people then the classification plot is more helpful
interpret this classification plot
FFFFFFFPPPPPPP along the bottom is the predicted outcome
the symbols above are the actual outcomes
also note how many people each symbol represents (here each symbol is .25 of a person, so every 4 symbols is 1 person)
if only a few people were misclassified slightly to the right, you might want to change your cut-off to a higher value, e.g., .8
what does the casewise list tell us?
whether there were any outliers. if there are no outliers the casewise list of residuals will be empty
can we only do logistic regression with 1 variable?
No, just like with any regression we can include multiple IVs in the model
each predictor will have its own coefficient
With multiple IVs in logistic regression can we use the block 0 model to write our logit equation?
yes, but because we don't have any IVs the logit is simply the value of the constant
the Exp(B) for that constant, .684, gives the odds
how do we know if the model including the multiple IVs is a better fit than the block 0 model?
The -2 log likelihood has dropped from block 0 to block 1, indicating a better fit to the data.
we can check whether this is significant by looking at the omnibus test (model row).
a second way: compare the sensitivity, specificity and overall accuracy of the two in the classification table
pseudo R-squared statistics - when are these useful to look at?
When you are comparing multiple models
a higher value tells you the model fits better. compare this value across models to see which one is better
what does the Hosmer and Lemeshow test show?
we want this to be non-significant. that indicates there is no significant difference between the predicted values and the observed data values
Write the logit (log odds) regression equation from this table
of course we also want to know whether all variables in the model contributed significantly - this table indicates they do (under Sig.)
so the Wald statistic confirms what the omnibus test told us
multiple IVs in the logit regression equation - is there a single odds ratio summarising all coefficients, OR does each coefficient have its own odds ratio?
each coefficient has its own odds ratio, Exp(B)
using the odds ratio, interpret the relationship between
- idealism scores and the decision to stop research on cats
- relativism scores and deciding to continue the research on cats
idealism has an odds ratio of .502
- means a 1-point increase in idealism score leads to a reduction in the odds of someone deciding to continue the research
- negative relationship between idealism and something being considered a case
relativism has an odds ratio of 1.409
- means a 1-unit increase in relativism score multiplies the odds of someone deciding to continue the research by 1.409
- positive relationship between relativism and something being considered a case
here the constant is not significant. should we remove it from the model?
no - the constant is part of the equation and is kept regardless of its significance
using the odds ratio, interpret the relationship between
- gender and the decision to stop research on cats
gender is a binary variable
- males coded as 1
- the odds ratio means the odds for men deciding to continue the research are 3.225 times higher than for women
so basically it's looking at the difference between men and women; men were coded as 1, so anything it picks up is what differed between men and women.
when looking at the relationship between the IVs and prediction scores (e.g., deciding to continue or stop research), is it better to look at the odds ratios or the coefficients?
Better to look at the odds ratios, as the coefficients are sensitive to scale while the odds ratios are not
so the magnitude of the odds ratio is more informative in that respect
with which type of regression are we interested in residuals
all
let's look at the residuals, taking a single case at random:
- write the logit regression equation
- then use this to predict the odds
- exp(logit) = .121 (basically the odds of this person deciding to continue the research are low)
- then we get the probability/prediction for this, which (odds/(1+odds)) leaves us with .108
they actually decided to continue the research (actual score 1)
so the residual here is 1 - .108 = .892
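The same worked example as a sketch (the logit of about -2.112 is back-computed from the odds shown on the slide, so treat it as approximate):

```python
import math

logit = -2.112          # approximate logit for this case
odds = math.exp(logit)  # about .121: low odds of deciding to continue
p = odds / (1 + odds)   # about .108: the predicted probability
residual = 1 - p        # they actually continued (actual score 1)

print(odds, p, residual)
```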
What is the residual
The difference between the actual binary outcome and predicted probability of something being a case
What is stored in the casewise list
Case wise list flags any outliers
- default is any residual that exceeds 2 SD
What are the three end columns showing?
- the residuals
- Resid is the unstandardised residual that we get by subtracting the predicted outcome value from the observed one
- Then we have our standardised residuals
- You can see here they all exceed 2 SD
- The studentized residuals also exceed 2 SD
What should you do if you have some variables in the case wise list identified as outliers?
Investigate them and look for a pattern to those cases which are identified as outliers e.g., a value entered in an incorrect way. E.g., a case with a residual of 10
How do outliers affect the regression equation?
They have a disproportionate effect on the predictive equation. Removing them would change the regression equation.
How do I know whether to remove or retain outliers in my regression analysis?
Keep them in if
- you only have a few outliers e.g., 4 out of 315
- they are close to the threshold of 2 SD (remove one if its residual was something like 10)
why are normal residual plots not really applicable to logistic regression?
Because the DV takes the form of either 0 or 1
- thus the residuals will be highly clustered
Look at the residual plot. How alarming would this be in OLS regression vs logistic regression?
It would be very concerning in OLS regression, but not so much with logistic regression. These plots are not high in diagnostic value here, which is why we do not make use of them.
Ok all 3 IVs are significant.
How can we figure out what the best predictor in our logistic regression model is? And what the order of the remaining predictors is?
- so far we have used the enter method to run the logistic regression
- for ranking the IVs from best to worst we use a stepwise procedure
- called the forward likelihood ratio - "forward LR"
- this evaluates the contribution of each predictor with respect to the overall log likelihood of the model
- is there a significant change in the -2 log likelihood when this predictor is added into the equation?
- allows you to formally test the relative merit of each variable, and the order of their significance
Discuss the output of the forward likelihood ratio (forward LR)
Get output in a series of steps – 1 step for each variable in the model.
e.g., 3 steps if 3 variables are added to the model
each step – another predictor variable is added and you get an update of the results
Interpret this forward LR output
The different steps add a new predictor into the model. Remember the order they are added reflects their relative contributions to the overall fit and the reduction in -2 log likelihood
- step 1: idealism is added (best predictor)
- step 2: gender is added (second best predictor)
- step 3: relativism is added (third best predictor)
- when more variables are added to the model, the coefficients and odds ratios of the other variables change
- note how idealism's coefficient and odds ratio both change when gender is entered, then again when relativism is entered
Analysing logistic regression using the enter method vs the forward LR (stepwise in order of best-worst) method. How would the FINAL 3 variables look in terms of coefficients and odds ratio?
They would be the exact same.
describe the classification table using the forward LR method
- you get a different classification table for each variable added into the analysis
- what is important to report alongside the classification table? the cut-off point
- in all classification tables the specificity, sensitivity and overall percentage correct are affected by the cut-off point
Cross Validation
- An approach where you derive the regression equation from one dataset then apply it to another. Can use CV to test how well the regression equation predicts novel data
- Could compute the regression equation for half the dataset then test this model on the other half
- Use this regression equation in the other dataset to predict the outcomes (logit, odds, and probabilities). Use these along with the cut-off to make a prediction (whether something is classified as a case or not a case)
- Then for this new dataset, cross-tabulate the predicted outcome against the actual outcome: the "cross-validation classification table"
- Then use overall accuracy, specificity and sensitivity to determine if you have a good regression equation
When have we covered cross validation in the context of linear regression?
There we used t tests and correlations to test how well a model predicts data (generalises). In logistic regression we use the classification table to test how well the model does this.
Interpret this cross-validation classification table
Left column: stop and continue (actual outcome); top row: stop and continue (predicted outcome)
53 decided to stop – correctly predicted to stop, 6 others decided to stop but were incorrectly predicted to continue. 35 decided to continue but were incorrectly predicted to stop, and 21 decided to continue and were correctly predicted to continue.
- Specificity of model: 53/59 = 89.8%
- Sensitivity of model: 21/56 = 37.5%
- Overall accuracy: (53 + 21) / 115 = 64.3%
Overall accuracy = everyone correctly predicted to stop/continue divided by everyone.
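The table's arithmetic as a quick sketch, using the counts above:

```python
# rows: actual outcome, columns: predicted outcome
stop_stop, stop_cont = 53, 6   # actually stopped: predicted stop / continue
cont_stop, cont_cont = 35, 21  # actually continued: predicted stop / continue

total = stop_stop + stop_cont + cont_stop + cont_cont  # 115 people

specificity = stop_stop / (stop_stop + stop_cont)      # correct "not a case" predictions
sensitivity = cont_cont / (cont_stop + cont_cont)      # correct "case" predictions
accuracy = (stop_stop + cont_cont) / total             # everyone correctly predicted / everyone

print(round(specificity * 100, 1))  # 89.8
print(round(sensitivity * 100, 1))  # 37.5
print(round(accuracy * 100, 1))     # 64.3
```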
Using cross-validation the overall accuracy is 64.3%. Is this good or bad?
To decide whether this is a good model you might have to compare this to performance based on chance, both in the overall dataset and/or in the new independent dataset.