Chapter 9: Linear Model (Regression) Flashcards
What is the difference between a linear model and correlation?
A linear model differs from a correlation only in that it uses an unstandardized measure of the relationship (b1) and adds a parameter (b0) that tells us the value of the outcome when the predictor is 0.
b0 = intercept & represents the value of the outcome when predictor is 0
Any straight line can be defined by two things:
1) the slope (b1) - the relationship between the predictor and the outcome in unstandardized units
2) the point at which the line crosses the vertical axis of the graph, known as the intercept (b0)
b1 & b0 are parameters known as regression coefficients
b1 represents mean change in the outcome for one unit change in predictor
Regression is used for two things:
1) prediction
2) test theories/explanations
A good model…
1) fits the data better than a model with no predictors
2) should account for an amount of variance that is judged to be of practical and/or scientific significance
3) individual predictors/regression coefficients are significantly different from 0
4) should not have any outliers biasing the model
5) expected to predict the outcomes well in other samples
6) can be generalized to additional samples (cross validation) because assumptions are met
does not prove causation; it only tests whether the data are consistent with a causal hypothesis
once a model is estimated, it can be used for forecasting; the forecast for a given case is called the predicted value
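As a minimal sketch, forecasting with a fitted model just plugs a predictor value into the line; the coefficient values below are made up for illustration.

```python
# Sketch of using a fitted linear model for forecasting; b0 and b1 are illustrative.
b0 = 2.0   # intercept: predicted outcome when the predictor is 0
b1 = 0.5   # slope: mean change in outcome per one-unit change in predictor

def predict(x):
    """Return the model's predicted value for a predictor score x."""
    return b0 + b1 * x

print(predict(0))   # 2.0  (the intercept)
print(predict(4))   # 4.0
```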
Same intercept, different slopes
b0 (intercept) is the same in each but b1 (slope) is different. it looks like three lines coming out of the same point in the graph.
Same slope, different intercept
b1(slope) is the same in each but b0(intercept) is different in each model. it looks like three separate lines going in the same direction, but do not connect at any point.
+ b1 = + relationship w/ outcome; - b1 = - relationship w/ outcome
the slope (b1) tells us what the model looks like (shape); the intercept (b0) locates the model in geometric space.
What is a regression analysis?
What are the two types?
term for fitting a linear model to data and using it to predict values of an outcome (dependent) variable from one or more predictor (independent) variables.
simple regression: one predictor variable in the linear model
multiple regression: several predictors in the linear model
What are residuals?
differences between what the linear model predicts and the observed data
What is the residual sum of squares (SSr)?
gauge of how well a linear model fits the data. it represents the degree of inaccuracy when the best model is fitted to the data. if it is a large number, it means the model is not representative of the data (a lot of error in prediction). if it is a small number, the line is representative of the data.
total amount of error in a model/does not tell us how good the model is
higher with more people
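The definition above can be sketched directly: square each residual and add them up. The data and coefficients below are illustrative only.

```python
# Sketch of the residual sum of squares (SSr); data and coefficients are made up.
def ss_residual(x, y, b0, b1):
    """Sum of squared differences between observed y and the model's predictions."""
    return sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))

x = [1, 2, 3]
y = [2.0, 4.5, 5.5]
print(ss_residual(x, y, b0=0.5, b1=1.75))   # 0.375
```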
Ordinary least squares (OLS) regression
method of regression in which the parameters of the model are estimated using the method of least squares
essentially, getting b values that make the sum of the squared residuals (error b/w observed and model) as small as possible
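For one predictor the least-squares b values have a closed form: b1 = cov(x, y) / var(x) and b0 = mean(y) - b1 * mean(x). A minimal sketch with made-up data:

```python
# Sketch of the closed-form OLS solution for a single predictor; data are illustrative.
def ols_fit(x, y):
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    # b1 minimizes the sum of squared residuals
    b1 = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sum((xi - mx) ** 2 for xi in x)
    b0 = my - b1 * mx
    return b0, b1

x = [1, 2, 3, 4, 5]
y = [2, 4, 6, 8, 10]        # perfectly linear data: y = 2x
b0, b1 = ols_fit(x, y)
print(b0, b1)               # 0.0 2.0
```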
What happens when there are more predictors?
expanded models with more predictors account for more variance. unless the predictors are completely independent of one another, the b estimates for existing predictors change as additional predictors are added
with 2 predictors, a regression plane must be used to visualize the model
with 3 or more predictors, the model can't be visualized
What is goodness-of-fit?
good models exist on a continuum
assesses how well the model fits the observed data. we do this because even though the model may be the best one available, it can still be a bad fit. usually based on how well the data predicted by the model corresponds to the data actually collected.
done by comparing the complex model against a baseline model to see whether it improves how well we can predict the outcome, then calculating the error. if the complex model is any good, it should have significantly less error than the simple model
simple model is usually the mean of the outcome
R^2 and F Statistic
What is the total sum of squares (SSt)?
represents how good the mean is as a model of the observed outcome scores.
it is a measure of the total variability within a set of observations (total squared deviance between each observation and the overall mean of all observations)
What is the model sum of squares (SSm)?
the improvement in prediction resulting from using the linear model rather than the mean. this difference shows us the reduction in the inaccuracy of the model resulting from fitting the regression model to the data.
if SSm is large, the linear model is very different from using the mean to predict the outcome variable. this implies that the linear model has made a big improvement in predicting the outcome.
if SSm is small, using the linear model is little better than using the mean
SSm = SSt (total sum of squares) - SSr (residual sum of squares)
higher with more predictors
What is R^2?
represents the amount of variance in the outcome explained by the model (SSm) relative to how much variation there was to explain in the first place (SSt).
proportion of variation in the outcome that can be predicted from the model
indicates whether the model is of scientific and/or practical significance
R^2 = SSm / SSt
we want this number to be high - R^2 tells us the overall fit of the regression model
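The whole decomposition (SSt, SSr, SSm = SSt - SSr, and R^2 = SSm/SSt) can be sketched together; the data and coefficients below are illustrative.

```python
# Sketch of the goodness-of-fit decomposition; data and b values are made up.
def fit_stats(x, y, b0, b1):
    my = sum(y) / len(y)
    ss_t = sum((yi - my) ** 2 for yi in y)                          # total variability around the mean
    ss_r = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))  # error left after fitting the model
    ss_m = ss_t - ss_r                                              # improvement over the mean-only model
    r2 = ss_m / ss_t                                                # proportion of variance explained
    return ss_t, ss_m, ss_r, r2

x = [1, 2, 3]
y = [2.0, 4.5, 5.5]
ss_t, ss_m, ss_r, r2 = fit_stats(x, y, b0=0.5, b1=1.75)
print(ss_t, ss_m, ss_r, r2)
```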
Mean Squares (MSm and MSr)
measure of average variability, based on the number of differences that were added up
MSm = SSm / k (k = number of predictors in the model, the model's df)
MSr = SSr / (N - k - 1) (N = number of observations; k = number of predictors, i.e., the number of b values being estimated besides the intercept)
dividing undoes the biasing effect of # of predictors
What is the F-statistic?
measure of how much the model has improved the prediction of the outcome compared to the level of inaccuracy of the model.
if a model is good MSm will be large and MSr will be small
large F-statistic = greater than 1 (a value of 1 indicates no improvement)
F = MSm/MSr
if associated p-value is less than .05, there is significant improvement in prediction over the baseline model w/ no predictors
as the F gets higher, p gets lower (H0 less tenable)
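The F computation above can be sketched from the sums of squares; the SS values below are illustrative.

```python
# Sketch of F = MSm / MSr; the SSm and SSr inputs are made-up example values.
def f_statistic(ss_m, ss_r, n, k):
    ms_m = ss_m / k            # average improvement per predictor
    ms_r = ss_r / (n - k - 1)  # average residual error
    return ms_m / ms_r

# e.g., SSm = 6.125 and SSr = 0.375 from a model with k = 1 predictor and n = 3 cases
f = f_statistic(6.125, 0.375, n=3, k=1)
print(f)
```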
What can the F statistic tell us about R^2?
F can be used to calculate the significance of R^2 (how different it is from 0)
F = [(N - k - 1)R^2] / [k(1 - R^2)]
N = # of cases/participants, k = number of predictors in the model
if associated p-value is less than .05, R^2 is significantly different from 0
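This formula can be sketched directly; the R^2, N, and k values below are made up.

```python
# Sketch of F computed from R^2: F = (N - k - 1) * R^2 / (k * (1 - R^2)).
def f_from_r2(r2, n, k):
    return ((n - k - 1) * r2) / (k * (1 - r2))

# With R^2 = 0.5, N = 52 cases, and k = 1 predictor:
print(f_from_r2(0.5, 52, 1))   # 50.0
```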
What is a flat model?
model in which the same predicted value arises from all values of the predictor values, and will have b-values of 0 for the predictors
if a variable significantly predicts an outcome, it should have a b value that is different from 0
regression coeff of 0 means a unit change in the predictor results in no change in the predicted value of the outcome and the linear model is flat
What is the t-statistic?
tests whether a b-value is significantly different than 0.
if the test is significant, we might interpret this as supporting a hypothesis that the b (regression coeffs) is significantly different (p < .05) from 0 and that the predictor variable contributes significantly to our ability to estimate values of the outcome, after accounting for the other predictors in the model. Each b gets its own test.
t = (b(observed) - b(expected)) / SEb = b(observed) / SEb
b(expected) = b value we expect to obtain if null is true, so 0.
b(observed) = b we calculate
SEb = how much error we estimate is likely in our b
when the SE is small, even a small deviation from 0 can reflect a significant difference, because b is representative of the majority of possible samples
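The t computation is a one-liner; the b and SE values below are illustrative.

```python
# Sketch of the t-test for a single regression coefficient; inputs are made up.
def t_statistic(b_observed, se_b, b_expected=0.0):
    # under H0 the expected b is 0, so this reduces to b / SE
    return (b_observed - b_expected) / se_b

print(t_statistic(1.75, 0.5))   # 3.5
```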
What is generalization?
ability of a model to be applied to other samples aside from the one which it was based on. if it is not generalizable, then we must restrict conclusions only to the sample used.
What are outliers?
data with extreme value on the outcome variable, Y (large residuals)
case that differs substantially from the main data trend. it can affect the estimates of the regression coefficients. outliers can be assessed by unstandardized, standardized, and studentized residuals. they are NOT always influential
unstandardized residuals
raw differences between the predicted and observed values of the outcome variable
standardized residuals
residuals converted to z scores/represented in SD units
1) if they are greater than 3, then they are cause for concern because a value that high is unlikely to occur
2) if more than 1% of our sample cases have residuals w an absolute value greater than 2.5 there is evidence that the level of error in the model is unacceptable
3) if more than 5% of cases have residuals with an absolute value greater than 2, then the model may be a poor representation of the data
use these because they are easier to interpret
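The three rules of thumb above can be sketched as a simple check; the residual values below are made up (and with only 10 cases the percentage rules are coarse).

```python
# Sketch of the rule-of-thumb checks on standardized residuals; z values are illustrative.
def residual_checks(z_residuals):
    n = len(z_residuals)
    return {
        "any |z| > 3": any(abs(z) > 3 for z in z_residuals),                       # individual cases of concern
        "> 1% with |z| > 2.5": sum(abs(z) > 2.5 for z in z_residuals) / n > 0.01,  # unacceptable error level
        "> 5% with |z| > 2": sum(abs(z) > 2 for z in z_residuals) / n > 0.05,      # possibly poor representation
    }

z = [0.2, -1.1, 0.5, 2.1, -0.3, 0.8, -0.6, 1.4, -2.6, 0.1]
checks = residual_checks(z)
print(checks)
```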
studentized residuals
unstandardized residual divided by an estimate of its SD that varies point by point
have the same properties as standardized, but provide a more precise estimate of the error variance of a special case
adjusted predicted value
predicted value of the outcome for that case from a model in which the case is excluded
estimate the model parameters excluding a particular case and use this new model to predict the outcome for the case that was excluded. if the model is stable then the predicted value of a case should be the same regardless of whether the case was used to estimate the model
deleted residual
difference between the adjusted predicted value and the original observed value
tell us about the influence of cases on the ability of the model to predict that case, but not about how the case influences the whole model
studentized deleted residual
the deleted residual is divided by the standard error to give us this value. it can then be compared across different regression analyses
Leverage
gauges the influence of the observed value of the outcome over the predicted values.
high leverage points
data with an extreme value on a predictor variable (x)
these points are extreme on the x-axis. if it drags the regression line towards it, it is influential
the average leverage value is (k + 1)/n, and all cases should have leverage close to it
cases with 2-3x the average leverage value are concerning and should be investigated
range from 0-1. a value of 1 indicates the case has complete influence over prediction
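The leverage rules of thumb can be sketched as a filter; the hat values below are made up (they sum to k + 1, as leverage values do).

```python
# Sketch of flagging cases whose leverage exceeds 2x the average (k + 1) / n.
def flag_high_leverage(leverages, k, multiplier=2):
    average = (k + 1) / len(leverages)          # expected average leverage
    return [h for h in leverages if h > multiplier * average]

leverages = [0.15, 0.18, 0.22, 0.19, 0.21, 0.17, 0.45, 0.16, 0.14, 0.13]
print(flag_high_leverage(leverages, k=1))       # [0.45]
```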
When is a case influential?
a case is influential if the model parameter estimates change substantially if the case is deleted and the model reestimated
a good model should not be so fragile that 1-2 cases change it a lot
cases will be influential if: they have some combination of being extreme on X (leverage) and extreme on Y (outliers)
influence is what matters most
if conclusions change based on 1-2 data points, the conclusions are said to be fragile
not all influential points have large residuals
can be examined using: Cook’s distance, Difference in Beta (DFBeta), Difference in Fit (DFFit).
Cook’s distance
measure of the overall influence of a case on the model
abs values greater than 1 may be a concern
Difference in Beta (DFBeta)
Standardized
measure of how much the estimates of the b’s change when a case is deleted
abs values greater than 1 may be a concern
Difference in Fit (DFFit)
Standardized
measure of the difference in prediction when a case is deleted
abs values greater than 1 may be a concern
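Deletion-based influence, in the spirit of DFBeta (though unstandardized here), can be sketched by refitting the model without each case and tracking how the slope changes; the data are made up, with one case that is extreme on X and off the trend.

```python
# Sketch of deletion-based influence on the slope; data are illustrative.
def ols_slope(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    return sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sum((xi - mx) ** 2 for xi in x)

def slope_changes(x, y):
    full = ols_slope(x, y)
    changes = []
    for i in range(len(x)):
        # refit with case i deleted and record the raw change in b1
        changes.append(ols_slope(x[:i] + x[i + 1:], y[:i] + y[i + 1:]) - full)
    return changes

x = [1, 2, 3, 4, 10]
y = [2, 4, 6, 8, 9]   # the last case is extreme on X and pulls the slope down
changes = slope_changes(x, y)
print(changes)        # the last entry is by far the largest
```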
Consider deleting high leverage points or outliers only when:
they are influential
first check that influential points are not coding error
if it is not coding error: does the case change the conclusions? is it possible to get more observations near that value of X?
if case does change the conclusion: report the results with and without the influential case or restrict your analysis to values of X where the relationship holds
do not use this to drop cases to create desired results (p-hacking)
Assumptions of the linear model
1) additivity and linearity
2) independent errors
3) homoscedasticity
4) normally distributed error
additivity & linearity
outcome variable and predictors are linearly related/can be described by a linear model
do not use a linear model to describe a nonlinear relationship
independent errors
residual terms should be uncorrelated/independent
assumption necessary for CIs and significance tests to be valid
if violated, use robust methods or a multilevel model
homoscedasticity
residuals at each level of the predictor should have the same variance
if violated, CIs and significance tests are invalidated. use weighted least squares regression instead.
normally distributed errors
residuals in the model are random, normally distributed variables with a mean of 0
does not matter with large sample sizes because of the CLT
if violated with a small sample size, use bootstrapped CIs
other assumptions/considerations of the linear model
1) predictors are uncorrelated with external variables
2) Variable type: quantitative or dichotomous predictors, and continuous unbounded criterion
3) no perfect multicollinearity
4) non zero variance
predictors are uncorrelated with external variables
should be no external variables that correlate with any of the variables included in the model
regression results can be biased by an omitted (3rd) variable
if violated, conclusions are unreliable
Variable type: quantitative or dichotomous predictors, and continuous unbounded criterion
all predictor variables must be quantitative or categorical (with two categories), and the outcome must be quantitative, continuous, and unbounded
no perfect multicollinearity
if your model has more than one predictor then there should be no perfect linear relationship between 2 or more of the predictors (predictors should not correlate too highly)
if violated, lead to untrustworthy estimates of the b’s, and SE gets very big
non zero variance
predictors should have some variation in value (not have variance of 0)
cross validation of the model
assessing the accuracy of a model across different samples (how it generalizes to different samples)
two main methods: adjusted R^2 and data splitting
adjusted R^2
tells us how much variance in the predicted outcome would be accounted for if the model had been derived from the population from which the sample was taken (estimates what R^2 would be in a new sample)
when you try to apply a model to a different sample, R^2 from sample 1 to sample 2 will drop, causing a loss of predictive power known as SHRINKAGE.
shrinkage occurs because the process of fitting a model capitalizes on chance
more capitalizing on chance, and thus more shrinkage, occurs when there are more predictors and smaller sample sizes
with large samples, a model will be well estimated and shrinkage will be minimal
regression models are optimized for the sample they were created from
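One common adjusted-R^2 formula (the Wherry-style adjustment most software reports) shrinks R^2 based on the number of predictors k and sample size n; the input values below are illustrative.

```python
# Sketch of Wherry-style adjusted R^2: 1 - (1 - R^2) * (n - 1) / (n - k - 1).
def adjusted_r2(r2, n, k):
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

# With R^2 = 0.5, n = 101 cases, and k = 4 predictors the estimate shrinks slightly:
print(adjusted_r2(0.5, 101, 4))
```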
data splitting
randomly splitting your sample data, estimating the model in both portions of the data, and comparing the resulting models (e.g., an 80/20 split, with the model estimated on the larger portion)
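The splitting step can be sketched as follows; the data are made up, the seed is fixed only for reproducibility, and in practice each portion would then be used to estimate and compare models.

```python
# Sketch of an 80/20 random data split for cross-validation.
import random

def split_data(pairs, train_fraction=0.8, seed=0):
    rng = random.Random(seed)   # fixed seed so the sketch is reproducible
    shuffled = pairs[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]   # (estimation portion, validation portion)

pairs = list(zip(range(10), range(10)))     # made-up (x, y) pairs
train, holdout = split_data(pairs)
print(len(train), len(holdout))             # 8 2
```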
To assume the b’s are normally distributed, we need a large sample. How big does it have to be?
situation specific power analysis is the best way to determine sample size needed.
General guidelines:
- if you expect to find a large effect: 80 people or higher
- if you expect to find a medium effect: 100 people or more if there are 6 or fewer predictors
- if you expect to find a small effect: don’t bother unless you can get a very large sample
Multiple regression: methods of entering predictors into a model
1) forced: all predictors are entered into the model at once. useful for testing theory when there are no established predictors.
2) hierarchical: predictors are added in blocks; established predictors are entered earlier in the process, and new predictors are assessed as a group last. useful for testing theory or the validity of new predictors.
3) stepwise: automatic method in which predictors are added to the model one by one based on partial correlations; the process stops when the removal criterion is met (i.e., the regression coefficient for the added predictor is not significant). useful for exploratory analyses when you have no idea what's going on and want to generate hypotheses. it gives models that can't generalize, and is frowned upon.
- this method is sensitive to sampling variation and the results don't generalize
adding control variables into the model doesn’t purify the analysis and their inclusion can result in inappropriate inferences
Parsimony
- explaining data while being as simple as possible
- accounting for variance in the simplest way
- more predictors account for more variance
- R^2 increases with more predictors added/see if new predictors have value
- change in R^2 from simple model to complex model: if it is significant, then sampling error alone cannot account for the difference
Assessing parsimony
- change in R^2 (significance and magnitude)
- Akaike Information Criterion (AIC): lowest AIC = most parsimonious model, penalizes you for adding predictors
you can compare different models and assess parsimony using the change in R^2 and AIC.
a significant and sizable change in R^2, together with a lower AIC, is an argument for the complex model
What is multicollinearity?
multiple regression
- occurs when the predictor variables themselves are related to each other (highly correlated)
- for ex. trying to predict lawyer salary based on age and experience (hard to tease apart)
- level of multicollinearity varies from none to perfect multicollinearity (continuum). if none, it means that all predictor variables are unrelated. if perfect, can’t fit a regression model because there are infinite models
- mild multicollinearity is not a big deal but high multicollinearity makes it more difficult to estimate the b’s
- high multicollinearity can result in the standard errors of the coeffs being very high. this means more errors in each estimate and a wider sampling distribution (parameters not well estimated)
- parameter estimates change wildly from sample to sample; CI’s will be wide; and harder to find significance
How to diagnose multicollinearity?
- simplest way: examine the correlations among predictors. correlations that are higher than .8 or .9 suggest multicollinearity may be an issue
- examining the correlation matrix will miss more subtle forms of multicollinearity
- we need stats that are specific to detecting multicollinearity. one is the variance inflation factor (VIF) for each predictor: VIF_k = 1 / (1 - R^2_k), where R^2_k comes from regressing predictor k on all the other predictors
- VIF > 10 is a problem
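The VIF formula can be sketched directly; the R^2_k inputs below are made up.

```python
# Sketch of the variance inflation factor: VIF_k = 1 / (1 - R^2_k).
def vif(r2_k):
    return 1 / (1 - r2_k)

print(vif(0.0))   # 1.0 -> predictor unrelated to the others
print(vif(0.9))   # ~10 -> the conventional cause for concern
```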
how to deal with multicollinearity?
1) do nothing
2) get rid of the variables
3) combine the correlated variables
4) use a method that can handle highly correlated variables like partial least squares or principal component analysis
how can the unstandardized simple regression equation be written in standard form?
- variables expressed as z scores
- Zy = r(Zx)
- predicting Y’s z score, input is not the person’s score on X but rather their z score on X
how can the unstandardized multiple regression equation be written in standard form for 2 predictors?
Zy = Beta1(Zx1) + Beta2(Zx2) + …
- the betas take the place of the correlation coefficients because we have to account for the other predictors
- the std multiple regression equation works the same way as the std simple regression equation. Plug in someone’s z score on the X variables to get their predicted z score on the Y variable
- the std betas refer to how many SDs the outcome will change for every SD change in the predictor
- can be extended to include more predictors
std betas help assess relative importance of dif predictors in SD units
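For the simple (one-predictor) case, the standardized relationship Zy = r(Zx) can be sketched with made-up, perfectly correlated data, where the standardized slope equals the correlation.

```python
# Sketch of the standardized simple regression equation Zy = r * Zx; data are illustrative.
import statistics

def z_scores(values):
    m = statistics.mean(values)
    s = statistics.pstdev(values)      # population SD, for simplicity
    return [(v - m) / s for v in values]

x = [1, 2, 3, 4, 5]
y = [2, 4, 6, 8, 10]                   # perfectly linear, so r = 1
zx, zy = z_scores(x), z_scores(y)
r = sum(a * b for a, b in zip(zx, zy)) / len(zx)   # correlation = mean of z-score products
print(r)                                # ≈ 1.0 for perfectly correlated data
```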