Simple linear regression Flashcards
What is the equation for simple linear regression?
Y = b0 + b1X + e
What is b0?
The intercept
- the point at which the regression line crosses the Y axis
- The value of Yi when X = 0
(labelled as the constant in SPSS)
What is b1?
The slope/gradient
- a measure of how much Y changes as X changes
- regardless of sign (pos/neg), the larger the value of b1, the steeper the slope
What is e?
Residual/prediction error
- difference between observed value of outcome variable and what the model predicts (e=Yobs - Ypred)
- represents how wrong we are in making the prediction for the particular case
What is the equation for Ypred?
Ypred = b0 + b1X
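A minimal Python sketch of prediction and residual (b0, b1 and the observed case are made-up, illustrative values):

```python
# Prediction from the regression equation: Ypred = b0 + b1*X.
# b0, b1 and the observed case below are made-up illustrative values.
b0 = 2.0   # intercept: predicted Y when X = 0
b1 = 0.5   # slope: change in predicted Y per one-unit change in X

def y_pred(x):
    return b0 + b1 * x

# Residual e = Yobs - Ypred for one observed case
x_obs, y_obs = 4.0, 4.5
e = y_obs - y_pred(x_obs)   # 4.5 - (2.0 + 0.5 * 4.0) = 0.5
```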
What is a regression line?
Line of best fit - line that best represents the data and minimises residuals
What is a prediction?
Best guess at Y given X
X doesn’t have to cause Y or come before Y in time
What values show how well the model fits the observed data? (goodness of fit)
R2
F-ratio
What does the model refer to?
The regression line
What values show how the variables relate to each other?
The Intercept
Beta values (slope)
What is residual sum of squares? (SSR)
Square the residuals and then add them up - a gauge of how well the model (line) fits the data: the smaller the SSR, the better the fit
- can also be thought of as error variance - how much error there is in the model
(Residual/error variance)
What is the equation for total sum of squares (SST)?
SSTotal = SSModel + SSResidual
What is the model sum of squares (SSM)?
Sum of squared differences between Ypred and sample mean - represents improvement from baseline model to regression model
(Model variance)
In any regression model, what is the overall variation of the outcome variable (Y) due to?
- Model/regression - how much variance in the observed Y the predicted values explain. This variance would be measured by the deviations of the predicted values from the sample mean, Y̅.
- Error/residual - how much variance is left over in observed Y after we accounted for the predicted values - measured by deviations of observed values from predicted values
What is the Total sum of squares (SST)?
Total variance in outcome variable - partitioned into model variance and residual/error variance
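The partition can be checked numerically; the sketch below fits a least-squares line to a tiny made-up dataset and computes all three sums of squares (SST = SSM + SSR):

```python
# Partition of total variation: SST = SSM + SSR.
# xs and ys are made-up illustrative values.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 2.9, 4.2, 4.8, 6.0]
n = len(xs)
x_bar, y_bar = sum(xs) / n, sum(ys) / n

# Least-squares estimates of slope and intercept
b1 = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) \
     / sum((x - x_bar) ** 2 for x in xs)
b0 = y_bar - b1 * x_bar
preds = [b0 + b1 * x for x in xs]

ss_total = sum((y - y_bar) ** 2 for y in ys)              # SST: observed vs mean
ss_model = sum((p - y_bar) ** 2 for p in preds)           # SSM: predicted vs mean
ss_resid = sum((y - p) ** 2 for y, p in zip(ys, preds))   # SSR: observed vs predicted
```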
What is the equation for R2?
R2 = SSM/SST
Variance in outcome explained by model / total variance in outcome variable to be explained
What is R2?
- provides proportion of variance accounted for by model
- Value ranges between 0-1 (the higher the value, the better the model)
- interpreted as a percentage when multiplied by 100, eg. R2 = .69 x 100 = 69% of the variance in the outcome variable is explained by the model
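R2 can be computed directly from the sums of squares; a sketch with made-up data:

```python
# R2 = SSM / SST (proportion of variance explained by the model).
# xs and ys are made-up illustrative values.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 2.9, 4.2, 4.8, 6.0]
n = len(xs)
x_bar, y_bar = sum(xs) / n, sum(ys) / n
b1 = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) \
     / sum((x - x_bar) ** 2 for x in xs)
b0 = y_bar - b1 * x_bar
preds = [b0 + b1 * x for x in xs]
ss_total = sum((y - y_bar) ** 2 for y in ys)
ss_model = sum((p - y_bar) ** 2 for p in preds)
r2 = ss_model / ss_total   # between 0 and 1; multiply by 100 for a percentage
```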
What is the equation for the F ratio?
F = MSM / MSR
Model mean squares / residual or error mean squares
What is the equation for model mean squares (MSM)?
MSM = SSM / dfM
What is the equation for residual/error mean squares?
MSR = SSR / dfR
What is the F ratio?
The ratio of explained variance to unexplained variance (error) in the model
- MSM should be larger than MSR (F-statistic greater than 1)
- also called ANOVA - comparing ratio of systematic variance to unsystematic variance
What is dfM?
k - the number of predictors
What is the equation for dfR?
dfR = N-k-1 (N minus the number of estimated coefficients: k slopes plus the intercept)
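Putting the pieces together, a sketch of the F-ratio with made-up data (k = 1 predictor):

```python
# F = MSM / MSR, with MSM = SSM / dfM and MSR = SSR / dfR.
# xs and ys are made-up illustrative values; k = 1 predictor.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 2.9, 4.2, 4.8, 6.0]
n, k = len(xs), 1
x_bar, y_bar = sum(xs) / n, sum(ys) / n
b1 = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) \
     / sum((x - x_bar) ** 2 for x in xs)
b0 = y_bar - b1 * x_bar
preds = [b0 + b1 * x for x in xs]
ss_model = sum((p - y_bar) ** 2 for p in preds)
ss_resid = sum((y - p) ** 2 for y, p in zip(ys, preds))

df_model, df_resid = k, n - k - 1   # dfM = k, dfR = N - k - 1
msm = ss_model / df_model
msr = ss_resid / df_resid
f_ratio = msm / msr                 # well above 1 when the model explains a lot
```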
What are the 2 ways the hypothesis (overall test) in regression can be phrased?
Can the scores on Y be predicted based on the scores on X and the regression line?
- Null hyp: Predicted values of Y are the same regardless of the value of X (or simply, there is no relationship between Y and X).
Does the model (Ypred) explain significant amount of variance in outcome variable (Yobs)?
- Null hyp: Population R2 = 0
- Ratio of model variance to error variance tested using F-test (ANOVA)
OR:
H1: The regression line is a significantly better model than the flat model
H0: The regression line is no better than the flat model (the mean of Y)
What do the coefficients refer to?
The characteristics of the regression line:
- Beta values: the slope of the regression line
- The intercept
What is the unstandardised beta?
The value of the slope (b1)
- the change in the value of y for every one-unit change in x
- expressed in the outcome's original units of measurement
- If b1 is 0, there is no relationship between x and y (flat line - as the predictor variable changes, the predicted value of the outcome is constant and does not change)
- If variable significantly predicts outcome, b value should be different from 0 - tested using a t-test (H0: b = 0) - if test is significant, interpret as supporting that predictor variable contributes significantly to ability to estimate values of outcome.
What is the standardised beta?
- a measure of the slope
The standardised change in y for a one standard deviation change in x - as x increases by one standard deviation, y changes by β standard deviations
In simple regression (1 predictor) what is b1 equal to?
b1 = r(xy)
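This can be verified numerically: if both variables are converted to z-scores first, the fitted slope equals Pearson's r. A sketch with made-up data (population SDs used for standardising):

```python
import math

# Made-up illustrative values
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 2.9, 4.2, 4.8, 6.0]
n = len(xs)

def z_scores(vals):
    mean = sum(vals) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in vals) / n)   # population SD
    return [(v - mean) / sd for v in vals]

zx, zy = z_scores(xs), z_scores(ys)
r = sum(a * b for a, b in zip(zx, zy)) / n                          # Pearson r(xy)
beta = sum(a * b for a, b in zip(zx, zy)) / sum(a * a for a in zx)  # slope on z-scores
# beta equals r: the standardised slope is the correlation
```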
When should you use unstandardised b?
- when you want coefficients to refer to meaningful units
- when you want a regression equation to predict values of Y
When should you use standardised β?
(independent of units)
- when you want an effect size measure eg. small/med/large β is equivalent to small/med/large r (.1/.3/.5)
- when you want to compare the strength of a relationship between predictor and outcome
What is covariance?
The extent to which variables co-vary (change together)
High covariance means there is a large overlap between patterns of change (variance) observed in each variable
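Covariance can be sketched as the averaged product of deviations from each mean (made-up values; n - 1 denominator for a sample):

```python
# Sample covariance: positive when the variables tend to rise and fall together.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]   # made-up illustrative values
ys = [2.1, 2.9, 4.2, 4.8, 6.0]
n = len(xs)
x_bar, y_bar = sum(xs) / n, sum(ys) / n
cov_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / (n - 1)
```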
What should you do before running a regression analysis?
- Detect bias from unusual cases (outliers)
- Check assumptions of linear regression
What are outliers in linear regression?
An observation with a large residual (the difference between observed and predicted values - e = Yobs - Ypred)
- may distort results by pulling regression line away from line of best fit for most people
- a case has the potential to be an influential outlier if its standardised score (Z score) on one or more predictors, or its standardised residual, is in excess of +/-3.29
Why are outliers an issue in regression?
They influence the model’s ability to predict all cases
What does how influential an outlier is depend on?
Distance between Yobs and Ypred (residual) - the larger the distance, the weaker the prediction
Leverage (unusual value on a predictor) - large leverage can either weaken or strengthen prediction depending on where the case lies relative to the trend - on trend = strengthens results.
Large leverage + large distance -> negative impact of pulling or tilting the regression line away from the LOBF
What is the minimum value that makes standardised residuals or predictors potential influential outliers?
+/-3.29 (p<.001)
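A sketch of flagging cases by that cut-off: standardise the residuals and flag |z| > 3.29 (residuals below are made-up, with one deliberately extreme case):

```python
import math

# Made-up residuals: 40 small ones plus one deliberately extreme case
residuals = [0.1, -0.1] * 20 + [5.0]
n = len(residuals)
mean = sum(residuals) / n
sd = math.sqrt(sum((r - mean) ** 2 for r in residuals) / n)

z = [(r - mean) / sd for r in residuals]                   # standardised residuals
flagged = [i for i, zi in enumerate(z) if abs(zi) > 3.29]  # potential outliers
```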
How can outliers be dealt with?
- check data were entered and coded correctly - can justifiably remove outliers that are due to errors in data entry or failure to follow the procedure (eg. reaction times that are impossibly short or long)
Outliers CAN represent genuine data - for every 100 ppts, expect about 1 score beyond +/-2.58 SD (and only about 1 in 1000 beyond +/-3.29)
What are the 4 assumptions of linear regression?
- Linearity
- Independence
- Normality of residuals
- Homogeneity of variance (homoscedasticity)
What is linearity?
The outcome (continuous variable) is linearly related to predictors
What is the independence assumption?
Observations are randomly and independently chosen from population - residuals are not related to each other.
Residuals not independent in cases such as:
- repeated obs on same ppt
- obs from related ppts (twins, students in same class)
If this assumption is violated, model standard errors (SEs) will be invalid, as will confidence intervals (CIs) and sig tests based on them.
- ensure INDEPENDENT sampling in design
What is the normality of residuals assumption?
Residuals (not IVs or DVs) should be normally distributed
- check using histogram and normal probability plot
want observed and expected frequencies to be very similar - 45 degree straight line.
- in small samples, a lack of normality invalidates confidence intervals and significance tests BUT in large samples, it will not, due to the central limit theorem.
What is the homoscedasticity assumption (homogeneity of variance)?
The variability of residuals should be the same for all values of Ypred.
Violating this assumption invalidates confidence intervals and significance tests
Check using residual scatterplot - should be NO funnelling of residuals.
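As a rough numeric stand-in for eyeballing the scatterplot, the sketch below compares residual spread at low vs high predicted values (made-up numbers; the 2x threshold is an arbitrary illustration, not a formal test):

```python
import statistics

# Made-up predicted values and residuals whose spread grows with Ypred
preds = [1, 2, 3, 4, 5, 6, 7, 8]
resids = [0.1, -0.1, 0.2, -0.2, 1.0, -1.2, 1.5, -1.8]

half = len(preds) // 2
low_spread = statistics.pstdev(resids[:half])    # spread for small Ypred
high_spread = statistics.pstdev(resids[half:])   # spread for large Ypred
funnelling = high_spread > 2 * low_spread        # crude heteroscedasticity flag
```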