Lecture 30- Regression Flashcards
What two variables do we talk about in linear regression?
- Explanatory variable (X), also known as covariate, predictor, or independent variable.
- Outcome variable (Y ), also known as response or dependent variable.
How does the nature of the explanatory variable (x) effect how we compare two groups (the x and y)?
- In studies with outcomes on continuous scale we compare means in the two groups
- In studies with binary outcomes, we compare the two groups using odds ratios or relative risks.
What are the different types of regression and what are they used for?
Regression in general looks at the relationship between two variables (1 variable= the x explanatory variable, 1 variable= y outcome variable)
Linear regression - used where the outcome is continuous
Logistic regression - used where the outcome is binary
What tool allows us to visually see a regression/ relationship?
A scatterplot, explanatory variable is on the x axis and the outcome variable is on the y axis
What are the two purposes of simple linear regression?
-To describe the relationship between two variables and test whether
changes in a continuous outcome measure may be linked to changes
in the explanatory variable
-To enable the prediction of the value of the outcome measure given
the value of the explanatory variable.
How do we describe a straight line in a pure mathematical sense?
y=mx+c
How do we describe a straight line/ linear regression in stats language? What is the ‘extra’ thing we have to account for?
Y = β0 + β1x + e
β0= constant/ y intercept β1= m/ gradient e= random error/ the residual term
When carrying out statistical modeling what do we assume about the standard error and so what does the linear regression equation become? What is this telling us?
It is assumed that the error terms have zero mean. So the regression model loses the e and becomes…
µY = β0 + β1x.
This is known as the conditional mean µY |x
When you know the true values of β0 and β1 this equation describes how the mean
response changes with x at a population level.
Do problems on slide 571
answers on slide
What is β1 interpretable as?
What is β0 interpretable as?
- β1 interpretable as change in mean response when x increases by one unit.
- β0 is mean response when x = 0 (as is the y intercept), but may make no sense in many examples.
When we are considering e what does it tell us and what do we assume?
-Error term e describes how an individual’s response differs from the mean
of all individuals in population with the same value of x.
-It is usual to assume that the variation in response within any given sub-population (described by x) is normally distributed.
-In other words, we assume that e is a N(0, σ2e) random variable.
-Variance term σ2e describes magnitude of variation in sub-population
In a practical sense what is e equal to?
particular value of Y measured - mean
Answer the true and false questions on slide 575
Answers on slide