Exam 1 Flashcards
Why use models?
To understand the relationships between variable
To predict future outcomes
To quantify differences between groups or treatments
Response variable
the variable that you want to understand/model/predict. aka - y, dependent variable
explanatory variables
the variables you know and think that they are maybe related to the response variable that you want to use to figure out a pattern/model/relationship. aka - x, independent variable, predictor variable, covariates
model
a function that combines explanatory variables mathematically into estimates of the response variable
error
what’s left over; the variability in the response that your model doesn’t capture (error
is somewhat of a misnomer – maybe noise is a better term)
Categorical Data
Two outcomes, not numerical
Quantitative variables
Numerical
Parameter
Describes entire population
Statistic
Describes sample
The four-step process
- Choose
- Fit
- Assess
- Use
Model Notation
Y = f(X) + e
ybar or xbar
averages
yhat
estimate
Y = ? (Simple Linear Regression)
Beta0 + Beta1*X + e
Yhat = ? (Simple Linear Regression)
Beta0 + Beta1*X
Naive Model
Mean + Error
Age = Agebar + e