quiz 5 Flashcards
heart disease description
- is the leading cause of death in the US
- risks vary by sex, age, and ethnicity
- linked to physical activity levels and diet
when do you use a bivariate linear regression?
- when you have 2 variables
- should fit a generally linear relationship
bivariate linear regression
- explains the sample by describing a line
- model is built by parameterizing, or calculating m and b
what are some ways to model relationships between variables
- measure samples to understand population
- build model to explain relationship for sample
- use model to predict or extrapolate the values of untested individuals
what are two linear regression techniques?
- bivariate
- multiple linear regression
bivariate
- one predictor
- BMI
- used together to predict variation in a “response”
multiple linear regression
- more than one predictor
- Age, BMI, LDL-C
what are the 3 questions to ask about a regression model?
- What is the calculated model?
- How much variation in the data does it explain? (R square value)
- does it explain a significant amount of the variation? (F&p-value)
more parameters=
model is better at predicting the noise in your data
what does each parameter have associated with it?
error
Real world dirty approach
includes only significant predictors
real world good approach
Akaike Information Criterion (AIC)
- runs from 1 to -1
-1=did something wrong
1= great
0= not as efficient
regression
- a model building technique that uses sample to build a mathematical model
- linear regression produces a model 0of a straight line
R^2
a measure of model fit
- should use adjusted form if you have more than one predictor