Data Analysis week 7 Flashcards
What is the residual e_i defined as
The difference between the datapoint y_i and the predicted value. So y_i = alpha * x_i + beta + e_i
What is the residual sum of squares
The residual sum of squares (RSS) is e_1^2 + e_2^2 + … + e_n^2. It measures how badly the model explains the data (in its loss function).
What is the relation between alpha and beta and the RSS
Computing alpha and beta with the covariance results in the regression model with the smallest possible RSS.
What is a loss function
Functions that show how badly a model explains data.
What is another example of a loss function
The mean absolute error
What is so special about RSS as a loss function
The values for alpha and beta that minimize RSS can be mathematically derived, while for other loss functions, you need a computer for this. And RSS can be motivated by viewing linear regression as a statistic model.
What other role can the normal distribution play
The normal distribution can be an approximation of the residuals.
What do likelihood functions tell you
Likelihood functions tell you how good your model is. Likelihood is high when your model is good.