SRM Chapter 1 Flashcards
1
Q
Response/Output/Dependent Variable
A
- Y
- Variable we want to predict using other (explanatory) variables
2
Q
Explanatory/Input/Independent Variable
A
- Xj’s
- Variables we use to predict the dependent variable
- We want to study the relationship between these and the dependent variable
- Aka predictor, feature
3
Q
Count Variable
A
- Quantitative
- Variable that takes on non-negative integers (discrete)
4
Q
Continuous Variable
A
- Quantitative
- Takes on continuous values within an interval
5
Q
Categorical Variable
A
- Qualitative
- Takes on different categories
- Categories are aka classes or levels
- Each category is given a number
6
Q
Nominal Variable
A
- Categorical variable that has no logical order
- The numbers don’t have any meaning, they just differentiate/label the categories
- e.g. seasons numbered 1-4 alphabetically (the numbers only label the categories; they say nothing about the seasons themselves)
7
Q
Ordinal Variable
A
- Categorical variable that has a logical order
- Numbers used to label the categories have meaning/there is an order
- e.g. seasons numbered 1-4 in order of the calendar year (there is a meaning to the order)
8
Q
Notation: j
A
- Denotes the specific predictor (xj), if there is more than one
- Up to p predictors
9
Q
Notation: i
A
- For a predictor xj, denotes the specific observation of that predictor
- Up to n observations
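
Combining the two notation cards above: each data value x_ij is observation i of predictor j, giving an n-by-p layout. A small LaTeX sketch of that convention (notation only; requires amsmath):

```latex
% x_{ij}: observation i (row) of predictor j (column)
X = \begin{pmatrix}
  x_{11} & x_{12} & \cdots & x_{1p} \\
  x_{21} & x_{22} & \cdots & x_{2p} \\
  \vdots & \vdots & \ddots & \vdots \\
  x_{n1} & x_{n2} & \cdots & x_{np}
\end{pmatrix}
```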
10
Q
Supervised Learning
A
- We have a y (dependent variable)
- Focus on predicting y based on x’s (predictors)
11
Q
Unsupervised Learning
A
- No y (dependent variable)
- Focus is not on predicting but on finding and explaining patterns/relationships between the x’s and across observations
12
Q
Regression Problem
A
- Y is quantitative
13
Q
Classification Problem
A
- Y is qualitative (categorical)
14
Q
Parametric
A
- A functional form of f is specified
- i.e. the relationship between the x’s (predictors) and y (the dependent variable) can be expressed as a function
15
Q
Non-Parametric
A
- There is no specified functional form of f
- i.e. there is no function that describes the relationship between the x’s and y
- f-hat is algorithmic rather than functional because there are no parameters to estimate
- Need a lot of observations
16
Q
Supervised Models (13)
A
- SLR (Simple Linear Regression)
- MLR (Multiple Linear Regression)
- GLM (Generalized Linear Model)
- Ridge
- Lasso
- Weighted Least Squares
- Partial Least Squares
- KNN (K-Nearest Neighbours)
- Decision Trees
- Bagging
- Random Forest
- Boosting
- PCR (Principal Components Regression)
17
Q
Unsupervised Models (2)
A
- Cluster Analysis
- PCA (Principal Components Analysis)
18
Q
Parametric Models (8)
A
- SLR (Simple Linear Regression)
- MLR (Multiple Linear Regression)
- GLM (Generalized Linear Model)
- Ridge
- Lasso
- Weighted Least Squares
- Partial Least Squares
- PCR (Principal Components Regression)
19
Q
Non-Parametric Models (5)
A
- KNN (K-Nearest Neighbours)
- Decision Trees
- Bagging
- Random Forest
- Boosting
20
Q
Training Data
A
- Data (observations) used to train/formulate f-hat
21
Q
f vs f-hat
A
For the relationship between y (dependent variable) and x’s (its predictors):
- f is the true function (the actual relationship, which we don’t necessarily know)
- f-hat is our estimation of this function
22
Q
e
A
- Error term variable
- Expected value of 0
23
Q
Two components of an observation of the response variable
A
- Systematic - expected value of the response variable (our function f)
- Random - error term
- Aka signal plus noise
24
Q
Signal Plus Noise
A
- Each observation of Y is made up of two parts:
- Systematic (our function f)
- Random (error term e)
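
Written as an equation, the signal-plus-noise idea from cards 23-24 (using the error term e with expected value 0, as in card 22):

```latex
% systematic part (signal) + random part (noise)
y = f(x_1, \ldots, x_p) + e, \qquad E[e] = 0
```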
25
Q
Bayes Classifier
A
- Best decision function
- When the Bayes Classifier is used, the test error rate is minimized
26
Q
Decision Function
A
- Function (f) for classification problems that decides which category Y (dependent variable) belongs to
27
Q
Objectives of supervised learning (2)
A
1. Prediction - predicting values of y based on x's
2. Inference - understanding the impact of changes in x's on the value of y
28
Q
Flexibility
A
- Describes how closely f-hat can follow the data
- Related to prediction (a more flexible f-hat can follow the training data more closely)
- Rougher fit = more flexible f-hat
- Smoother fit = less flexible f-hat
29
Q
Interpretability
A
- Ability to understand what the model is doing (components, parameters)
- Related to inference (easier to explain the specifics in the relationship between x's and y if we understand what the model is doing)
30
Q
Flexibility: Rougher Fit
A
- More flexible f-hat
- Often more parameters
31
Q
Flexibility: Smoother Fit
A
- Less flexible f-hat
- Often fewer parameters (simpler function)
32
Q
Flexibility vs Interpretability
A
- Inverse relationship
- As flexibility increases, we can make more accurate predictions on the training data, but more parameters mean the model might be harder to understand/interpret
33
Q
Flexibility vs Accuracy
A
- More flexibility doesn't always mean more accurate predictions *in general*
- It means more accurate predictions *on the training data* only
34
Q
MSE
A
- Mean squared error
- Measures error in regression models
- We want this number to be small (smaller MSE means more accurate)
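
As a quick illustration of the formula behind this card, a minimal Python sketch of MSE = (1/n) * sum((y_i - y_hat_i)^2); the y and y_hat values are hypothetical:

```python
import numpy as np

def mse(y, y_hat):
    """Mean squared error: average squared difference between
    observed values y and predictions y_hat (smaller = more accurate)."""
    y, y_hat = np.asarray(y), np.asarray(y_hat)
    return np.mean((y - y_hat) ** 2)

# Hypothetical observed and predicted values
print(mse([3.0, 5.0, 7.5], [2.5, 5.5, 7.0]))  # -> 0.25
```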
35
Q
Training MSE vs Flexibility of f-hat
A
- Inverse relationship
- Training MSE decreases as flexibility of f-hat increases
36
Q
Overfitting
A
- Happens when f-hat fits the training data too closely
- Won't carry over well to new data (test data), so predictions on the test data won't be as accurate
- Often happens when f-hat is too flexible (modelled too closely to the training data)
- Too rough a fit, too flexible
37
Q
Underfitting
A
- f-hat is not flexible enough to capture the relationships between y and the x's
- Too smooth a fit, not flexible enough
38
Q
Training vs Test MSE
A
- Training MSE is not always a good indicator of model accuracy because minimizing the training MSE only means that accuracy is maximized on the training data, not the testing data.
- So, test MSE is a better indicator of model accuracy
39
Q
Training MSE
A
- Mean squared error based on the training data
- Goes down as flexibility increases
- Not the best indicator of model accuracy because based only on the training data
40
Q
Test MSE
A
- Mean squared error based on the test data (observations not used to train f-hat)
- This makes it a better indicator of model accuracy
- U-shaped as flexibility increases
- Not flexible enough means that it's too smooth of a fit (underfitting); the relationship between x's and y is not captured enough
- Too flexible means that it's too rough of a fit (overfitting); f-hat is too closely fitted to the training data but on the test data accuracy declines
- So the best test MSE is usually produced by a moderately flexible model
41
Q
Bias-Variance Tradeoff
A
- We want both variance and bias to be low
- Increasing flexibility increases variance though it decreases bias.
- Decreasing flexibility decreases variance, but it increases bias.
42
Q
Irreducible error
A
- Variance in y (dependent variable) that can't be explained by any model, even the true f
- Equal to Var(e), the variance of the error term
43
Q
Reducible error
A
- Var(f-hat) + (Bias(f-hat))^2
- The variance in y that can be reduced by choosing the best model
- Want to balance: want low variance and low bias though there is a tradeoff between the two
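
Cards 42-43 combine into the standard decomposition of expected test MSE at a new point x_0 (reducible plus irreducible error; LaTeX, requires amsmath):

```latex
% expected test MSE = reducible error + irreducible error
E\big[(y_0 - \hat{f}(x_0))^2\big]
  = \underbrace{\operatorname{Var}(\hat{f}(x_0))
      + [\operatorname{Bias}(\hat{f}(x_0))]^2}_{\text{reducible}}
  + \underbrace{\operatorname{Var}(e)}_{\text{irreducible}}
```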
44
Q
Variance
A
- How f-hat changes when different training data is used
- Want this to be low (little variability between sets of training data)
- Bigger variance means f-hat changes more depending on the training data used
45
Q
Bias
A
- How close f-hat is to the actual shape of f
- Want this to be low (close)
46
Q
Flexibility-Variance-Squared Bias Relationship
A
- F low - V low - B high
- As flexibility decreases, variance also decreases but bias increases (underfitting)
- F high - V high - B low
- As flexibility increases, variance also increases but bias decreases
47
Q
Flexibility-Variance Relationship
A
- As flexibility increases, so does variance
- Because as flexibility increases, the model gets more specifically fit to that particular set of training data, so there is more variance in the shape of f-hat when using different training data.
48
Q
Flexibility-Bias Relationship
A
- As flexibility increases, bias decreases
- By increasing flexibility we are able to get f-hat closer to the actual shape of f, which means squared bias decreases.
- Bias grows when f-hat is not flexible enough (too simple) to capture the patterns and shape of f (underfitting).
49
Q
Test Error Rate
A
- Measure for classification model error
- Uses I (indicator function): 1 if the prediction is incorrect, 0 otherwise
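
A minimal Python sketch of the test error rate as an average of the indicator I(y_i != y_hat_i); the category labels are hypothetical:

```python
import numpy as np

def test_error_rate(y, y_hat):
    """Proportion misclassified: the indicator is 1 when a
    prediction is wrong and 0 when it is correct."""
    return np.mean(np.asarray(y) != np.asarray(y_hat))

# Hypothetical true vs. predicted categories (one of four is wrong)
y = ["spring", "summer", "fall", "winter"]
y_hat = ["spring", "summer", "winter", "winter"]
print(test_error_rate(y, y_hat))  # -> 0.25
```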
50
Q
Bayes Error Rate
A
- Using the Bayes classifier in place of y-hat in the test error rate indicator function
- When this is used, the test error rate is at a minimum and the Bayes classifier is the best decision function.
51
Q
k-Nearest Neighbours Steps
A
1. Find the location of the observation in the domain of X1,...,Xp. This is the centre.
2. Identify the k nearest training observations to the centre.
3. The most frequent category of the k training observations is the prediction y-hat.
52
Q
Distance used for k-Nearest Neighbours Method
A
- Euclidean distance
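
A bare-bones Python sketch of the three kNN steps from card 51, using the Euclidean distance from card 52; the training data is hypothetical:

```python
import numpy as np
from collections import Counter

def knn_classify(X_train, y_train, x0, k):
    """k-nearest neighbours classification:
    1. treat x0 as the centre of the neighbourhood,
    2. find the k training observations closest to it (Euclidean distance),
    3. predict the most frequent category among those k neighbours."""
    dists = np.sqrt(((X_train - x0) ** 2).sum(axis=1))  # distance to each row
    nearest = np.argsort(dists)[:k]                     # indices of k closest rows
    return Counter(y_train[i] for i in nearest).most_common(1)[0][0]

# Hypothetical training data: two predictors, two categories
X_train = np.array([[1.0, 1.0], [1.2, 0.8], [4.0, 4.2], [3.8, 4.0]])
y_train = ["A", "A", "B", "B"]
print(knn_classify(X_train, y_train, np.array([1.1, 0.9]), k=3))  # -> "A"
```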
53
Q
k-Nearest Neighbours: Size of k
A
- k too large: observations are too far away from the centre of the neighbourhood so predictions are too general.
- k too small: observations are unstable/volatile (dependent on a small few).
- Want a middle-sized k because of the bias-variance tradeoff.
54
Q
k-Nearest Neighbours: k vs. Flexibility Relationship
A
- k is inversely related to flexibility.
- A small k means y-hat is very dependent on a small number of observations, so flexibility is high (very tailored to those few observations).
- A large k means y-hat is very generalized, so flexibility is low.
55
Q
Smooth fit = ? flexibility
A
- Less flexibility
56
Q
Rough fit = ? flexibility
A
- More flexibility