Ch 2. Intro to Regression Models Flashcards
In the expression Sales ≈ f(TV, Radio, Newspaper), "Sales" is the: Response Training Data Independent Variable Feature
Response
Explanation:
The variable which you are trying to model is called the response or outcome. The other variables are called features, predictors, or independent variables. Together, the collection of features and response values that you will use for fitting form your training data.
A hypercube with side length 1 in d dimensions is defined to be the set of points (x1, x2, …, xd) such that for all j = 1, 2, …, d. The boundary of the hypercube is defined to be the set of all points such that there exists a j for which or (namely, the boundary is the set of all points that have at least one dimension in the most extreme 10% of possible values). What proportion of the points in a hypercube of dimension 50 are in the boundary? (hint: you may want to calculate the volume of the non-boundary region)
Please give your answer as a value between 0 and 1 with 3 significant digits. If you think the answer is 50.52%, you should say 0.505:
0.995
Explanation:
We know that the volume of the whole hypercube is 150 = 1. The volume of the interior of the hypercube is 0.950 = 0.005. Thus, the volume of the boundary is 1-0.005 = 0.995.
True or False: A fitted model with more predictors will necessarily have a lower Training Set Error than a model with fewer predictors.
False.
Explanation:
False. While we typically expect a model with more predictors to have lower Training Set Error, it is not necessarily the case. An extreme counterexample would be a case where you have a model with one predictor that is always equal to the response, compared to a model with many predictors that are random.
While doing a homework assignment, you fit a Linear Model to your data set. You are thinking about changing the Linear Model to a Quadratic one. Which of the following is most likely true:
- Using the Quadratic Model will decrease your Irreducible Error.
- Using the Quadratic Model will decrease the Bias of your model.
- Using the Quadratic Model will decrease the Variance of your model
- Using the Quadratic Model will decrease your Reducible Error
correct
- Using the Quadratic Model will decrease the Bias of your model.
Explanation
Introducing the quadratic term will make your model more complicated. More complicated models typically have lower bias at the cost of higher variance. This has an unclear effect on Reducible Error (could go up or down) and no effect on Irreducible Error.
Look at the graph given on page 30 of the Chapter 2 lecture slides. Which of the following is most likely true of what would happen to the Test Error curve as we move 1/K further above 1?
- The Test Errors will increase
- The Test Errors will decrease
- Not enough information is given to decide
- It does not make sense to have 1/K > 1
- It does not make sense to have 1/K > 1
Explanation:
Since K is the number of neighbors, the value of K must be a Natural Number. This means that 1/K <= 1.
You are doing an analysis in R and need to use the ‘summary()’ function, but you are not exactly sure how it works. Which of the following commands should you run? (There is more than one correct answer, so any one these will earn the point).
- help(summary)
- ?summary
- man(summary)
- ?summary()
- help(summary)
- ?summary
- ?summary()
Explanation:
Any of the above commands work except for ‘man(summary)’. Make sure you always read the documentation so you know what functions do when you use them!
Indicate whether we would generally expect the performance of a flexible statistical learning method to be better or worse than an inflexible model when:
The sample size (n) is extremely large, and the number of predictors (p) is small:
Flexible is better
Explanation:
A flexible model will allow us to take full advantage of our large sample size.
The number of predictors (p) is extremely large, and the sample size (n) is small:
Flexible is worse
Explanation:
The flexible model will cause overfitting due to our small sample size.
The relationship between the predictors and response is highly non-linear:
Flexible is better
Explanation:
A flexible model will be necessary to find the nonlinear effect.
The variance of the error terms, i.e. sigma^2 = Var(error), is extremely high:
Flexible is worse
Explanation:
A flexible model will cause us to fit too much of the noise in the problem.