Lecture 4 Flashcards
There are many ways to communicate the same thing
like 75
can be 75
or seventy five
or IIIIIIIIIIIIIII(75 sticks)
mark
basic geometric element that depict items or links
channel
Control the appearance of marks
Mark types in table datasets
mark represent an item
Mark types in network datasets
represent either an item (node) or a link that represents a relationship between items
A connection mark shows a pairwise relationship between two items using a line
true
A containment mark shows hierarchical relationships using areas (nested connection marks within each other at multiple levels)
True
Bootstrap presents a way to resample your sample over and overhowever it does it with replacement unlike cross validation
true
If n is way larger than predictors, your model will perform well
True
If I want to reduce the number of predictors I can use subset selections, one of these is
True
We have 3 ways for subset selection
1- Best subset selection
2- Forward subset selection
3- Backward subset selection
Best Subset Regression (selection):
Minimize RSS ( square distance between what we predict and the values that we have in the data)
Maximize adjRsquare( how much proportion the predictors explain the variance in the response)
For best subset selection we need leaps
TRUE
adjusted r-square
Adjusting R-square by the number of predictors
Lasso approach
It will shrink coefficients and it will do feature selection