Things i should know Flashcards
What is the curse of dimensionality wrt supervised learning
The number of training examples needed to correctly learn increases exponentially as we increase the dimensionality of our inputs.
Advantages of hierarchical clustering over K means and visa versa
- don’t need to know clusters in advance
- easy to understand
- Faster
- Easier to implement
What are semantically different hypotheses?
All permutations of hypotheses being where each element can be x+1 things. x being its possible values. +1 being ?. We also add 1 to represent the specific hypothesis. i.e (x+1)(y+1)…+1
What are syntactically different hypotheses?
All permutations of hypotheses being where each element can be x+2 things. x being possible instance values. +2 being ? and ø.
Is a rote learner biased?
No. It is trained on the set of arbitrary disjunct hypotheses and is therefore not biased. This does mean that it has overfit the training data and will not be able to generalise
What fundamentally classifies a markov decision process?
Past actions do not affect the transition function. The markov property is when the transition function is preserved throughout the entire process.