Machine Learning Flashcards
input/output table
represents relationship between inputs (features) and output (target values)
what is the goal of machine learning
find underlying function from the data that maps inputs to outputs
parameterised hypothesis
model tat is defined by parameters and the learning process is about adjusting these to fit the data
hypothesis vs paramterized hypothesis
hypothesis is the proposed function and parameterised is the model that includes parameters adjusted during learning
3 types of learning
reinforcement
supervised
unsupervised
supervised learning and example
agent learns a function given labelled data inputs and outputs
parameterised model that maps inputs to outputs is chosen and the learning is to find correct parameters that produce correct output
spam/ham
unsupervised learning and example
agent is given unlabelled data and has to identify patterns in data
parameterised model is chosen and learning algiorhtm works out parameter values that organises data
used for understanding data
weather patterns
reinforcement learning and example
agent learns by interacting with the environment and is given positive/negative feedback based on its actions
learning algorithm wants to chose parameter values that maximise rewards
gets reward and looks back at past actions to see what led to the rewards
AI game playing raj
inductive learning
system tries to induce a general rule from a set of observed instances
exam question given in class and put same in exam can’t tell if have just memorised rope learnt or understands
hypothesis consistency
consistent if correctly classifies all training examples
generalisation
ability of model to adapt to new unseen data not just on training
best models are those that can adapt on unseen data (generalise)
overtraining
performs well on training data but poorly on new data
overfitting
model becomes too attuned to thee data now which it was trained and loses applicability to any other dataset
Occam’s razor
heuristic suggesting choosing simpler ML models as they are expected to generalise better
discriminative model
model that attempts to model the agent function directly