Week1 Flashcards
What is the definition of machine learning?
ML is the study of computer algorithms that improve automatically through experience and use of data.
How does machine learning task hierarchy is?
Humans make a tool design which uses data and produces a tool which will take the input and give a output.
In ML, Humans design learning algorithm which uses data to produce models
Why ML?
- Programming /Human Labour Fails
a. Scales / Speed / Cost of human labour
b. Inability to express rules using language
c. Don’t know the exact rules transforming input to
output - ML can succeed
a. Have lots of example data
b. Have some structural ideas on data
Where is ML used exclusively now?
- Classifier - Mail SPAMs
- Shopping cart recommendations
3, ML in Smart Assistants - ML in Robot AIs
- ML in Games
What is Data?
Data is collection of vectors
Metadata is information on the data
What is a model?
A model is the mathematical simplification of reality
What are the types of models in ML
- Predictive Model
a. Regression Model: predict a real valued continuous
output
b. Classification model: predict non-real valued discrete
output - Probabilistic Model - scores different likelihoods of reality
What are learning algorithms?
Learning algorithms convert data into models.
choose from a collection of models with same structure but different parameters. use the one with the optimal parameter value to infer.
What are supervised learning?
In simplified terms, supervised learning is curve-fitting. Goal is given {(x1,y1),(x2,y2),…. (x^n,y^n) are vectors , find a model f such that f(x^i) is close to y^i
There will be training data which will be used to create the model and there will be test data which will be used to evaluate and adjust performance.
Supervised learning tasks
1. Regression
2. Classification
Notation: How is the third coordinate of 8th vector denoted?
x sub script 3 superscript 8
Notation: what are indicator functions?
Indicator functions are boolean functions which take a predicate as the input and gives True or False
e.g. 1(2 is even) =1 , 1(2 is odd)=0 , where 1(predicate) is the indicator function.
What is the loss function of a regression problem?
loss = (1/n) * Summation (f(x^i) - y^i)^2
Squared sum of error (SSE)
Loss denotes the difference between the values in prediction and reality as a single valued metric.
What is the generic regression model function?
f(x) = w^Tx + b , where W^T is the weight vector and x is the the data matrix.
What is a classification problem?
Output is not real valued like regression but are of form ‘Yes ‘ or ‘No’ , ‘True’ or ‘False’, +1 or -1
What is the loss function of a classification problem?
Loss here is the fraction of misclassified instances.
loss = (1/n) Summation (i=1 to n) 1(f(x^i) not equal to y^i)