Machine Learning Flashcards

Question 1

Q

What is Machine Learning? (ML)

Answer

A

The study of computer algorithms that improve automatically through experience and the use of data. It’s part of AI

Question 2

Q

How does ML work?

Answer

A

Machine learning algorithms build a model based on sample data, known as “training data”, in order to make
predictions or decisions without being explicitly programmed to do so.

Question 3

Q

How do you formalize ML?

Answer

A

ML can be described as a function Y=H(X) where the goal is to find the most simple H which predicts Y using X as input for a given prediction accuracy

Question 4

Q

What do you call the performance of H in matching Y using X?

Answer

A

The Objective function

Question 5

Q

How do you find the objective function?

Answer

A

Obj(H) = L(H) + omega(H)

where L is the matching error
and Omega is the regularization term/complexity of H

Question 6

Q

What does ML consist of in terms of the objective function?

Answer

A

Minimizing the Obj(H) as the best potential compromise between prediction accuracy and complexity

Question 7

Q

What are the main categories of Machine Learning?

Answer

A

Supervised: classification & regression
Unsupervised: clustering, association & dimension reduction (generalization)

Question 8

Q

What is the difference between supervised and unsupervised ML?

Answer

A

Supervised: data is pre-categorized

Unsupervised: data is not labeled

Question 9

Q

What are the main ML application/tasks?

Answer

A

Forecasting and classification

Question 10

Q

What are the main categories of ML engines

Answer

A

-Linear/non-linear regressions
-Random forests and boosted trees
-Deep learning and neural networks

Question 11

Q

What is a linear regression?

Answer

A

You model the relationship between two variables Y and X where X explains Y such that:

Y= aX+b

where a=Cov(Y,X)/Var(X)
and B=E(Y)-aE(X)

(remember Y is what you want to predict and X is the explanatory variable)

Question 12

Q

What do you need for the regression to be complete?

Answer

A

The mean of the residue should be normally distributed with a mean of 0

Question 13

Q

What are the steps in training AI predictive models?

Answer

A

Building the model
Training the model on sample data
Testing the model on different sample data

Question 14

Q

What is one of the main challenges in training ML algorithms?

Answer

A

Avoiding overfitting so that it only works on the training data sample

Question 15

Q

How do you avoid overfitting?

Answer

A

You keep the model as simple as possible (few parameters)

Question 16

Q

What is the trade-off in training a predictive engine?

Answer

Study These Flashcards

A

Between testing error and model complexity

Question 17

Q

What is a decision tree?

Answer

Study These Flashcards

A

Tool that uses that uses a tree-like model of decisions and their
possible consequences, including chance event outcomes, resource costs, and utility. It is one
way to display an algorithm that only contains conditional control statements.

Question 18

Q

What are the risks inherent to AI?

Answer

Study These Flashcards

A

For Data: Biased samples, correlation is not causality, lacking features, changes in patterns

For Algorithm: lack of explainability, overfitting, design flaws, lack of contextual sensitivity and common sense

Machine Learning Flashcards

(18 cards)