Week 5 Flashcards

Question 1

Q

What is Likelihood?

Answer

A

Probability of observing data given a particular model.

Question 2

Q

What are the likelihood functions for discrete/continuous for Xis?

Answer

A

Where vector X is a sample from a distribution with parameter theta:

Discrete = PX(x;theta)

Continuous = fX(theta | x)

Question 3

Q

What is a Maximum Likelihood Estimate?

Answer

A

A MLE of theta is a value of theta that maximises the likelihood function.

Question 4

Q

What is a Maximum Likelihood Estimate?

Answer

A

A MLE of theta is a value of theta that maximises the likelihood function.

Question 5

Q

What are two possible options to find the maximum likelihood?

Answer

A

) Search - Exhaustive(low dimensional) or Grid.

2. ) Optimization Algorithms

Question 6

Q

What is a Cost Function?

Answer

A

Maps a set of events into a number that represents the “cost” of the event occuring.

Also know as the loss or objective function.

Question 7

Q

What is the cost function for likelihood, and why is it used?

Answer

A

J(theta, D) = -log(L(theta, D))

Convention: many optimization problems are minimization.
Convenience.
Numerically Stable: Product of theta will quickly converge to zero.

Question 8

Q

What are optmization problems and their procedure?

Answer

A

Finding the best solution for the feasible ones.

Construct a model.
Determine the problem type.
Select algorithm.

Question 9

Q

What is the difference between supervised and unsupervised machine learning?

Answer

A

Supervised: Given some training data, want to train a model to explain some data.

Unsupervised: Given some unlabelled samples, want to divide into multiple groups.

Question 10

Q

What is Gradient Descent?

Answer

A

A first-order iterative algorithm for finding a local minimum of a differentiable cost function.

Employ negative gradient at each step to decrease cost function.

Two ingredients - direction and magnitude (step size).

Question 11

Q

What is Classification?

Answer

A

Determining the most likely class that an input pattern belongs to.

Question 12

Q

What is Logistic Regression?

Answer

A

Regression model where dependent variable is categorical.

Goal is to predict the probability that a given example belongs to “1” class versus the probability it belongs to the “0” class.

Also known as logit regression.

Question 13

Q

How does Logistic Regression work?

Answer

A

Use logarithm of the odds to model the binary prediction as a linear combination of independent variables.

Then use logistic function to convert log-odds to probability.