PAC Flashcards

1
Q

What does PAC stand for?

A

Probably Approximately Correct Learning

2
Q

What is Sample Complexity?

A

The number of sample points needed to achieve an approximately correct solution. It characterizes which concept classes are learnable from a reasonable amount of data.
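As a concrete illustration (a standard bound, not stated on this card): for a finite hypothesis set H and a consistent learner, m >= (1/eps) * (ln|H| + ln(1/delta)) samples suffice. A minimal Python sketch of that bound:

import math

def sample_complexity(h_size, eps, delta):
    # Samples sufficient for a consistent learner over a finite
    # hypothesis set H to be eps-accurate with probability >= 1 - delta.
    return math.ceil((math.log(h_size) + math.log(1 / delta)) / eps)

print(sample_complexity(1000, 0.05, 0.01))  # 231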

3
Q

What is “X” or Input Space?

A

The set of all possible examples or instances.

4
Q

What is “Y”?

A

The set of all possible labels or target values.

5
Q

What is Concept?

A

A concept c is a mapping from X to Y, written c: X -> Y. For example, the indicator of all points which fall inside a triangle.

6
Q

What is Concept Class?

A

Denoted as C. It is the set of concepts we may wish to learn, e.g. the set of all axis-aligned rectangles.

7
Q

What is a Hypothesis Set?

A

Denoted as H. It is a fixed set of possible concepts which may not coincide with C.

8
Q

What is Generalization Error?

A

The generalization error of a hypothesis h, also known as the true error or risk, is denoted R(h):
R(h) = Pr_{x~D}[h(x) != c(x)] = E_{x~D}[1_{h(x) != c(x)}]
The generalization error is not directly accessible to the learner, since both the distribution D and the target concept c are unknown. It is the expected error of h under the distribution D, and the expectation of the empirical error equals it: E_S[R^(h)] = R(h).

9
Q

What is Empirical Error?

A

R^(h) = (1/m) * SUM_{i=1..m} 1_{h(x_i) != c(x_i)}

Thus the empirical error is the average error of h over the sample S = (x_1, ..., x_m).
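A minimal Python sketch of this formula; the sample S, hypothesis h, and concept c below are hypothetical placeholders:

def empirical_error(h, c, S):
    # Fraction of sample points where the hypothesis h disagrees
    # with the target concept c.
    return sum(1 for x in S if h(x) != c(x)) / len(S)

S = [1, 2, 3, 4]
c = lambda x: x >= 2   # target concept
h = lambda x: x >= 4   # hypothesis; disagrees with c on 2 and 3
print(empirical_error(h, c, S))  # 0.5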

10
Q

What is the definition of PAC-learning?

A
A concept class C is said to be PAC-learnable if there exists an algorithm A and a polynomial function poly such that for any eps > 0 and delta > 0, for all distributions D on X, and for any target concept c in C, the following holds for any sample size m >= poly(1/eps, 1/delta, n, size(c)):
Pr_{S~D^m}[R(h_S) <= eps] >= 1 - delta
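A minimal simulation sketch of this guarantee, assuming a 1-D threshold concept and a uniform distribution (a simpler stand-in for the rectangle example used elsewhere in this deck); the learner returns the most specific consistent hypothesis:

import random

def pac_check(t=0.3, m=200, eps=0.05, delta=0.05, trials=2000):
    # Empirically estimate Pr[R(h_S) <= eps] for the threshold
    # concept c(x) = 1 if x >= t, with D uniform on [0, 1].
    successes = 0
    for _ in range(trials):
        S = [random.random() for _ in range(m)]
        positives = [x for x in S if x >= t]
        theta = min(positives) if positives else 1.0  # most specific h_S
        # Under the uniform D, h_S and c disagree exactly on [t, theta),
        # so R(h_S) = theta - t.
        if theta - t <= eps:
            successes += 1
    return successes / trials

print(pac_check() >= 1 - 0.05)  # True: the PAC guarantee holds here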
11
Q

What is the term epsilon e?

A

It is the error term: it says the returned hypothesis is approximately correct, i.e. the accuracy of the algorithm is at least (1 - eps).

12
Q

What is the term delta d?

A

It is the confidence term: the algorithm is probably right, returning an approximately correct hypothesis with probability at least (1 - delta).

13
Q

What are false negatives?

A

Points the hypothesis wrongly labels negative: they lie inside the target concept but outside the hypothesis.

14
Q

What are false positives?

A

Points the hypothesis wrongly labels positive: they lie inside the hypothesis but outside the target concept.

15
Q

What is a consistent hypothesis?

A

It is a hypothesis h with empirical error R^(h) = 0, i.e. it makes no errors on the training sample.

16
Q

What is the version space?

A

The set of all hypotheses in H that are consistent with the training sample.
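A minimal sketch, assuming a small finite hypothesis set of threshold functions (a hypothetical example): the version space is just a filter for zero empirical error.

# (point, label) training pairs and a small finite hypothesis set
# of thresholds h_t(x) = 1 if x >= t.
S = [(0.1, 0), (0.4, 1), (0.9, 1)]
H = [lambda x, t=t: int(x >= t) for t in (0.0, 0.2, 0.3, 0.5, 0.8)]

# Version space: every hypothesis in H with zero empirical error on S.
version_space = [h for h in H if all(h(x) == y for x, y in S)]
print(len(version_space))  # 2: the thresholds 0.2 and 0.3 are consistent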

17
Q

What is the most general hypothesis G?

A

It is the hypothesis that cannot be expanded without including negative training examples: "the largest rectangle" consistent with the data.

18
Q

What is the most specific hypothesis S?

A

It is the hypothesis that cannot be made smaller without excluding positive training points: "the smallest rectangle" containing all positives.
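A minimal Python sketch of this "smallest rectangle" learner (tightest fit around the positive points); the sample below is a hypothetical example:

def tightest_rectangle(S):
    # Most specific hypothesis for axis-aligned rectangles: the
    # smallest rectangle containing every positive training point.
    pos = [p for p, label in S if label == 1]
    if not pos:
        return None  # no positives: predict negative everywhere
    xs = [x for x, _ in pos]
    ys = [y for _, y in pos]
    return (min(xs), max(xs), min(ys), max(ys))

# (point, label) pairs; label 1 means inside the target rectangle.
S = [((1, 1), 1), ((2, 3), 1), ((4, 2), 1), ((0, 5), 0)]
print(tightest_rectangle(S))  # (1, 4, 1, 3): x-range then y-range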