Theory of Learning from Data Flashcards

1
Q

What is a risk function?

A

The risk function of a hypothesis is its expected loss under the data distribution; it measures how badly the hypothesis performs on average.

2
Q

How do the true (expected) risk and the empirical risk differ?

A
  • The true (expected) risk is the risk/loss on unseen data.
  • The empirical risk is the risk/loss on the known (training) data.
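
As a compact illustration, the two quantities are usually written as follows (a sketch in standard notation, assuming a loss L, hypothesis h, and n training samples; not taken from the deck):

    R(h) = \mathbb{E}_{(x,y) \sim P}\left[ L(h(x), y) \right]          % true (expected) risk over the unknown distribution P
    \hat{R}_n(h) = \frac{1}{n} \sum_{i=1}^{n} L(h(x_i), y_i)           % empirical risk on the n known training samples
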
3
Q

What does VC(H) mean?

A

VC(H) is the VC (Vapnik-Chervonenkis) dimension of the hypothesis class H: the cardinality of the largest set of points that H can fully represent, i.e. shatter with every possible labeling.

4
Q

Give VC(H): linear classifiers for d features plus a constant term b

A

d + 1

5
Q

Give VC(H): decision tree of rank r that defines Boolean functions on n Boolean variables

A
6
Q

Give VC(H): neural networks

A

VC(H) ≈ #parameters

7
Q

Give VC(H): a linear classifier in 2D with three points

A

VC(H) = 3
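
A small sanity check of this value: the sketch below (my illustration, not from the deck) enumerates all 2^3 = 8 labelings of three non-collinear points and verifies with a perceptron that each labeling is linearly separable, i.e. the three points are shattered.

    import itertools
    import numpy as np

    # Three non-collinear points in the plane (hypothetical choice of points).
    X = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])

    def linearly_separable(X, y, epochs=1000):
        """Perceptron check: True if some w, b classify every point correctly."""
        w, b = np.zeros(X.shape[1]), 0.0
        for _ in range(epochs):
            mistakes = 0
            for xi, yi in zip(X, y):
                if yi * (w @ xi + b) <= 0:       # misclassified (or on the boundary)
                    w, b = w + yi * xi, b + yi   # standard perceptron update
                    mistakes += 1
            if mistakes == 0:
                return True
        return False

    # Shattering: every one of the 8 label assignments must be realizable.
    labelings = itertools.product([-1, 1], repeat=len(X))
    print(all(linearly_separable(X, np.array(y)) for y in labelings))   # expected: True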

8
Q

What is structural risk minimization?

A

Choosing among hypothesis classes of increasing complexity by minimizing a bound on the true risk that combines the empirical risk with a complexity penalty based on the VC dimension; it balances empirical risk and VC dimension.
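
For illustration, one common form of the VC generalization bound that SRM minimizes over nested classes H_1 ⊂ H_2 ⊂ … (a sketch; the exact constants vary between sources): with probability at least 1 − η,

    R(h) \;\le\; \hat{R}_n(h) + \sqrt{\frac{d_{VC}\left(\ln\frac{2n}{d_{VC}} + 1\right) - \ln\frac{\eta}{4}}{n}}

SRM then selects the class (and hypothesis) for which this right-hand side is smallest.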

9
Q

What is Bayes' theorem?

A

P(A | B) = P(B | A) · P(A) / P(B)
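
As an illustration of how the theorem is typically used in classification (standard notation, not necessarily the lecture's): the posterior over a label y given features x is

    P(y \mid x) = \frac{P(x \mid y)\, P(y)}{P(x)}    % posterior = likelihood * prior / evidence
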
10
Q

How do the Bayesian view and the cost function view differ?

A
11
Q

Give the Bayesian probabilistic formulation.

A
12
Q

How are the Bayesian view and the cost function view connected?

A

Minimizing the negative log posterior (MAP estimation) in the Bayesian view corresponds to minimizing a cost function of the form data loss + regularizer.
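
A hedged sketch of this correspondence in standard notation (not necessarily the lecture's exact formulation): for parameters w and data D,

    -\log P(w \mid D) = \underbrace{-\log P(D \mid w)}_{\text{data loss}} \; \underbrace{-\log P(w)}_{\text{regularizer}} \; + \; \text{const}

Under this reading, a Gaussian prior on w yields an L2 penalty and a Laplace prior yields an L1 penalty.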
13
Q

How can model complexity be reduced?

A

For example by using fewer parameters/features, or by restricting the size of the parameters (regularization).

14
Q

How can parameter sizes be restricted?

A

By adding a regularization term (a penalty on the size of the weights, e.g. the L1 or L2 norm) to the cost function.

15
Q

Describe the L2-norm regularizer.

A

Add a penalty term λ‖w‖₂² (the squared L2 norm of the weights) to the cost function; it shrinks all weights towards zero and discourages large parameter values.

16
Q

Describe the L1-norm regularizer.

A

Add a penalty term λ‖w‖₁ (the L1 norm of the weights) to the cost function; it drives many weights exactly to zero and therefore produces sparse solutions.
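
A minimal numpy sketch of both penalties on a toy linear model (hypothetical data and weights, purely illustrative):

    import numpy as np

    # Hypothetical data and linear model weights (illustration only).
    X = np.array([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
    y = np.array([1.0, 2.0, 3.0])
    w = np.array([0.5, -0.25])
    lam = 0.1   # regularization strength

    data_loss = np.mean((X @ w - y) ** 2)     # squared-error cost on the data
    l2_penalty = lam * np.sum(w ** 2)         # L2 regularizer: shrinks all weights
    l1_penalty = lam * np.sum(np.abs(w))      # L1 regularizer: favors sparse weights

    print("L2-regularized cost:", data_loss + l2_penalty)
    print("L1-regularized cost:", data_loss + l1_penalty)
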
17
Q

Explain cross-validation.

A

Split the data into k folds; train on k - 1 folds and evaluate on the held-out fold; repeat so that every fold is used as test data once, and average the results. With k = n this is the leave-one-out method.
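
A minimal sketch of how the fold indices can be generated (my illustration using numpy; not the lecture's code):

    import numpy as np

    def k_fold_indices(n_samples, k):
        """Split indices 0..n_samples-1 into k disjoint folds (no shuffling for simplicity)."""
        folds = np.array_split(np.arange(n_samples), k)
        for i, test_idx in enumerate(folds):
            train_idx = np.concatenate([f for j, f in enumerate(folds) if j != i])
            yield train_idx, test_idx

    # Toy usage: 6 samples, 3 folds; with k = n_samples this becomes leave-one-out.
    for train_idx, test_idx in k_fold_indices(6, 3):
        print("train:", train_idx, "test:", test_idx)
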
18
Q

Which of the following statements on the different kinds of cross-validation are correct?
1. The leave-one-out method is a special form of k-fold cross-validation.
2. Cross-validation is used to find the best training data to train a model.
3. The bootstrap resampling technique involves dividing the dataset into multiple partitions, evaluating each subset individually as test data after training on the rest.
4. A major advantage of k-fold cross-validation is that it is a fast method to test the quality of the chosen model.

A

1 (the leave-one-out method is k-fold cross-validation with k = n)

19
Q

The Vapnik Chervonenkis (VC) dimension of a classifier H is the cardinality of the smallest set that can be fully represented by H.
Is that true?

A

No; it is actually the cardinality of the largest set that the classifier H can fully represent.

20
Q

Which of the following statements on VC theory are correct?
1. A larger model complexity implies a smaller empirical risk.
2. The effective model complexity is fixed during the course of training.
3. The empirical risk is a good measure for the generalization capabilities of a model.
4. Structural risk minimization balances empirical risk and VC dimension.

A

1 and 4