Quiz 3 Flashcards
Optimization error
Even if your neural network can perfectly model the real world, your optimization algorithm may not be able to find the set of weights that represents that function.
Estimation error
Even if optimization finds the best hypothesis (the best set of weights or parameters for the network), that doesn't mean the model will generalize to the test set, because it was trained on finite data.
Modeling error
Given a particular neural network architecture, the true function that describes the real world may not lie in that hypothesis space.
Optimization error scenario
You were lazy and couldn’t/didn’t optimize to completion
Estimation error scenario
You tried to learn the model with finite data
Modeling error scenario
You approximated reality with a model
More complex models lead to ______ modeling error
Smaller
Transfer learning steps
1) Train on large-scale dataset
2) Take your custom data and initialize the network with weights trained in Step 1
3) Continue training on new dataset
Ways to apply transfer learning
Finetune, freeze
Finetune
Update all parameters
Freeze (feature layer)
Update only the last layer's weights (used when you don't have enough data); see the sketch below
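A minimal PyTorch sketch of both modes, assuming torchvision's ImageNet-pretrained ResNet-18 and a placeholder 10-class target task:

```python
import torch.nn as nn
from torchvision import models

# Step 1 is already done: torchvision ships ImageNet-pretrained weights (Step 2 loads them)
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

# Freeze: stop gradients for every pretrained layer, then replace and
# train only the new last layer (Step 3 trains just model.fc)
for param in model.parameters():
    param.requires_grad = False
model.fc = nn.Linear(model.fc.in_features, 10)   # 10 = placeholder class count

# Finetune instead: skip the freeze loop and continue training all
# parameters on the new dataset, typically with a smaller learning rate
```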
When is transfer learning less effective?
If the source dataset you train on is very different from the target dataset
As we add more data, what do we see in generalization error?
Error continues to get smaller / accuracy continues to improve
LeNet architecture
two sets of convolutional, activation, and pooling layers, followed by a fully-connected layer, activation, another fully-connected, and finally a softmax classifier
LeNet architecture shorthand
INPUT => CONV => RELU => POOL => CONV => RELU => POOL => FC => RELU => FC
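A minimal PyTorch sketch of that shorthand for 28x28 MNIST inputs, assuming the classic LeNet-5 channel sizes (with ReLU and max pooling standing in for the original tanh and average pooling):

```python
import torch.nn as nn

# INPUT => CONV => RELU => POOL => CONV => RELU => POOL => FC => RELU => FC
lenet = nn.Sequential(
    nn.Conv2d(1, 6, kernel_size=5, padding=2),   # CONV: 1x28x28 -> 6x28x28
    nn.ReLU(),
    nn.MaxPool2d(2),                             # POOL: -> 6x14x14
    nn.Conv2d(6, 16, kernel_size=5),             # CONV: -> 16x10x10
    nn.ReLU(),
    nn.MaxPool2d(2),                             # POOL: -> 16x5x5
    nn.Flatten(),
    nn.Linear(16 * 5 * 5, 120),                  # FC
    nn.ReLU(),
    nn.Linear(120, 10),                          # FC: class scores (softmax lives in the loss)
)
```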
LeNet architecture good for
Number classification / MNIST dataset
AlexNet architecture
Eight layers with learnable parameters: five convolutional layers (some followed by max pooling) and three fully connected layers, with ReLU activations in every layer except the output.
AlexNet activation function
ReLU instead of sigmoid or tanh (the first major architecture to do so)
AlexNet data augmentation
PCA-based (principal component analysis)
AlexNet regularization
Dropout (the first network to use it)
AlexNet used ______ to combine predictions from multiple networks
Ensembling
VGGNet used ____ modules / blocks of layers
repeated
All convolutions in VGG are ___
3 x 3
Most memory usage in VGG is in
Convolution layers
Most of the parameters in VGG are
In the fully connected layers
VGG max pooling
2x2, stride 2
Number of parameters in VGG
About 138 million (VGG-16)
Number of parameters in AlexNet
60 million
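A back-of-the-envelope check of the two cards above: the first fully connected layer of VGG-16 alone accounts for roughly 100 million of its ~138 million parameters.

```python
# First fully connected layer of VGG-16: a 7x7x512 feature map flattened into 4096 units
fc1_params = 7 * 7 * 512 * 4096     # 102,760,448 weights, ~75% of the network
# A large 3x3 conv layer (512 -> 512 channels) for comparison:
conv_params = 3 * 3 * 512 * 512     # 2,359,296 weights
print(f"{fc1_params:,} vs {conv_params:,}")
```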
Key idea behind Inception
Repeated blocks and multi-scale features
Inception uses _____ filters
parallel (multiple filter sizes applied side by side)
Residual neural networks are easy to ___
optimize
Equivariance
If the input undergoes a transformation, the output undergoes the same transformation: if x → a, then T(x) → T(a).
Invariance
If the input undergoes a transformation, the output is unchanged: if x → a, then T(x) → a.
We want equivariance in ________ layers
intermediate / convolutional
We want invariance in _____ layers
output / final layers (we often also want invariance to rotation)
Convolution is ______ Equivariant
Translation
Max pooling is invariant to ________
Permutation
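A small PyTorch sketch of both properties; circular padding is assumed so the equivariance check is exact (with zero padding it holds only away from the borders):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
x = torch.randn(1, 1, 16, 16)
conv = nn.Conv2d(1, 1, kernel_size=3, padding=1, padding_mode="circular")

# Equivariance: convolving a shifted input == shifting the convolved output
a = conv(torch.roll(x, shifts=(2, 3), dims=(2, 3)))
b = torch.roll(conv(x), shifts=(2, 3), dims=(2, 3))
print(torch.allclose(a, b, atol=1e-6))   # True

# Invariance: the max over a pooling window ignores permutations within it
window = torch.tensor([1.0, 5.0, 2.0, 3.0])
print(window.max() == window[torch.randperm(4)].max())   # tensor(True)
```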
Style transfer
1) Take the content image and compute its features
2) Take the generated image (initialized to zeros or random noise) and compute its features
3) Take the style image and compute its features
4) Update the generated image to minimize both the content and style losses at the same time
Gram matrix
Represents correlations between pairs of feature-map channels within a layer of the neural network (computed at several layers)
How Gram matrix works
1) Take a particular layer in a CNN
2) Take a pair of channels within the feature map
3) Compute the correlation, or dot product, between the two feature maps
4) Repeat for every pair of channels; the results form the entries of the Gram matrix
Loss function in style transfer
Minimize the squared difference between the Gram matrix of the style image and the Gram matrix of the generated image (the style loss), plus the difference between the features of the content image and the generated image (the content loss). The total loss is a weighted sum of the two losses.
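A minimal sketch of the Gram matrix and the weighted total loss; the feature tensors and the weights alpha and beta are placeholders:

```python
import torch

def gram(features):
    # features: (C, H, W) feature map taken from one CNN layer
    c, h, w = features.shape
    f = features.reshape(c, h * w)   # one row per channel
    return f @ f.T / (c * h * w)     # pairwise channel dot products (one normalization convention)

# Placeholder feature maps for the content, style, and generated images
f_content, f_style, f_gen = (torch.randn(64, 32, 32) for _ in range(3))

content_loss = ((f_gen - f_content) ** 2).mean()          # feature (content) difference
style_loss = ((gram(f_gen) - gram(f_style)) ** 2).sum()   # Gram matrix (style) difference
alpha, beta = 1.0, 1e3                                    # placeholder loss weights
total_loss = alpha * content_loss + beta * style_loss
```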
Model with well-calibrated predictions
Logistic regression
Model with poorly calibrated predictions (overconfident)
ResNet
Group calibration
The scores for subgroups of interest are calibrated or equally miscalibrated
A classifier is well-calibrated if
Among the observations assigned a given probability score, the proportion that actually has the label equals that score.
Platt scaling requires
An additional validation dataset
Platt scaling
Learn parameters a and b so that the calibrated probability is sigmoid(az + b), where z is the model's uncalibrated output score (logit)
Difference between Platt scaling and temperature scaling
Temperature scaling applies Platt scaling to multi-class classification using softmax
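A minimal temperature scaling sketch, assuming held-out validation logits and labels (the standard recipe fits T with L-BFGS; plain Adam is used here for brevity):

```python
import torch
import torch.nn.functional as F

def fit_temperature(logits, labels, steps=200, lr=0.01):
    """Learn one scalar T on held-out validation logits/labels by minimizing NLL."""
    log_t = torch.zeros(1, requires_grad=True)   # optimize log T so T stays positive
    opt = torch.optim.Adam([log_t], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        loss = F.cross_entropy(logits / log_t.exp(), labels)
        loss.backward()
        opt.step()
    return log_t.exp().item()

# Usage (val_logits, val_labels, test_logits are placeholders):
# T = fit_temperature(val_logits, val_labels)
# probs = torch.softmax(test_logits / T, dim=1)   # calibrated probabilities
```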
Limitations of calibration
It is group-based (what characteristic denotes the groups?), and there are inherent tradeoffs in calibration (the fairness impossibility theorems)
The Fairness Impossibility Theorems
It is impossible for a classifier to achieve both equal calibration and error rates between groups, if there is a difference in prevalence between the groups and the classifier is not perfect
Positive Predictive Value
PPV = TP/(TP + FP)
What does an impossibility theorem obtain?
For any three (or more) measures of model performance derived from the confusion matrix, the resulting system of equations determines the prevalence p uniquely; if groups have different prevalences, these quantities cannot all be equal across groups (unless the classifier is perfect)
Transposed convolution
Take each input pixel, multiply it by a learnable kernel, and "stamp" the result onto the output (summing where stamps overlap)
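A minimal NumPy sketch of the "stamp" view (stride 2, no padding); overlapping stamps are summed:

```python
import numpy as np

def transposed_conv2d(x, kernel, stride=2):
    """Stamp kernel * x[i, j] onto the output; overlapping stamps are summed."""
    h, w = x.shape
    k = kernel.shape[0]
    out = np.zeros(((h - 1) * stride + k, (w - 1) * stride + k))
    for i in range(h):
        for j in range(w):
            out[i*stride:i*stride + k, j*stride:j*stride + k] += x[i, j] * kernel
    return out

x = np.arange(4.0).reshape(2, 2)
print(transposed_conv2d(x, np.ones((3, 3))))   # 2x2 input upsampled to 5x5 output
```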
Large sensitivity of loss implies what
Important pixels
What do you have to find in saliency maps?
The gradient of the classifier scores (pre-softmax) with respect to the input image. Take the absolute value of the gradient and sum across all channels.
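A minimal PyTorch sketch, assuming a pretrained ResNet-18 and a placeholder input image:

```python
import torch
from torchvision import models

model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT).eval()
image = torch.randn(1, 3, 224, 224, requires_grad=True)   # placeholder input image

scores = model(image)                    # pre-softmax class scores
scores[0, scores.argmax()].backward()    # gradient of the top class score w.r.t. the image

# |gradient|, summed over the 3 color channels -> one saliency value per pixel
saliency = image.grad.abs().sum(dim=1).squeeze()   # shape (224, 224)
```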
What gets zeroed out in guided backprop?
Negative values: the forward pass zeroes out negative activations (ReLU), and the backward pass additionally zeroes out negative gradients (we only pass back positive gradients)
Gradient ascent
Compute the gradient of the score for the class we care about with respect to the input image. Rather than subtracting the learning rate times the gradient (as in gradient descent), we add the learning rate times the gradient.
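A minimal sketch, again assuming a pretrained ResNet-18; the target class, learning rate, and regularization strength are placeholders:

```python
import torch
from torchvision import models

model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT).eval()
image = torch.zeros(1, 3, 224, 224, requires_grad=True)   # start from a zero image
target_class, lr, reg = 130, 1.0, 1e-4                    # placeholder values

for _ in range(100):
    score = model(image)[0, target_class]   # pre-softmax score for the target class
    score.backward()
    with torch.no_grad():
        # ADD the gradient step (ascent) and lightly regularize the image
        image += lr * image.grad - reg * image
        image.grad.zero_()
```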
Defenses against adversarial attacks
Training with adversarial examples, adding perturbations or noise, or re-encoding inputs to remove the attack
Cross entropy
Easy examples each incur a non-negligible loss, which in aggregate masks out the harder, rare examples
Focal loss
Down weights easy examples to give more attention to difficult examples
Focal loss formula
FL(p_t) = -(1 - p_t)^γ log(p_t), where p_t is the predicted probability of the true class and γ ≥ 0 is the focusing parameter
Focal loss is used to
Address the issue of the class imbalance problem
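A minimal sketch of the formula above, comparing an easy and a hard example; the down-weighting factor (1 - p_t)^γ shrinks the easy example's loss far more than the hard one's:

```python
import torch

def focal_loss(p_t, gamma=2.0):
    """FL(p_t) = -(1 - p_t)^gamma * log(p_t), p_t = predicted prob. of the true class."""
    return -((1 - p_t) ** gamma) * torch.log(p_t)

easy, hard = torch.tensor(0.95), torch.tensor(0.3)
print(focal_loss(easy).item())   # ~0.0001: an easy example is strongly down-weighted
print(focal_loss(hard).item())   # ~0.59:  a hard example keeps most of its loss
```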
Receptive field defines
The set of input pixels in the original image that affect the value of a given node or activation deep inside the network
As you get deeper into the neural network, the receptive field
continues to grow, with each successive layer expanding it further
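A small sketch of the standard receptive-field recurrence (r grows by (k - 1) times the cumulative stride at each layer); the layer lists are illustrative:

```python
def receptive_field(layers):
    """layers: list of (kernel_size, stride) pairs, input to output."""
    r, j = 1, 1                 # receptive field size and cumulative stride ("jump")
    for k, s in layers:
        r += (k - 1) * j
        j *= s
    return r

print(receptive_field([(3, 1), (3, 1), (3, 1)]))           # 7: three 3x3 convs see 7x7
print(receptive_field([(3, 1), (2, 2), (3, 1), (3, 1)]))   # 12: a stride-2 pool speeds growth
```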