Chapter 3 Getting started with neural networks Flashcards
relu function vs. sigmoid
relu (rectified linear unit) zeroes out negative values; sigmoid “squashes” arbitrary values into the [0, 1] interval
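A minimal numpy sketch of both activations (the helper names `relu` and `sigmoid` are illustrative, not from the deck):

```python
import numpy as np

def relu(x):
    # Zeroes out negative values, passes positive values through.
    return np.maximum(x, 0.0)

def sigmoid(x):
    # Squashes any real value into the (0, 1) interval.
    return 1.0 / (1.0 + np.exp(-x))

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(relu(x))     # [0.  0.  0.  0.5 2. ]
print(sigmoid(x))  # values between 0 and 1, e.g. ~0.12 for -2.0
```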
Advantages of larger layers?
smaller layers can act as information bottlenecks, permanently dropping important information that later layers won’t have access to, as in the sketch below.
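A hedged Keras sketch of the problem: the 2-unit middle layer below is an information bottleneck for a 46-class output (all layer sizes are illustrative assumptions):

```python
from tensorflow import keras
from tensorflow.keras import layers

bottlenecked = keras.Sequential([
    layers.Dense(64, activation="relu"),
    # Too small: whatever information these 2 units cannot carry is
    # permanently lost to every layer after this one.
    layers.Dense(2, activation="relu"),
    layers.Dense(46, activation="softmax"),
])
```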
softmax activation
the network will output a probability distribution over the different output classes; the scores sum to 1
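A numerically stable numpy sketch of softmax (the helper name is illustrative):

```python
import numpy as np

def softmax(logits):
    # Subtracting the max doesn't change the result but avoids overflow.
    exps = np.exp(logits - np.max(logits))
    return exps / np.sum(exps)

scores = np.array([2.0, 1.0, 0.1])
probs = softmax(scores)
print(probs)        # ~[0.659 0.242 0.099]
print(probs.sum())  # 1.0
```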
categorical_crossentropy
loss function that measures the distance between two probability distributions: here, the distribution output by the network and the true distribution of the labels
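A hedged numpy sketch for a single sample (the helper name and epsilon are illustrative):

```python
import numpy as np

def categorical_crossentropy(y_true, y_pred, eps=1e-7):
    # Distance between the true (one-hot) distribution and the
    # predicted distribution: -sum(y_true * log(y_pred)).
    y_pred = np.clip(y_pred, eps, 1.0)
    return -np.sum(y_true * np.log(y_pred))

y_true = np.array([0.0, 1.0, 0.0])  # one-hot true label
y_pred = np.array([0.1, 0.7, 0.2])  # softmax output
print(categorical_crossentropy(y_true, y_pred))  # -log(0.7) ~= 0.357
```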
network overfitting
when the network starts learning patterns specific to the training data rather than trends that generalize to new data
what loss function should you use for single-label, multiclass classification problems?
categorical crossentropy
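A hedged Keras sketch of wiring that loss into a single-label, multiclass model (the 46-class output and layer sizes are illustrative assumptions):

```python
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    layers.Dense(64, activation="relu"),
    layers.Dense(46, activation="softmax"),  # one probability per class
])
model.compile(optimizer="rmsprop",
              loss="categorical_crossentropy",
              metrics=["accuracy"])
```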
feature-wise normalization
a best practice for data preprocessing: for each feature in the input data, subtract the mean of the feature and divide by its standard deviation, so the feature is centered around 0 and has a unit standard deviation
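A hedged numpy sketch (the dummy data is illustrative); note the test split is normalized with statistics computed on the training split only:

```python
import numpy as np

rng = np.random.default_rng(0)
train_data = rng.normal(5.0, 2.0, size=(100, 3))  # dummy: 100 samples, 3 features
test_data = rng.normal(5.0, 2.0, size=(20, 3))

# Per-feature statistics from the training data only,
# so no information leaks from the test set.
mean = train_data.mean(axis=0)
std = train_data.std(axis=0)

train_data = (train_data - mean) / std
test_data = (test_data - mean) / std
```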
amount of data and overfitting
the less data you have, the worse overfitting is
what is one way to mitigate overfitting?
use a smaller network (fewer layers, fewer units per layer); a network with less capacity is less able to memorize the training data
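A hedged Keras sketch of a deliberately small binary classifier (the 4-unit layers are an illustrative choice, not a prescription):

```python
from tensorflow import keras
from tensorflow.keras import layers

smaller_model = keras.Sequential([
    layers.Dense(4, activation="relu"),
    layers.Dense(4, activation="relu"),
    layers.Dense(1, activation="sigmoid"),
])
smaller_model.compile(optimizer="rmsprop",
                      loss="binary_crossentropy",
                      metrics=["accuracy"])
```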
widely used loss function for regression problems?
Mean Squared Error
Mean Squared Error
loss function: the average of the squared differences between the predictions and the targets
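A minimal numpy sketch (the helper name is illustrative):

```python
import numpy as np

def mse(y_true, y_pred):
    # Mean of the squared differences between predictions and targets.
    return np.mean((y_pred - y_true) ** 2)

y_true = np.array([3.0, 5.0, 2.5])
y_pred = np.array([2.5, 5.0, 3.0])
print(mse(y_true, y_pred))  # ~0.167
```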
Mean Absolute Error (MAE)
metric for monitoring model performance in regression problems: the average of the absolute differences between the predictions and the targets
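A minimal numpy sketch (the helper name is illustrative); unlike MSE, MAE stays in the units of the target:

```python
import numpy as np

def mae(y_true, y_pred):
    # Mean of the absolute differences between predictions and targets.
    return np.mean(np.abs(y_pred - y_true))

y_true = np.array([3.0, 5.0, 2.5])
y_pred = np.array([2.5, 5.0, 3.0])
print(mae(y_true, y_pred))  # ~0.333: off by 0.33 on average
```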