Feedforward Neural Networks 2 Flashcards by Katherine Fadeyeva

Q

Node

A

Processes input data and contributes to the network’s output
Its value can be a tensor, matrix, vector, or scalar
Its input can come from the features of the data or from the output value of another node

Q

Edge

A

Represents a function argument (the output of one node passed as input to another)
Stored as a pointer to a node
The weight associated with an edge can be adjusted during optimization to make better predictions
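
The Node and Edge cards can be pictured with a minimal sketch of a computation graph. The `Node` class, its field names, and the example values are assumptions made for illustration and do not come from any particular library. Here the adjustable weight `w` is stored as an input node, which is one common way of drawing the graph.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Node:
    """A graph node: holds an input value or computes one from its arguments."""
    name: str
    value: float = 0.0                                 # used when the node is an input
    func: Optional[Callable[..., float]] = None        # how to combine argument values
    args: List["Node"] = field(default_factory=list)   # edges: pointers to argument nodes

    def forward(self) -> float:
        # An input node (no function) just returns its stored value.
        if self.func is None:
            return self.value
        # Otherwise follow each edge, evaluate the argument node, apply the function.
        return self.func(*(arg.forward() for arg in self.args))


# Build y = w * x + b from scalar nodes.
x = Node("x", value=2.0)
w = Node("w", value=3.0)
b = Node("b", value=1.0)
wx = Node("w*x", func=lambda a, c: a * c, args=[w, x])
y = Node("y", func=lambda a, c: a + c, args=[wx, b])

result = y.forward()  # 3.0 * 2.0 + 1.0 = 7.0
```

Changing `w.value` and calling `y.forward()` again gives a different prediction, which is the sense in which a weight is adjusted during optimization.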

Q

Epoch

A

One full cycle through the entire training dataset
During each epoch, the parameters (weights and biases) are adjusted based on the gradients of the loss function

Q

Training a Model Steps

A

1. Define a computation graph (FFNN class)
2. For each epoch:
a. For each batch of data: compute the loss, use autograd to compute the gradients, and take a step with the optimizer
b. Evaluate on the validation set to check for overfitting
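
The steps above can be sketched in plain Python. This is only an illustration under stated assumptions: the "model" is a single linear unit y = w * x + b rather than a full FFNN class, the gradients are derived by hand where an autograd library would compute them, and the optimizer is plain gradient descent. The function name `train` and all hyperparameter values are hypothetical.

```python
import random


def train(train_data, valid_data, epochs=500, batch_size=2, lr=0.05, seed=0):
    """Fit y = w * x + b with mini-batch SGD; returns (w, b, validation losses)."""
    rng = random.Random(seed)
    w, b = 0.0, 0.0                      # parameters (weight and bias)
    valid_losses = []
    for _ in range(epochs):
        data = list(train_data)
        rng.shuffle(data)                # new example order every epoch
        for start in range(0, len(data), batch_size):
            batch = data[start:start + batch_size]
            # Gradients of the mean squared error, derived by hand
            # (an autograd library would compute these automatically).
            grad_w = sum(2 * (w * x + b - y) * x for x, y in batch) / len(batch)
            grad_b = sum(2 * (w * x + b - y) for x, y in batch) / len(batch)
            # Optimizer step: plain gradient descent.
            w -= lr * grad_w
            b -= lr * grad_b
        # Evaluate on held-out data after each epoch.
        valid_loss = sum((w * x + b - y) ** 2 for x, y in valid_data) / len(valid_data)
        valid_losses.append(valid_loss)
    return w, b, valid_losses


# Toy data generated from y = 2x + 1.
train_data = [(x, 2.0 * x + 1.0) for x in [0.0, 1.0, 2.0, 3.0]]
valid_data = [(x, 2.0 * x + 1.0) for x in [0.5, 1.5]]
w, b, valid_losses = train(train_data, valid_data)
```

The list of validation losses is what a check for overfitting would look at: training loss keeps falling while validation loss starts to rise.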

Q

Shuffling the Training Data

A

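Problem: what if “I love you” appears 1000 times in a row at the end of the training set?
The parameters would be updated inaccurately, skewed toward that one repeated example
Solution: randomly shuffle the order of the examples at each time step or before each training epoch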
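
A minimal sketch of per-epoch shuffling, using only Python's standard `random` module. The helper name `shuffled_epochs`, the toy sentences, and the seed are assumptions for illustration.

```python
import random

# Toy training set with one sentence repeated at the end, as in the card.
train_set = ["the cat sat", "dogs bark", "birds sing"] + ["I love you"] * 5


def shuffled_epochs(data, num_epochs, seed=0):
    """Yield a freshly shuffled copy of the data for each epoch."""
    rng = random.Random(seed)
    for _ in range(num_epochs):
        order = list(data)      # copy, so the original list is left untouched
        rng.shuffle(order)      # in-place shuffle of the copy
        yield order


epochs = list(shuffled_epochs(train_set, num_epochs=3))
```

Each epoch sees the same examples, only in a different order, so the repeated sentence is spread through the epoch instead of dominating the last updates.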

Q

Early Stopping

A

Prevents overfitting
Stop training when performance on the validation set starts to decline
Return the parameters that performed best
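
One way to sketch early stopping is shown below. The `patience` parameter (how many non-improving epochs to tolerate before stopping) is an assumption added here and is not mentioned on the card. The function name and the made-up loss history are hypothetical as well.

```python
def early_stopping(epoch_results, patience=2):
    """epoch_results yields (params, validation_loss) once per epoch.

    Returns (best_params, best_loss, epochs_run).
    """
    best_params, best_loss = None, float("inf")
    epochs_without_improvement = 0
    epochs_run = 0
    for params, valid_loss in epoch_results:
        epochs_run += 1
        if valid_loss < best_loss:
            # New best: remember these parameters and reset the counter.
            best_params, best_loss = params, valid_loss
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break  # validation performance has stopped improving
    return best_params, best_loss, epochs_run


# Made-up history: validation loss falls, then rises from the fourth epoch on.
history = [("p1", 0.90), ("p2", 0.60), ("p3", 0.50),
           ("p4", 0.55), ("p5", 0.70), ("p6", 0.40)]
best_params, best_loss, epochs_run = early_stopping(history, patience=2)
```

With this history, training stops after the fifth epoch and the third epoch's parameters are returned. The sixth entry is never reached, which shows the trade-off in choosing a small patience.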

Q

Training Tricks

A

Shuffling the training data
Early stopping
Parameter dropout

Q

Parameter Dropout

A

Prevents overfitting
Randomly sets a portion of the model’s parameters (weights) to zero
Done only at training time, not at test time
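
A minimal sketch of the idea as the card describes it: each weight is zeroed with some probability during training and left alone otherwise. The function name, the `rate` parameter, and the example weights are assumptions. Library dropout layers typically also rescale the surviving values, which this sketch leaves out.

```python
import random


def drop_parameters(weights, rate, training, rng):
    """Return a copy of weights with each entry zeroed with probability `rate`.

    When training is False, the weights are returned unchanged.
    """
    if not training:
        return list(weights)
    return [0.0 if rng.random() < rate else w for w in weights]


rng = random.Random(0)
weights = [0.5, -1.2, 0.8, 2.0, -0.3, 1.1]

train_weights = drop_parameters(weights, rate=0.5, training=True, rng=rng)
test_weights = drop_parameters(weights, rate=0.5, training=False, rng=rng)
```

A fresh random mask is drawn on every call, so different weights are dropped at different training steps.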

Q

Mini-batching

A

Splits the training set into smaller, manageable subsets (mini-batches)
Combines many small operations into one big one within each batch (more efficient computation)
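
Splitting a dataset into mini-batches can be sketched with list slicing. The helper name `make_batches` and the batch size are assumptions for illustration.

```python
def make_batches(examples, batch_size):
    """Split a list of examples into consecutive mini-batches."""
    return [examples[i:i + batch_size]
            for i in range(0, len(examples), batch_size)]


examples = list(range(10))
batches = make_batches(examples, batch_size=4)
# The last batch is smaller when the dataset size is not a multiple of batch_size.
```

Each batch would then be processed in a single pass (for example, one matrix operation over the whole batch) instead of one example at a time.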

Q

Batch Data Loading

A

Instead of processing a single sentence at a time, process a mini-batch of sentences
Sentence padding must be performed on each mini-batch

Q

Sentence Padding

A

Done during batch data loading
Pads the shorter sentences in a batch so that every sentence matches the length of the longest one
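
A minimal sketch of padding a mini-batch of tokenized sentences. The `<pad>` token string and the helper name `pad_batch` are assumptions. Real pipelines usually pad with a reserved vocabulary index instead of a string.

```python
PAD = "<pad>"  # hypothetical padding token


def pad_batch(sentences, pad_token=PAD):
    """Pad every sentence in the batch to the length of the longest one."""
    max_len = max(len(s) for s in sentences)
    return [s + [pad_token] * (max_len - len(s)) for s in sentences]


batch = [["I", "love", "you"], ["hello"], ["good", "morning"]]
padded = pad_batch(batch)
```

After padding, all sentences in the batch have the same length, so the batch can be stored as one rectangular array.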
