Feedforward Neural Networks 1 Flashcards
1
Q
Neural Network
A
- Has a single hidden layer (contrast with deep networks, which have two or more)
- Can separate complex (non-linearly separable) feature spaces, which linear functions cannot
- Suited to simpler problems where the relationship between input and output is relatively direct
2
Q
Why can’t we use Linear Model for everything?
A
Linear models can’t learn FEATURE COMBINATIONS.
3
Q
Linear Features
A
- Each feature is treated independently (not connected to any other feature)
- In text classification, the weight of each word is learned separately and the weights are summed at prediction time (see the sketch below)
- Typical of linear models such as logistic regression
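A minimal sketch of this independent, summed scoring (the words and weights here are hypothetical, not from a trained model):

```python
import math

# Hypothetical per-word weights learned by logistic regression
# (positive weight -> evidence for the positive class).
weights = {"great": 1.2, "boring": -1.5, "movie": 0.1}
bias = 0.0

def score(tokens):
    # Each word contributes independently; contributions are summed.
    return bias + sum(weights.get(t, 0.0) for t in tokens)

def prob_positive(tokens):
    # The logistic (sigmoid) function squashes the linear score into a probability.
    return 1.0 / (1.0 + math.exp(-score(tokens)))

print(prob_positive("a great movie".split()))   # high
print(prob_positive("a boring movie".split()))  # low
```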
4
Q
Feature Combinations
A
- Combining two or more features to create NEW, MORE COMPLEX features
- Captures non-linear interactions, polynomial features, or higher-order relationships (see the example below)
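For illustration, one simple way to build combined features is to append pairwise products of the raw features (a hand-rolled sketch; feature-engineering libraries offer richer versions):

```python
from itertools import combinations

def add_pairwise_products(x):
    # x: list of raw feature values.
    # Append the product of every pair as a new, more complex feature.
    return x + [a * b for a, b in combinations(x, 2)]

print(add_pairwise_products([2.0, 3.0, 5.0]))
# [2.0, 3.0, 5.0, 6.0, 10.0, 15.0]
```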
5
Q
XOR
A
- A simple non-linear function
- Demonstrates that linear models cannot solve non-linearly separable problems: no single line separates its outputs (see the sketch below)
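A minimal sketch of a one-hidden-layer network that computes XOR, with hand-picked (not learned) weights:

```python
import numpy as np

def step(z):
    # Hard threshold activation: 1 if the input is positive, else 0.
    return (z > 0).astype(float)

def xor(x1, x2):
    x = np.array([x1, x2], dtype=float)
    # Hidden unit 1 computes OR, hidden unit 2 computes AND.
    h = step(np.array([[1.0, 1.0], [1.0, 1.0]]) @ x + np.array([-0.5, -1.5]))
    # Output: OR minus twice AND, thresholded -> XOR.
    return step(np.array([1.0, -2.0]) @ h - 0.5)

for a, b in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(a, b, int(xor(a, b)))  # prints 0, 1, 1, 0
```

The hidden layer is what makes this possible: it builds the feature combinations (OR, AND) that no single linear function can express.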
6
Q
Deep Neural Networks
A
- Uses two or more hidden layers
- Can separate even more complex feature spaces
- Used for tasks requiring learning from large amounts of unstructured data
7
Q
Immediate Conjunctive Features
A
- A specific combination of two or more features that creates a NEW feature
- Simple, direct conjunctions of existing features: the new feature fires only when all of its components do (see the example below)
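A toy conjunctive feature (the feature names are made up):

```python
def conj(f1, f2):
    # Immediate conjunction: the new feature is 1 only if both inputs are 1.
    return int(f1 and f2)

# E.g. "contains 'not'" AND "contains 'good'" as one new feature.
has_not, has_good = 1, 1
print(conj(has_not, has_good))  # 1 -> a negated-sentiment signal
```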
8
Q
softmax()
A
- Converts the raw scores from the output layer into probabilities that sum to one
- Works by exponentiating each element of the input vector and then normalizing (see the sketch below)
- “Different weights, same feature”
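A standard numpy sketch; subtracting the maximum score before exponentiating is a common numerical-stability trick:

```python
import numpy as np

def softmax(scores):
    # Exponentiate each score, then normalize so the outputs sum to one.
    exps = np.exp(scores - np.max(scores))  # shift for numerical stability
    return exps / exps.sum()

probs = softmax(np.array([2.0, 1.0, 0.1]))
print(probs, probs.sum())  # approx [0.659 0.242 0.099], sums to 1.0
```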
9
Q
Feedforward Neural Network
A
- A neural network whose connections between nodes form no cycles (information flows forward only)
- Neurons in each layer are fully connected to neurons in the next layer
- Each neuron applies an activation function to its weighted inputs
- After producing the output, the network computes a loss function and backpropagates (a forward pass is sketched below)
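A minimal forward pass for one hidden layer in numpy (the layer sizes and random weights are placeholders):

```python
import numpy as np

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(4, 3)), np.zeros(4)   # input(3) -> hidden(4)
W2, b2 = rng.normal(size=(2, 4)), np.zeros(2)   # hidden(4) -> output(2)

def forward(x):
    h = np.tanh(W1 @ x + b1)          # hidden layer with activation
    z = W2 @ h + b2                   # raw output scores (logits)
    exps = np.exp(z - z.max())
    return exps / exps.sum()          # softmax -> class probabilities

print(forward(np.array([1.0, 0.5, -0.2])))
```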
10
Q
Loss Function
A
Quantifies the difference between the predicted output and the actual target (see the cross-entropy sketch below)
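For classification, the usual choice is cross-entropy: the negative log probability the model assigns to the true class (a sketch):

```python
import numpy as np

def cross_entropy(probs, target):
    # Loss is small when the predicted probability of the true class is high.
    return -np.log(probs[target])

print(cross_entropy(np.array([0.7, 0.2, 0.1]), target=0))  # ~0.357 (good prediction)
print(cross_entropy(np.array([0.1, 0.2, 0.7]), target=0))  # ~2.303 (bad prediction)
```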
11
Q
Backpropagation
A
Gradients of the loss function with respect to each weight are computed (by the chain rule, layer by layer) and the weights are adjusted in the direction opposite the gradient (see the update sketch below)
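The resulting update rule, sketched for plain gradient descent (the learning rate here is an arbitrary choice):

```python
import numpy as np

def sgd_step(weights, grads, lr=0.1):
    # Move each weight a small step in the direction opposite its gradient.
    return weights - lr * grads

w = np.array([0.5, -1.0])
g = np.array([0.2, -0.4])   # gradients of the loss w.r.t. w
print(sgd_step(w, g))       # [0.48 -0.96]
```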
12
Q
Log Likelihood
A
- Simply the natural logarithm of the likelihood function
- Likelihoods involve products of probabilities
- The log turns these products into sums, simplifying optimization and avoiding numerical underflow (see below)
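A quick numeric illustration: the product of many small probabilities underflows to zero, while the sum of their logs stays representable:

```python
import math

probs = [0.01] * 200  # 200 independent observations, each with prob 0.01

likelihood = math.prod(probs)                     # underflows
log_likelihood = sum(math.log(p) for p in probs)  # stays finite

print(likelihood)       # 0.0 (floating-point underflow)
print(log_likelihood)   # about -921.0
```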
13
Q
Likelihood
A
- Quantifies how well the model explains the observed data, given particular parameter values
- Measures the probability of observing the data under the model
14
Q
Why maximize log likelihood?
A
- Finds the neural network parameters that best explain the observed data
- Higher values indicate the model predicts the observed data more accurately, which also makes it useful for model evaluation
- Is often combined with regularization terms to prevent overfitting
15
Q
Gradient of the Loss Function
A
- Used by optimization algorithms such as gradient descent
- Tells us how to adjust each parameter to minimize the loss
- A positive gradient indicates that increasing the parameter will increase the loss
- A negative gradient indicates that increasing the parameter will decrease the loss (see the numeric example below)
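A tiny numeric illustration of the sign rule, using the toy loss L(w) = (w - 3)^2:

```python
def loss(w):
    return (w - 3.0) ** 2

def grad(w):
    return 2.0 * (w - 3.0)   # dL/dw

w = 5.0
print(grad(w))          # +4.0: positive gradient, so increasing w raises the loss
w = w - 0.5 * grad(w)   # step against the gradient
print(w, loss(w))       # lands at the minimum, w = 3.0, loss = 0.0
```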