Neural Networks Flashcards

1
Q

What is logistic regression used for?

A

Binary Classification tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the sigmoid function in logistic regression?

A

σ(x) = 1 / (1 + e^-x), it converts the relation of the input with the separating hyperplane and outputs a probability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How are the weights estimated in logistic regression?

A

Using maximum likelihood estimation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a key advantage of neural networks over logistic regression?

A

Neural networks can learn complex non-linear relationships in data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the vanishing gradient problem in deep neural networks?

A

The gradient becomes very small for earlier layers, making it difficult to train deep networks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an embedding in machine learning?

A

A learned representation of categorical data in a continuous vector space

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the purpose of an autoencoder?

A

To learn compact representations (encodings) of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a convolutional neural network (CNN) typically used for?

A

Back: Image processing and computer vision tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What property do convolutions provide to CNNs?

A

Translation equivariance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is transfer learning?

A

Adapting a pre-trained model to a new but related task

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What activation function helps mitigate the vanishing gradient problem?

A

ReLU (Rectified Linear Unit)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the purpose of pooling layers in CNNs?

A

To add some translation invariance and reduce spatial dimensions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the softmax function used for in neural networks?

A

To convert raw scores into probabilities for multi-class classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the difference between batch gradient descent and stochastic gradient descent?

A

Batch uses all training samples per update, while stochastic uses one sample at a time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is mini-batch gradient descent?

A

A compromise between batch and stochastic, using a small subset of samples per update

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the purpose of momentum in gradient descent optimization?

A

To accelerate convergence and help overcome local optima

17
Q

What is a skip connection in neural networks?

A

A connection that bypasses one or more layers, helping to mitigate the vanishing gradient problem

18
Q

What is batch normalization?

A

A technique to normalize the inputs of each layer, improving training stability and speed

19
Q

What is the difference between a dense layer and a convolutional layer?

A

Dense layers connect all inputs to all outputs, while convolutional layers use local receptive fields

20
Q

What is the purpose of dropout in neural networks?

A

To prevent overfitting by randomly setting a fraction of inputs to zero during training

21
Q

What is a variational autoencoder (VAE)?

A

A probabilistic version of autoencoders that learns a continuous latent space representation

22
Q

What is the difference between an encoder and a decoder in an autoencoder?

A

The encoder compresses input data into a latent representation, while the decoder reconstructs the input from this representation