Lecture 10 - Recognition Flashcards
What is image classification?
Image classification is the task of assigning a label from a predefined set of categories to an entire image.
Describe the architecture of LeNet.
LeNet is one of the first convolutional networks designed for handwritten digit recognition, featuring layers of convolutional filters followed by subsampling (pooling) and fully connected layers.
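A minimal PyTorch sketch of a LeNet-style network (PyTorch and the exact layer sizes are illustrative assumptions, following the classic LeNet-5 layout for 32x32 grayscale inputs rather than details from the lecture):

```python
import torch
import torch.nn as nn

class LeNet(nn.Module):
    def __init__(self, num_classes=10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 6, kernel_size=5),   # 32x32 -> 28x28
            nn.Tanh(),
            nn.AvgPool2d(2),                  # subsampling: 28x28 -> 14x14
            nn.Conv2d(6, 16, kernel_size=5),  # 14x14 -> 10x10
            nn.Tanh(),
            nn.AvgPool2d(2),                  # 10x10 -> 5x5
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(16 * 5 * 5, 120),
            nn.Tanh(),
            nn.Linear(120, 84),
            nn.Tanh(),
            nn.Linear(84, num_classes),
        )

    def forward(self, x):
        return self.classifier(self.features(x))

logits = LeNet()(torch.randn(1, 1, 32, 32))  # -> shape (1, 10)
```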
What is AlexNet and its significance?
AlexNet is a convolutional network that won the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) in 2012, significantly outperforming previous models with its deep architecture and use of ReLU activation and dropout.
Explain dropout in neural networks.
Dropout is a regularization technique where a random subset of neurons is ignored during training, preventing overfitting by ensuring the model does not rely too heavily on any single neuron.
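A small illustration of how dropout behaves differently in training and evaluation modes, assuming PyTorch (not specified in the lecture):

```python
import torch
import torch.nn as nn

# Dropout zeroes a random subset of activations during training and rescales
# the survivors; in eval mode it is the identity.
layer = nn.Dropout(p=0.5)
x = torch.ones(1, 8)

layer.train()
print(layer(x))   # roughly half the entries are zero, survivors scaled by 1/(1-p) = 2
layer.eval()
print(layer(x))   # unchanged: all ones
```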
What is data augmentation and its purpose?
Data augmentation artificially enlarges the training dataset by applying label-preserving transformations such as cropping, flipping, and color jittering to improve model generalization.
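An illustrative augmentation pipeline, assuming torchvision transforms; the crop size and jitter strengths are arbitrary example values, not settings from the lecture:

```python
from torchvision import transforms

# Label-preserving transformations applied on the fly to each training image,
# so every epoch sees slightly different variants of the same data.
train_transform = transforms.Compose([
    transforms.RandomResizedCrop(224),
    transforms.RandomHorizontalFlip(),
    transforms.ColorJitter(brightness=0.2, contrast=0.2, saturation=0.2),
    transforms.ToTensor(),
])
```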
Describe the purpose of batch normalization.
Batch normalization normalizes each layer's activations over a mini-batch to zero mean and unit variance, then applies a learnable scale and shift; this accelerates training and improves performance, originally motivated as reducing internal covariate shift.
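A minimal sketch of the batch-norm computation for a (batch, features) tensor, assuming PyTorch; the function name and shapes are illustrative:

```python
import torch

# gamma/beta are the learnable scale and shift; eps avoids division by zero.
def batch_norm(x, gamma, beta, eps=1e-5):
    mean = x.mean(dim=0, keepdim=True)                  # per-feature mean over the batch
    var = x.var(dim=0, unbiased=False, keepdim=True)    # per-feature variance over the batch
    x_hat = (x - mean) / torch.sqrt(var + eps)          # normalized activations
    return gamma * x_hat + beta

x = torch.randn(32, 4)
out = batch_norm(x, gamma=torch.ones(4), beta=torch.zeros(4))
# out.mean(0) is ~0 and out.std(0) is ~1 before the scale/shift takes effect
```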
What are the key features of VGGNet?
VGGNet stacks small 3x3 convolutional filters in place of larger ones: two stacked 3x3 layers cover the same receptive field as a single 5x5 layer while using fewer parameters and adding an extra non-linearity, which enables deeper networks with better performance.
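A quick check of the parameter savings, assuming PyTorch and 64 channels as an illustrative example:

```python
import torch.nn as nn

# Two stacked 3x3 convolutions cover a 5x5 receptive field with fewer
# parameters than a single 5x5 convolution.
stacked = nn.Sequential(
    nn.Conv2d(64, 64, kernel_size=3, padding=1), nn.ReLU(inplace=True),
    nn.Conv2d(64, 64, kernel_size=3, padding=1), nn.ReLU(inplace=True),
)
single = nn.Conv2d(64, 64, kernel_size=5, padding=2)

params = lambda m: sum(p.numel() for p in m.parameters())
print(params(stacked), params(single))  # ~74k vs ~102k parameters
```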
Explain the Inception module in GoogLeNet.
The Inception module in GoogLeNet combines multiple convolutional filters of different sizes and pooling operations within the same layer, allowing the network to capture features at various scales.
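A simplified Inception-style block sketched in PyTorch; the channel counts are illustrative, and the 1x1 bottleneck convolutions GoogLeNet places before the larger filters are omitted for brevity:

```python
import torch
import torch.nn as nn

# Parallel 1x1, 3x3, 5x5 convolutions and max pooling, concatenated along
# the channel dimension so features at several scales are captured together.
class InceptionBlock(nn.Module):
    def __init__(self, in_ch):
        super().__init__()
        self.b1 = nn.Conv2d(in_ch, 16, kernel_size=1)
        self.b3 = nn.Conv2d(in_ch, 16, kernel_size=3, padding=1)
        self.b5 = nn.Conv2d(in_ch, 16, kernel_size=5, padding=2)
        self.pool = nn.Sequential(
            nn.MaxPool2d(kernel_size=3, stride=1, padding=1),
            nn.Conv2d(in_ch, 16, kernel_size=1),
        )

    def forward(self, x):
        return torch.cat([self.b1(x), self.b3(x), self.b5(x), self.pool(x)], dim=1)

out = InceptionBlock(32)(torch.randn(1, 32, 28, 28))  # -> (1, 64, 28, 28)
```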
What is a Residual Network (ResNet)?
ResNet introduces residual blocks with shortcut connections that bypass one or more layers, allowing very deep networks to be trained by mitigating the vanishing gradient problem.
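A minimal residual block sketched in PyTorch; the channel count and layer details are illustrative, loosely following the "basic block" used in ResNet-18/34:

```python
import torch
import torch.nn as nn

# The shortcut adds the input to the output of two conv layers, so the block
# learns a residual F(x) and outputs F(x) + x.
class BasicBlock(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)  # shortcut connection

y = BasicBlock(64)(torch.randn(1, 64, 8, 8))  # same shape as the input
```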
Describe the concept of feature learning in CNNs.
Feature learning in CNNs refers to the automatic extraction and learning of hierarchical features from raw input images, replacing the need for manual feature engineering.
What is the role of the softmax function in CNNs?
The softmax function is used in the output layer of CNNs for multi-class classification; it converts the raw output scores (logits) into a probability distribution by exponentiating them and normalizing so they sum to one.
Explain the concept of transfer learning in CNNs.
Transfer learning involves using a pre-trained network on a large dataset and fine-tuning it on a smaller, task-specific dataset, leveraging the learned features to improve performance and reduce training time.
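A fine-tuning sketch assuming torchvision's pretrained ResNet-18 and a hypothetical 10-class target task; the weights-enum API shown is from recent torchvision versions and may differ in older ones:

```python
import torch.nn as nn
from torchvision import models

# Load an ImageNet-pretrained backbone, freeze it, and replace the head.
model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
for p in model.parameters():
    p.requires_grad = False                      # freeze pretrained features
model.fc = nn.Linear(model.fc.in_features, 10)   # new task-specific head

# Train only model.fc at first; later, optionally unfreeze the backbone and
# fine-tune the whole network with a small learning rate.
```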
Write the formula for the softmax function.
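For a vector of logits $z = (z_1, \dots, z_K)$, the softmax is $\mathrm{softmax}(z)_i = \dfrac{e^{z_i}}{\sum_{j=1}^{K} e^{z_j}}$, which yields a probability distribution over the $K$ classes.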
Provide the formula for the ReLU activation function.
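$\mathrm{ReLU}(x) = \max(0, x)$, i.e. the function passes positive inputs through unchanged and maps negative inputs to zero.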
What is the formula for updating weights using gradient descent in neural networks?
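$w \leftarrow w - \eta \, \dfrac{\partial L}{\partial w}$, where $w$ is a weight, $\eta$ is the learning rate, and $L$ is the loss function; each step moves the weight in the direction of the negative gradient.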