Learning from Data: Images Flashcards

Question 1

Q

How are images represented within a computer?

Answer

A

Images are represented as numerical matrices, where each pixel is a value indicating its intensity (e.g., 0–255 for grayscale). For color images, three matrices (red, green, blue channels) are used.

Question 2

Q

What was one of the major issues preventing widespread use of CNNs after LeNet’s introduction?

Answer

A

Limited computational power and lack of large-scale labeled datasets prevented the widespread use of CNNs after LeNet’s introduction.

Question 3

Q

How do residual (skip) connections help address the vanishing gradient problem?

Answer

A

Residual connections allow gradients to bypass certain layers, preserving their strength during backpropagation and enabling the training of deeper networks.

Question 4

Q

Why can’t traditional dense neural networks effectively process image data?

Answer

A

Dense networks flatten images into vectors, losing the spatial structure (e.g., pixel relationships), and require impractical numbers of parameters to handle high-dimensional image data.

Question 5

Q

What is fine-tuning in the context of CNN models?

Answer

A

Fine-tuning is adapting a pre-trained CNN model to a new task by adjusting its parameters on a smaller, task-specific dataset.

Question 6

Q

What is the purpose of applying convolutional filters to an image?

Answer

A

Their purpose is to detect specific patterns and features in images (the data they’re given).

Question 7

Q

What are the four types of detection tasks that CNN filters can perform according to the slides?

Answer

A

Edge detection
Corner detection
Texture detection
Object detection.

Question 8

Q

What is the significance of CNNs being ‘equivariant to translations’?

Answer

A

Because if we shift the features of an image around, the CNN will still be able to detect those features.

Question 9

Q

Why does shifting an image cause problems when using a flattened vector approach to image processing?

Answer

A

It disrupts spatial relationships between pixels, making it harder to recognize patterns like edges or shapes. It thinks it’s a different image

Question 10

Q

What role did GPUs play in advancing CNN technology?

Answer

A

GPUs enabled faster training of CNNs by handling the massive parallel computations required for convolutional operations.

Question 11

Q

Why might a fine-tuned ResNet model perform better than a model trained from scratch on a specific task?

Answer

A

A fine-tuned ResNet uses pre-trained features, saving time and data, while a model trained from scratch has to learn everything from the beginning

Question 12

Q

What are the two reasons that CNNs have become more widely used in research?

Answer

A

Advances in hardware (GPUs)
Availability of large datasets.

Learning from Data: Images Flashcards

(12 cards)