Learning from Data: Images Flashcards

1
Q

How are images represented within a computer?

A

Images are represented as numerical matrices, where each pixel is a value indicating its intensity (e.g., 0–255 for grayscale). For color images, three matrices (red, green, blue channels) are used.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What was one of the major issues preventing widespread use of CNNs after LeNet’s introduction?

A

Limited computational power and lack of large-scale labeled datasets prevented the widespread use of CNNs after LeNet’s introduction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do residual (skip) connections help address the vanishing gradient problem?

A

Residual connections allow gradients to bypass certain layers, preserving their strength during backpropagation and enabling the training of deeper networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why can’t traditional dense neural networks effectively process image data?

A

Dense networks flatten images into vectors, losing the spatial structure (e.g., pixel relationships), and require impractical numbers of parameters to handle high-dimensional image data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is fine-tuning in the context of CNN models?

A

Fine-tuning is adapting a pre-trained CNN model to a new task by adjusting its parameters on a smaller, task-specific dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the purpose of applying convolutional filters to an image?

A

Their purpose is to detect specific patterns and features in images (the data they’re given).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the four types of detection tasks that CNN filters can perform according to the slides?

A
  1. Edge detection
  2. Corner detection
  3. Texture detection
  4. Object detection.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the significance of CNNs being ‘equivariant to translations’?

A

Because if we shift the features of an image around, the CNN will still be able to detect those features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why does shifting an image cause problems when using a flattened vector approach to image processing?

A

It disrupts spatial relationships between pixels, making it harder to recognize patterns like edges or shapes. It thinks it’s a different image

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What role did GPUs play in advancing CNN technology?

A

GPUs enabled faster training of CNNs by handling the massive parallel computations required for convolutional operations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why might a fine-tuned ResNet model perform better than a model trained from scratch on a specific task?

A

A fine-tuned ResNet uses pre-trained features, saving time and data, while a model trained from scratch has to learn everything from the beginning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the two reasons that CNNs have become more widely used in research?

A
  1. Advances in hardware (GPUs)
  2. Availability of large datasets.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly