Week 5: Modern Computer Vision Applications 2 Flashcards

1
Q

What is GAN in the context of image generation?

A

GAN stands for Generative Adversarial Network, a deep learning framework comprising two neural networks, a generator, and a discriminator, competing in a game-like scenario to produce realistic images.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does GAN stand for in the context of pre-trained architectures on various datasets?

A

GAN stands for Generative Adversarial Network, which involves two neural networks, a generator and a discriminator, trained adversarially to produce synthetic data. Pre-trained GAN models are models that have been trained on large datasets and are capable of generating realistic images specific to those datasets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the goal of generative models?

A

To generate realistic data, such as images, text, or audio.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are some examples of generative models?

A

Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs), and Deep Generative Models (DGMs).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the basic structure of a GAN?

A

It consists of two neural networks: a generator and a discriminator. The generator tries to create realistic data, while the discriminator tries to distinguish between real and fake data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How are GANs trained?

A

The generator and discriminator are trained in an adversarial manner. The generator is updated to better fool the discriminator, and the discriminator is updated to better identify fake data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are some applications of image-to-image translation?

A

Photo editing, style transfer, medical imaging, and data augmentation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is pix2pix?

A

A conditional GAN architecture for paired image-to-image translation. It requires a training dataset of paired images (e.g., grayscale and color images).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is CycleGAN?

A

A GAN architecture for unpaired image-to-image translation. It does not require paired images and instead relies on cycle consistency loss to ensure that the translated images are realistic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are some challenges with training GANs?

A

Unstable training, mode collapse, and difficulty in evaluating the quality of the generated data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly