Week 5: Modern Computer Vision Applications 2 Flashcards

Question 1

Q

What is GAN in the context of image generation?

Answer

A

GAN stands for Generative Adversarial Network, a deep learning framework comprising two neural networks, a generator, and a discriminator, competing in a game-like scenario to produce realistic images.

Question 2

Q

What does GAN stand for in the context of pre-trained architectures on various datasets?

Answer

A

GAN stands for Generative Adversarial Network, which involves two neural networks, a generator and a discriminator, trained adversarially to produce synthetic data. Pre-trained GAN models are models that have been trained on large datasets and are capable of generating realistic images specific to those datasets.

Question 3

Q

What is the goal of generative models?

Answer

A

To generate realistic data, such as images, text, or audio.

Question 4

Q

What are some examples of generative models?

Answer

A

Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs), and Deep Generative Models (DGMs).

Question 5

Q

What is the basic structure of a GAN?

Answer

A

It consists of two neural networks: a generator and a discriminator. The generator tries to create realistic data, while the discriminator tries to distinguish between real and fake data.

Question 6

Q

How are GANs trained?

Answer

A

The generator and discriminator are trained in an adversarial manner. The generator is updated to better fool the discriminator, and the discriminator is updated to better identify fake data.

Question 7

Q

What are some applications of image-to-image translation?

Answer

A

Photo editing, style transfer, medical imaging, and data augmentation.

Question 8

Q

What is pix2pix?

Answer

A

A conditional GAN architecture for paired image-to-image translation. It requires a training dataset of paired images (e.g., grayscale and color images).

Question 9

Q

What is CycleGAN?

Answer

A

A GAN architecture for unpaired image-to-image translation. It does not require paired images and instead relies on cycle consistency loss to ensure that the translated images are realistic.

Question 10

Q

What are some challenges with training GANs?

Answer

A

Unstable training, mode collapse, and difficulty in evaluating the quality of the generated data.

Week 5: Modern Computer Vision Applications 2 Flashcards

(10 cards)