DL-09 - Generative models Flashcards
DL-09 - Generative models
What are the types of generative models mentioned in the lecture slides? (2)
- Variational autoencoders
- Generative adversarial networks
DL-09 - Generative models
What is VAE short for?
Variational autoencoder
DL-09 - Generative models
What is GAN short for?
Generative adversarial network
DL-09 - Generative models
What is a loss function you might use in an autoencoder?
MSE (mean squared error)
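For example, a minimal PyTorch sketch (the `model` argument is a placeholder for any autoencoder):

```python
import torch
import torch.nn.functional as F

def reconstruction_loss(model, x):
    """MSE between an input batch and the autoencoder's reconstruction."""
    x_hat = model(x)              # encode, then decode
    return F.mse_loss(x_hat, x)   # mean squared error over all elements

# Toy check with an identity "autoencoder": a perfect reconstruction gives 0.
x = torch.rand(8, 1, 28, 28)
print(reconstruction_loss(torch.nn.Identity(), x))  # tensor(0.)
```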
DL-09 - Generative models
What is a key consideration when building an autoencoder related to the latent space?
The dimensionality of the latent space.
DL-09 - Generative models
The dimensionality of the latent space impacts what in an autoencoder?
The quality of the results; smaller latent spaces typically lead to poorer reconstructions.
DL-09 - Generative models
What are the two main approaches for implementing decoders in autoencoders? (2)
- Traditional techniques such as k-NN (taking the average value of the nearest pixels) or bilinear interpolation
- Transposed convolution
DL-09 - Generative models
Autoencoders typically use _______ for implementing decoders. (1)
Transposed Convolution
DL-09 - Generative models
What is a transposed convolution?
An upsampling operation that increases the spatial resolution of an image (or feature map) using learnable kernel weights.
DL-09 - Generative models
How do you perform a transposed convolution?
1) Multiply each input element by the kernel, producing an intermediate tensor per element.
2) Position each intermediate tensor inside the larger output tensor, offset according to the input element's position (and the stride).
3) Sum the values wherever the intermediate tensors overlap (see the code sketch below).
(See image)
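Since the image isn't reproduced here, a minimal NumPy sketch of those three steps (single channel, no padding; the stride parameter is an assumption of this sketch):

```python
import numpy as np

def transposed_conv2d(x, kernel, stride=1):
    """2-D transposed convolution following the three steps above."""
    h, w = x.shape
    kh, kw = kernel.shape
    out = np.zeros((stride * (h - 1) + kh, stride * (w - 1) + kw))
    for i in range(h):
        for j in range(w):
            # Step 1: multiply one input value by the whole kernel.
            patch = x[i, j] * kernel
            # Steps 2 and 3: position the patch in the larger output
            # tensor and sum the overlapping values.
            out[i * stride:i * stride + kh, j * stride:j * stride + kw] += patch
    return out

x = np.array([[0., 1.], [2., 3.]])
k = np.array([[0., 1.], [2., 3.]])
print(transposed_conv2d(x, k))
# [[ 0.  0.  1.]
#  [ 0.  4.  6.]
#  [ 4. 12.  9.]]   <- a 2x2 input upsampled to 3x3
```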
DL-09 - Generative models
What is the main difference between AEs and VAEs?
- AEs are deterministic: the same input always yields the same output.
- VAEs are probabilistic: the latent code is sampled from a distribution, so they can generate new outputs.
DL-09 - Generative models
What is the architecture for a VAE?
(See image)
DL-09 - Generative models
What model is in the image? (See image)
A variational autoencoder.
DL-09 - Generative models
Which format does VAE use to describe each latent attribute?
VAE uses probability distributions.
DL-09 - Generative models
How does VAE generate a vector for the decoder model?
VAE randomly samples from each latent state distribution.
DL-09 - Generative models
What does an ideal autoencoder learn?
Descriptive attributes of input data in a compressed representation.
E.g. from a face:
- Smile
- Gender
- Beard
…
DL-09 - Generative models
Describe visually how VAEs represent latent attributes.
(See image)
DL-09 - Generative models
What is a latent attribute?
A latent attribute is a hidden descriptive feature of the data, such as gender, emotion, or skin tone in facial images.
DL-09 - Generative models
Describe how we generate new data with a VAE.
- Sample latent attributes.
- Send to decoder.
- Decoder generates output.
(See image)
DL-09 - Generative models
What should happen for values that are close in latent space?
They should produce very similar reconstructions.
DL-09 - Generative models
How can we achieve an interpretation of what a VAE network is learning?
By perturbing one latent variable while keeping all the other variables fixed.
DL-09 - Generative models
What is an approach to encourage independence of latent features in VAEs?
Applying independent component analysis (ICA) to the encoder output.
DL-09 - Generative models
Why do we want features to be uncorrelated?
To learn the richest and most compact representation possible.
DL-09 - Generative models
To learn representations that are rich and compact, what property do we need in the latent space?
The latent features should be uncorrelated.
DL-09 - Generative models
What terms does the VAE loss consist of?
Reconstruction loss + KL divergence term
(See image)
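A minimal PyTorch sketch of this loss, assuming the encoder outputs `mu` and `logvar` (log-variance) and `x_hat` is the decoder's reconstruction; MSE is used here as the reconstruction term:

```python
import torch
import torch.nn.functional as F

def vae_loss(x_hat, x, mu, logvar):
    # Reconstruction term: how well the decoder reproduces the input.
    recon = F.mse_loss(x_hat, x, reduction="sum")
    # KL term: divergence of q(z|x) = N(mu, sigma^2) from the prior N(0, 1),
    # in the closed form given on the KL cards below.
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return recon + kl
```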
DL-09 - Generative models
What is KL divergence short for?
Kullback-Leibler divergence
DL-09 - Generative models
What is KL divergence?
KL divergence is a measure of how one probability distribution differs from another.
DL-09 - Generative models
How do you compute KL divergence for VAE loss? (See image)
By choosing a fixed prior distribution p(z) based on some initial hypothesis or guess; the KL term then measures how far the learned approximate posterior q(z|x) diverges from this prior.
(See image)
DL-09 - Generative models
What is the general formula for KL divergence?
(See image)
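The image isn't reproduced here; for reference, the standard definition (discrete form) is:

$$ D_{\mathrm{KL}}(P \,\|\, Q) = \sum_{x} P(x) \log \frac{P(x)}{Q(x)} $$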
DL-09 - Generative models
What formula is this?
(See image)
DL-09 - Generative models
What prior distribution do we assume for p(z) in VAE?
A standard normal (Gaussian) distribution, with mean 0 and standard deviation 1.
DL-09 - Generative models
What is the actual formula we use for KL divergence in a VAE?
(See image)
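The image isn't reproduced here; assuming the standard normal prior N(0, 1) from the previous card and a diagonal Gaussian encoder q(z|x) = N(mu, sigma^2), the usual closed form (summing over the latent dimensions j) is:

$$ D_{\mathrm{KL}}\big(\mathcal{N}(\mu, \sigma^2) \,\|\, \mathcal{N}(0, 1)\big) = -\frac{1}{2} \sum_{j} \left( 1 + \log \sigma_j^2 - \mu_j^2 - \sigma_j^2 \right) $$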
DL-09 - Generative models
Why would it be difficult to backprop in VAEs?
We cannot backpropagate gradients through the sampling layer, because sampling is stochastic and therefore not differentiable. (See image)
DL-09 - Generative models
What trick we use to make backprop work in VAEs?
Reparameterization.
(See image)
DL-09 - Generative models
Describe how the reparameterization trick works for VAEs.
Instead of sampling z directly from N(mu, sigma^2), sample eps ~ N(0, 1) and compute z = mu + sigma * eps. The randomness is isolated in eps, so gradients can flow through mu and sigma. (See image)
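A minimal PyTorch sketch, assuming the encoder outputs `mu` and `logvar`:

```python
import torch

def reparameterize(mu, logvar):
    sigma = torch.exp(0.5 * logvar)   # log-variance -> standard deviation
    eps = torch.randn_like(sigma)     # eps ~ N(0, 1): all randomness lives here
    return mu + sigma * eps           # z ~ N(mu, sigma^2), differentiable in mu and sigma
```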
DL-09 - Generative models
How can VAEs be used to uncover bias in a dataset?
(See image)
DL-09 - Generative models
How can we use VAEs to automatically debias data?
(See image)
DL-09 - Generative models
How can (V)AEs denoise images?
Train the (V)AE with noisy inputs while computing the loss against the clean images, so it learns to predict the clean output.
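A minimal sketch of one such training step, assuming `model` is an autoencoder and `noise_std` is a hypothetical noise level:

```python
import torch
import torch.nn.functional as F

def denoising_step(model, x_clean, optimizer, noise_std=0.1):
    x_noisy = x_clean + noise_std * torch.randn_like(x_clean)  # corrupt the input
    x_hat = model(x_noisy)               # reconstruct from the noisy version
    loss = F.mse_loss(x_hat, x_clean)    # but compare against the *clean* target
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```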
DL-09 - Generative models
How can (V)AEs be used for anomaly detection?
Measure the reconstruction loss; if it is above some threshold derived from the training data, assume the input is an outlier.
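A minimal sketch, assuming `model` is a trained autoencoder and `train_data` is a batch of normal (non-anomalous) examples; the mean + k*std threshold is a common heuristic, not something specified on the card:

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def fit_threshold(model, train_data, k=3.0):
    errs = F.mse_loss(model(train_data), train_data, reduction="none")
    errs = errs.flatten(1).mean(dim=1)      # one reconstruction error per example
    return (errs.mean() + k * errs.std()).item()

@torch.no_grad()
def is_anomaly(model, x, threshold):
    return F.mse_loss(model(x), x).item() > threshold
```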
DL-09 - Generative models
What is the main idea behind GAN?
The back-and-forth competition between the discriminator and the generator (forger).
DL-09 - Generative models
Which two components compete in a GAN?
The generator (forger) and the discriminator.
DL-09 - Generative models
What is the generator’s role in a GAN?
The generator learns to create plausible data from noise/random input.
DL-09 - Generative models
What is the discriminator’s role in a GAN?
The discriminator learns to distinguish between fake data produced by the generator and real data.
DL-09 - Generative models
How do GANs improve their performance?
By training both models simultaneously through an adversarial process.
DL-09 - Generative models
What is the goal of a generative network in a GAN?
To produce data that are indistinguishable from real data, e.g. images.
DL-09 - Generative models
What input does a generator work on?
Randomly sampled noise.
DL-09 - Generative models
Describe how backprop works in the discriminator.
The discriminator classifies both real data and fakes from the generator; its loss penalizes misclassifications, and backpropagation updates only the discriminator's weights. (See image)
DL-09 - Generative models
Describe how backprop works in the generator.
The generator's loss penalizes fakes that the discriminator classifies as fake; the gradient flows backward through the discriminator (whose weights stay fixed) into the generator, and only the generator's weights are updated. (See image)
DL-09 - Generative models
How does the GAN training process go?
In alternating phases.
- The discriminator trains for one or more epochs.
- Then the generator trains for one or more epochs (see the sketch below).
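A minimal PyTorch sketch of one alternating round (one step per phase for brevity); `G`, `D`, the optimizers, and `z_dim` are assumptions about the setup, with `D` returning a probability of "real":

```python
import torch

def train_round(G, D, opt_d, opt_g, real, z_dim=100):
    bce = torch.nn.BCELoss()
    ones = torch.ones(real.size(0), 1)    # label for "real"
    zeros = torch.zeros(real.size(0), 1)  # label for "fake"

    # Phase 1: train the discriminator (generator output detached, so G is frozen).
    fake = G(torch.randn(real.size(0), z_dim)).detach()
    d_loss = bce(D(real), ones) + bce(D(fake), zeros)
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()

    # Phase 2: train the generator (only opt_g steps, so D's weights stay fixed).
    fake = G(torch.randn(real.size(0), z_dim))
    g_loss = bce(D(fake), ones)  # the generator wants fakes classified as real
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()
    return d_loss.item(), g_loss.item()
```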
DL-09 - Generative models
What are some commonly used loss functions for GANs? (2)
- Min-max loss
- Wasserstein loss
DL-09 - Generative models
What’s the formula for min-max loss?
(See image)
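The image isn't reproduced here; the standard form, from the original GAN formulation, is:

$$ \min_G \max_D \; \mathbb{E}_{x}\big[\log D(x)\big] + \mathbb{E}_{z}\big[\log\big(1 - D(G(z))\big)\big] $$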
DL-09 - Generative models
In min-max loss, what are the different terms?
- D(x): the discriminator's estimate of the probability that real instance x is real.
- E_x: the expected value over all real data instances.
- G(z): the generator's output for noise z.
- D(G(z)): the discriminator's estimate of the probability that a fake instance is real.
- E_z: the expected value over all noise inputs to the generator.
DL-09 - Generative models
What roles do the discriminator/generator have in min-max loss? I.e. how do they impact it? (2)
- Generator: minimize
- Discriminator: maximize
DL-09 - Generative models
What are some common issues with min-max loss? (4) (VMUC)
- Vanishing gradients
- Mode collapse
- Unbalanced updates
- Convergence
DL-09 - Generative models
In min-max loss, when does the problem of vanishing gradients occur?
If the discriminator is too good.
DL-09 - Generative models
In min-max loss, what is mode collapse?
A situation in which the generator gets stuck producing only a few types of outputs that fool the discriminator, rather than a diverse range of outputs.
DL-09 - Generative models
In min-max loss, what is the problem of unbalanced updates?
Min-max loss requires the generator and discriminator to be trained alternately, which can lead to unbalanced updates where one model dominates the other, causing instability.
DL-09 - Generative models
What does the generator do when mode collapse occurs in min-max loss?
The generator produces only a few types of outputs instead of a diverse range.
DL-09 - Generative models
In min-max loss, what can lead to unbalanced updates and instability?
Training the generator and discriminator alternately.
DL-09 - Generative models
In min-max loss, what is the problem of convergence?
As the generator improves, the discriminator gets worse at telling real from fake, so its feedback becomes increasingly random; continuing to train the generator on this random feedback degrades the generator's quality.
DL-09 - Generative models
What is WGAN short for?
Wasserstein GAN
DL-09 - Generative models
What is the difference between Wasserstein GANs and normal GANs?
Wasserstein GANs (WGANs) use a critic network, which outputs real-valued scores instead of the discriminator's binary real/fake classification.
DL-09 - Generative models
How are Wasserstein GANs different from a GAN with min-max loss?
- Discriminator (min-max GAN): outputs a binary classification (real or fake).
- Critic network (WGAN): outputs a real-valued score for how real or fake an instance looks, rather than a probability.
DL-09 - Generative models
What components does the Wasserstein loss consist of? (2)
- Critic loss
- Generator loss
DL-09 - Generative models
What’s the formula for WGAN’s critic loss?
D(x) - D(G(z))
The critic tries to maximize this difference.
D(x): the critic's score for a real instance.
D(G(z)): the critic's score for a fake instance.
DL-09 - Generative models
What’s the formula for WGAN’s generator loss?
D(G(z)): the generator tries to maximize the critic's score for a fake instance.
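A minimal sketch of both WGAN losses as minimization objectives (gradient descent minimizes, so each "maximize" becomes a minimized negative); the Lipschitz constraint WGANs need in practice (weight clipping or a gradient penalty) is omitted here:

```python
import torch

def critic_loss(D, G, real, z):
    # The critic maximizes D(x) - D(G(z)): minimize the negative instead.
    return -(D(real).mean() - D(G(z).detach()).mean())

def generator_loss(D, G, z):
    # The generator maximizes D(G(z)): again, minimize the negative.
    return -D(G(z)).mean()
```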
DL-09 - Generative models
What are some benefits of using WGANs? (2)
Less vulnerable to:
- Vanishing gradient
- Mode collapse
DL-09 - Generative models
What is a progressive GAN?
A GAN that starts with a small network producing low-resolution images, then adds layers over time to produce progressively higher-resolution images.
DL-09 - Generative models
What are some benefits of using a progressive GAN? (2)
- Faster training
- Higher resolution outputs
DL-09 - Generative models
What is DCGAN short for?
Deep Convolutional GAN
DL-09 - Generative models
Explain what an Image-to-image translation GAN is.
Takes an image as input and maps it to a generated output image with different properties. (See image)
DL-09 - Generative models
Describe what a Cycle GAN is.
CycleGANs learn to transform images from one set (domain) into images that could plausibly belong to another. (See image)
DL-09 - Generative models
What is a Super-resolution GAN?
A GAN that increases the resolution of a blurry, low-resolution input image, i.e. tries to "unblur" it. (See image)
DL-09 - Generative models
What is Face inpainting?
Filling in missing or masked regions of a face image with plausible, model-generated content. (See image)