VQ-VAE Flashcards

1
Q

What are the 3 main contributions of VQ-VAE in comparison to VAE

A

1 Quantisation of the latent space
2 Restrict the latent space to a set of vectors (codebook).
3 The prior of the codebook is learned rather than static

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How does it changes the AE framework

A

Quantisation and restriction of the latent space to predefined codewords.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does it changes the sampling procedure of VAE?

A

It assumes that the prior is a uniform distribution over the codebook and the posterior of the decoder input is a delta function that gives back the nearest codeword

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What happens to the KL divergence

A

Becomes constant and is removed from the learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What happens to the prior

A

It is learned in the training process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the loss?

A

The reconstruction loss, the codebook loss and the commitment loss

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the codebook loss?

A

∣∣sg[Ze(x)]−ek∣∣2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the commitment loss?

A

β∣∣Ze(x)−sg[ek]∣∣2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why is VQ-VAE more efficient than pixel-space autoregressive models when generating images?

A

Because it samples an autoregressive model only in the compressed latent space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is an autoregressive model?

A

An autoregressive model is a type of model where the output at each step depends on the input and previously generated outputs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

VQ-VAE-2 - How do they contribute in comparison to VQ-VAE?

A

They demonstrate that a multi-scale hierarchical organization of VQ-VAE, augmented with powerful priors over the latent codes, can generate samples with quality that rivals GAN.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly