VQ-VAE Flashcards

Question 1

Q

VQ-VAE - what are the 3 main contributions of VQ-VAE in comparison to VAE

Answer

A

1 Quantisation of the latent space
2 Restrict the latent space to a linear combination of a set of vectors (codebook).
3 The prior of the codebook is learned rather than static

Question 2

Q

VAE - what are the 4 main principles that VAE is based on

Answer

A

1 AE framework with continuous latents
2 A Gaussian sampling procedure
3 The prior of the latent vector follows a Gaussian distribution with u=0 and sigma=1
4 a reconstruction loss and a KL divergence loss

Question 3

Q

VQ-VAE - how does it changes the AE framework

Answer

A

There’s a quantisation step in the middle which convert the continuous feature extraction output to a quantised entity

Question 4

Q

VQ-VAE - How does it changes the sampling procedure?

Answer

A

It assumes that the prior is a uniform distribution over the codebook and the posterior of the decoder input is a delta function that gives back the nearest codeword

Question 5

Q

VQ-VAE - what happens to the KL divergence

Answer

A

Becomes constant and is removed from the learning

Question 6

Q

VQ-VAE - What happens to the prior

Answer

A

It is learned in the training process

Question 7

Q

VQ-VAE - what is the loss?

Answer

A

The reconstruction loss, the codebook loss and the commitment loss

Question 8

Q

VQ-VAE - What is the codebook loss?

Answer

A

MSE from the features of the input to the codeword (putting stop-gradient on the input features)

Question 9

Q

VQ-VAE - What is the commitment loss?

Answer

A

The squared distance between the encoder to the stop-gradient of its closest codeword.

Brainscape's Knowledge GenomeTM

VQ-VAE Flashcards

Brainscape's Knowledge Genome^TM