Diffusion Models Flashcards
1
Q
What are the 3 parts of the Stable Diffusion model?
A
A language model, a diffusion model, and a decoder
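A toy sketch of how the three parts chain together. The function names (`encode_text`, `run_diffusion`, `decode`, `generate`) are hypothetical stand-ins, not a real library API, and every body is a placeholder that only mimics the data flow: prompt, then representation, then latent, then image.

```python
import random


def encode_text(prompt, dim=4):
    # Stand-in for the language model: prompt -> fixed-size vector.
    # Seeding with the prompt string just makes the toy deterministic.
    rng = random.Random(prompt)
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def run_diffusion(text_embedding, seed=0):
    # Stand-in for the diffusion model: starts from Gaussian noise and
    # returns a latent of the same size. The mixing below is an arbitrary
    # placeholder, NOT real denoising.
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) for _ in text_embedding]
    return [0.5 * n + 0.5 * e for n, e in zip(noise, text_embedding)]


def decode(latent):
    # Stand-in for the decoder: latent -> "pixel" values clamped to 0..255.
    return [max(0, min(255, round(127.5 * (v + 1.0)))) for v in latent]


def generate(prompt):
    embedding = encode_text(prompt)    # part 1: language model
    latent = run_diffusion(embedding)  # part 2: diffusion model
    return decode(latent)              # part 3: decoder


image = generate("a cat on a sofa")
```

The point is only the order of the calls in `generate`; none of the numbers mean anything.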
2
Q
At a high level what does the language model in stable diffusion do?
A
Transforms the text prompt you enter into a representation that can be fed to the diffusion model
3
Q
What is SD’s diffusion model?
A
Basically a time-conditional U-Net
4
Q
What does SD’s diffusion model take as input?
A
Some Gaussian noise and the representation of the text prompt
5
Q
What does SD’s diffusion model do with its inputs?
A
Repeatedly denoises the Gaussian noise (one of its inputs) over several steps, conditioned on the representation of your text prompt, so the result gets closer to an image representation that matches the prompt
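A minimal sketch of the "denoise several times" idea, under loud assumptions: `predict_noise` is a hypothetical stand-in for the time-conditional U-Net (no trained network), the "clean" target it implies is made up from the conditioning vector, and the fixed `step_size` update is a simplification, not the noise schedule a real sampler uses.

```python
import random


def predict_noise(x, t, cond):
    # Hypothetical stand-in for the time-conditional U-Net.
    # A real model is a trained network that uses the timestep t;
    # this toy accepts t but ignores it.
    target = [2.0 * c for c in cond]  # made-up "clean" latent implied by the prompt
    return [xi - ti for xi, ti in zip(x, target)]


def denoise(cond, steps=50, step_size=0.2, seed=0):
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in cond]  # input 1: Gaussian noise
    for t in reversed(range(steps)):         # timesteps count down to 0
        eps = predict_noise(x, t, cond)      # input 2: prompt representation
        # Remove a fraction of the predicted noise at each step.
        x = [xi - step_size * ei for xi, ei in zip(x, eps)]
    return x


cond = [0.5, -0.25, 1.0]  # pretend text-prompt representation
latent = denoise(cond)
```

Each pass removes a little of the predicted noise, so after enough steps the latent settles near the target that the conditioning implies.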