Diffusion Models Flashcards

1
Q

3 parts of stable diffusion model

A

A language model, a diffusion model, and a decoder

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

At a high level what does the language model in stable diffusion do?

A

transforms the text prompt you enter to a representation that can be fed to the diffusion model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

SD’s diffusion model is what?

A

Basically a time conditional U-Net

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does SD’s diffusion model take as input?

A

some Gaussian noise and the representation of the text prompt

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does SD’s diffusion model do with its inputs ?

A

Denoise (for several times) the Gaussian noise to get closer to the representation of your text prompt [IIRC Gaussian noise is one of the inputs]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly