How does Stable Diffusion work Flashcards

1
Q

Source of Info

A

https://stable-diffusion-art.com/how-stable-diffusion-work/

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Diffusion Model
Stable Diffusion belongs to a class of —- ——- models called ——- models.

They are ——– models, meaning they are designed to ——– new data similar to what they have —- in ———-.

In the case of Stable Diffusion, the data are ——.

A

Diffusion Model
Stable Diffusion belongs to a class of deep learning models called diffusion models.

They are generative models, meaning they are designed to generate new data similar to what they have seen in training.

In the case of Stable Diffusion, the data are images.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

hy is it called the diffusion model?

A

Because its math looks very much like diffusion in physics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is forward diffusion?

A

A forward diffusion process adds noise to a training image, gradually turning it into an uncharacteristic noise image.
The forward process will turn any cat or dog image into a noise image.
Eventually, you won’t be able to tell whether they are initially a dog or a cat.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Reverse Diffusion

A

Starting from a noisy, meaningless image, reverse diffusion recovers a cat OR a dog image.
Technically, every diffusion process has two parts: (1) drift and (2) random motion.
The reverse diffusion drifts towards either cat OR dog images but nothing in between. That’s why the result can either be a cat or a dog.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How is training done

A
  1. The answer is teaching a neural network model to predict the noise added.
  2. It is called the noise predictor in Stable Diffusion. It is a U-Net model.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Explain the 4 steps in stable diffusion training.

A
  1. Pick a training image, like a photo of a cat.
  2. Generate a random noise image.
  3. Corrupt the training image by adding this noisy image up to a certain number of steps.
  4. Teach the noise predictor to tell us how much noise was added. This is done by tuning its weights and showing it the correct answer.

After training, we have a noise predictor capable of estimating the noise added to an image.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How does reverse diffusion work

A
  1. We first generate a completely random image and ask the noise predictor to tell us the noise.
  2. We then subtract this estimated noise from the original image.
  3. Repeat this process a few times. You will get an image of either a cat or a dog.

This image is unconditioned.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How large is the Stable Diffusion model?

A

The file size of Stable Diffusion v1.5 model is 4.27GB, but it depends on what kind of models you install on your computer.
The basic setup (including Python, Git for Windows, and WebUI) takes roughly 9.37GB of HDD/SDD space.

https://okuha.com/vram-requirements-for-stable-diffusion/

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What system requirements do you need to run SD? (2)

A
  1. roughly 10GB of HDD/SDD space and roughly 8GB of GPU VRAM.
  2. You must also reserve some space for installing Git and Python, as Stable Diffusion requires those to work properly.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How much VRAM should i get?

A

16GB enables you to run Stable Diffusion faster and also allows you to produce larger images and more intricate results with it.
It can still work with less but slower and smaller files

https://okuha.com/vram-requirements-for-stable-diffusion/

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Can you run SD without a graphics card? (2)

A
  1. No, Stable Diffusion requires a graphics card to run.
  2. This is because the software relies on the GPU to calculate and render digital AI art.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly