Stable Diffusion Glossary Flashcards

1
Q

what is a Checkpoint model

A

A checkpoint model is a more precise name for a Stable Diffusion model. It is used to distinguish from LoRA, textual inversion and Lycoris.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a CFG scale

A

The Classifier-Free Guidance scale controls how much the prompt should be followed in txt2img and img2img.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is 4x-Ultrasharp (2)

A

4x-ultrasharp is a popular AI upscaler that produces sharp images.
It is popular among Stable Diffusion users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is an AI upscaler

A

An AI upscaler is an AI model that enlarges an image while adding details.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is an Ancestral sampler (3)

A
  1. An ancestral sampler adds noise to the image at each sampling step.
  2. They are stochastic samplers because the sampling outcome has some randomness to it.
  3. They usually have a standalone letter “a” in their name. E.g. Euler a.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is AnimateDiff (2)

A
  1. AnimateDiff is a text-to-video method for Stable Diffusion.
  2. It uses a motion control model to influence a Stable Diffusion model to generate a video as a sequence of images with motions.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Anything v3

A

Anything v3 is a celebrated anime-style Stable Diffusion model. It is a Stable Diffusion v1.5 model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is AUTOMATIC1111(3)

A
  1. AUTOMATIC1111 is a popular open-source, community-developed user interface for stable diffusion.
  2. AUTOMATIC1111 is the name of the user who started the project.
  3. The official project name is Stable Diffusion Web UI.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Civitai, and how can it be used? (4)

A
  1. Civitai is a website that holds a large number of Stable Diffusion models.
  2. You can use the AUTOMATIC1111 extension Civitai Helper to facilitate the download.
  3. Compared to Hugging Face, Civitai specializes in Stable Diffusion models.
  4. You can see many user-generated images there.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is ComfyUI (2)

A
  1. ComfyUI is a node-based user interface for Stable Diffusion.
  2. It is popular among advanced Stable Diffusion users.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

ControlNet (5)

A
  1. ControlNet is a neural network that controls image generation
  2. **by adding extra conditions. **
  3. You can use it to control human poses and image compositions.
  4. It is a major breakthrough in Stable Diffusion.
  5. There is :
    ControlNet for v1 models
    ControlNet for SDXL models
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is DDIM

A

Denoising Diffusion Implicit Models (DDIM) is one of the first samplers for solving diffusion models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Denoising Strength

A

Denoising strength controls how much the image should change in the img2img process.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Diffusion

A
  1. Diffusion is an AI image-generation technique starting with a random image and gradually denoising it to a clear image.
  2. It is inspired by the Langevin dynamics formulation of the diffusion process in Physics.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

WHat is DPM solver

A

Diffusion Probabilistic Model Solvers (DPM-Solvers) belong to a family of newly developed solvers for diffusion models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Dreambooth (3)

A
  1. Dreambooth is a training technique to modify a checkpoint model.
  2. Needing** as few as 5 images**, you can use it to inject a person or a style into a model.
  3. A dreambooth model needs a trigger keyword in the prompt to trigger the injected subject or style.
17
Q

What is EMA

A
  1. EMA stands for **Exponential Moving Average. **
  2. In a Stable Diffusion model, it is **the average weights over the last training steps. **
  3. Instead of the last training step,** a checkpoint model often use the EMA weights to improve stability**.
18
Q

Embedding

A
  1. An embedding is** the product of textual inversion.**
  2. It is a small file for modifying an image.
  3. You apply embedding by putting in the associated keyword in the prompt or negative prompt.
19
Q

Extension (2)

A
  1. An **extension extends the functionality of AUTOMATIC1111 WebUI. **
  2. eg, ControlNet is implemented through an extension.
20
Q

What is Euler

A

The Euler method is the simplest sampling method for solving a diffusion model.

21
Q

What is Fooocus (3)

A
  1. Fooocus is a Stable Diffusion software designed for simplicity.
  2. It centers the user experience on prompting and image generation.
  3. It’s free and open source.
22
Q

What is Hugging Face (3)

A
  1. Hugging Face is a website that hosts a large amount of AI models.
  2. In addition, they develop tools to help run and host the models.
  3. Compared to Civitai, Hugging Face covers all AI models, not just Stable Diffusion.
23
Q

What is Hypernetwork (2)

A
  1. A hypernetowrk is a small neural network that modifies the cross-attention module of the U-net noise predictor.
  2. Similar to LoRAs and embeddings, they are small model files used for modifying a checkpoint model.
24
Q

What is the Karras noise schedule

A
  1. Karras is a noise schedule proposed in the Karras article, which studied a unified framework for denoising images in diffusion AI models.
25
Q

What is K-diffusion/K-sampler

A

K-diffusion or K-samplers refer to sampling methods that Katherine Crowson’s k-diffusion GitHub repository implemented.

26
Q

What is Latent diffusion

A

The Latent diffusion refers to a diffusion process in the latent space. For example, Stable Diffusion is a latent diffusion model.

27
Q

What is LDM

A

The latent Diffusion Model (LDM) is an AI model that performs diffusion in the latent space.