Stable Diffusion Glossary Flashcards
what is a Checkpoint model
A checkpoint model is a more precise name for a Stable Diffusion model. It is used to distinguish from LoRA, textual inversion and Lycoris.
What is a CFG scale
The Classifier-Free Guidance scale controls how much the prompt should be followed in txt2img and img2img.
What is 4x-Ultrasharp (2)
4x-ultrasharp is a popular AI upscaler that produces sharp images.
It is popular among Stable Diffusion users.
What is an AI upscaler
An AI upscaler is an AI model that enlarges an image while adding details.
What is an Ancestral sampler (3)
- An ancestral sampler adds noise to the image at each sampling step.
- They are stochastic samplers because the sampling outcome has some randomness to it.
- They usually have a standalone letter “a” in their name. E.g. Euler a.
What is AnimateDiff (2)
- AnimateDiff is a text-to-video method for Stable Diffusion.
- It uses a motion control model to influence a Stable Diffusion model to generate a video as a sequence of images with motions.
What is Anything v3
Anything v3 is a celebrated anime-style Stable Diffusion model. It is a Stable Diffusion v1.5 model.
What is AUTOMATIC1111(3)
- AUTOMATIC1111 is a popular open-source, community-developed user interface for stable diffusion.
- AUTOMATIC1111 is the name of the user who started the project.
- The official project name is Stable Diffusion Web UI.
What is Civitai, and how can it be used? (4)
- Civitai is a website that holds a large number of Stable Diffusion models.
- You can use the AUTOMATIC1111 extension Civitai Helper to facilitate the download.
- Compared to Hugging Face, Civitai specializes in Stable Diffusion models.
- You can see many user-generated images there.
What is ComfyUI (2)
- ComfyUI is a node-based user interface for Stable Diffusion.
- It is popular among advanced Stable Diffusion users.
ControlNet (5)
- ControlNet is a neural network that controls image generation
- **by adding extra conditions. **
- You can use it to control human poses and image compositions.
- It is a major breakthrough in Stable Diffusion.
- There is :
ControlNet for v1 models
ControlNet for SDXL models
What is DDIM
Denoising Diffusion Implicit Models (DDIM) is one of the first samplers for solving diffusion models.
What is Denoising Strength
Denoising strength controls how much the image should change in the img2img process.
Diffusion
- Diffusion is an AI image-generation technique starting with a random image and gradually denoising it to a clear image.
- It is inspired by the Langevin dynamics formulation of the diffusion process in Physics.
WHat is DPM solver
Diffusion Probabilistic Model Solvers (DPM-Solvers) belong to a family of newly developed solvers for diffusion models.
What is Dreambooth (3)
- Dreambooth is a training technique to modify a checkpoint model.
- Needing** as few as 5 images**, you can use it to inject a person or a style into a model.
- A dreambooth model needs a trigger keyword in the prompt to trigger the injected subject or style.
What is EMA
- EMA stands for **Exponential Moving Average. **
- In a Stable Diffusion model, it is **the average weights over the last training steps. **
- Instead of the last training step,** a checkpoint model often use the EMA weights to improve stability**.
Embedding
- An embedding is** the product of textual inversion.**
- It is a small file for modifying an image.
- You apply embedding by putting in the associated keyword in the prompt or negative prompt.
Extension (2)
- An **extension extends the functionality of AUTOMATIC1111 WebUI. **
- eg, ControlNet is implemented through an extension.
What is Euler
The Euler method is the simplest sampling method for solving a diffusion model.
What is Fooocus (3)
- Fooocus is a Stable Diffusion software designed for simplicity.
- It centers the user experience on prompting and image generation.
- It’s free and open source.
What is Hugging Face (3)
- Hugging Face is a website that hosts a large amount of AI models.
- In addition, they develop tools to help run and host the models.
- Compared to Civitai, Hugging Face covers all AI models, not just Stable Diffusion.
What is Hypernetwork (2)
- A hypernetowrk is a small neural network that modifies the cross-attention module of the U-net noise predictor.
- Similar to LoRAs and embeddings, they are small model files used for modifying a checkpoint model.
What is the Karras noise schedule
- Karras is a noise schedule proposed in the Karras article, which studied a unified framework for denoising images in diffusion AI models.
What is K-diffusion/K-sampler
K-diffusion or K-samplers refer to sampling methods that Katherine Crowson’s k-diffusion GitHub repository implemented.
What is Latent diffusion
The Latent diffusion refers to a diffusion process in the latent space. For example, Stable Diffusion is a latent diffusion model.
What is LDM
The latent Diffusion Model (LDM) is an AI model that performs diffusion in the latent space.