Models that work
| Model | Size | Time | Notes |
|---|---|---|---|
| dreamshaperXL_v21TurboDPMSDE | 6.5G | 1.5m | |
| TempestV0.1-Artistic | 6.5G | 2m | |
| dreamshaper_8 | 2.0G | | |
| ostris--Flex.1-alpha | 25G | 7m | |
| kandinsky-community--kandinsky-3 | 27G | 22m | 512x512 works, 1024 does not |
| segmind--SegMoE-SD-4x2-v0 | 3.7G | | |
| XCLiu--instaflow_0_9B_from_sd_1_5 | 5.2G | | |
| juggernautXL_ragnarokBy | 6.7G | 2m | |
| juggernaut_reborn | 2.0G | not finished | |
| meinamix_v12Final | 2.0G | 6m | needs more than 20 steps |
| ponyRealism_V23 | 6.5G | 2m | |
| prefectPonyXL_v50 | 6.5G | 2m | |
| epicrealism_naturalSin | 2.0G | 2m | needs more than 20 steps |
| counterfeitV30_25 | 4.0G | 5m | |
ByteDance/Hyper-SD

root@server6:~/ollama-cpu-docker# du -xsh /docker/sdnext/mnt-volume/models/Diffusers/* |sort -n
1.0G /docker/sdnext/mnt-volume/models/Diffusers/models--segmind--tiny-sd
3.3G /docker/sdnext/mnt-volume/models/Diffusers/models--amused--amused-512
+ 3.7G /docker/sdnext/mnt-volume/models/Diffusers/models--segmind--SegMoE-SD-4x2-v0
5.2G /docker/sdnext/mnt-volume/models/Diffusers/models--stablediffusionapi--anything-v5
5.2G /docker/sdnext/mnt-volume/models/Diffusers/models--stablediffusionapi--disney-pixal-cartoon
+ 5.2G /docker/sdnext/mnt-volume/models/Diffusers/models--XCLiu--instaflow_0_9B_from_sd_1_5
- 9.1G /docker/sdnext/mnt-volume/models/Diffusers/models--Efficient-Large-Model--Sana_Sprint_1.6B_1024px_diffusers
13G /docker/sdnext/mnt-volume/models/Diffusers/models--playgroundai--playground-v2-256px-base
- 14G /docker/sdnext/mnt-volume/models/Diffusers/models--callgg--bagel-fp8
- 21G /docker/sdnext/mnt-volume/models/Diffusers/models--PixArt-alpha--PixArt-XL-2-512x512
+ 25G /docker/sdnext/mnt-volume/models/Diffusers/models--ostris--Flex.1-alpha
- 26G /docker/sdnext/mnt-volume/models/Diffusers/models--ByteDance--Hyper-SD
- 26G /docker/sdnext/mnt-volume/models/Diffusers/models--stabilityai--stable-diffusion-3.5-large
- 26G /docker/sdnext/mnt-volume/models/Diffusers/models--stabilityai--stable-diffusion-3.5-large-turbo
+ 27G /docker/sdnext/mnt-volume/models/Diffusers/models--kandinsky-community--kandinsky-3
? 41G /docker/sdnext/mnt-volume/models/Diffusers/models--ByteDance--InfiniteYou
? 47G /docker/sdnext/mnt-volume/models/Diffusers/models--ByteDance--SDXL-Lightning
? 66G /docker/sdnext/mnt-volume/models/Diffusers/models--calcuis--sd3.5-large-gguf

root@server6:~/ollama-cpu-docker# du -xsh /docker/sdnext/mnt-volume/models/Stable-diffusion/*.safetensors |sort -n
+ 2.0G /docker/sdnext/mnt-volume/models/Stable-diffusion/dreamshaper_8.safetensors
+ 2.0G /docker/sdnext/mnt-volume/models/Stable-diffusion/epicrealism_naturalSin.safetensors
+ 2.0G /docker/sdnext/mnt-volume/models/Stable-diffusion/juggernaut_reborn.safetensors
+ 2.0G /docker/sdnext/mnt-volume/models/Stable-diffusion/meinamix_v12Final.safetensors
+ 4.0G /docker/sdnext/mnt-volume/models/Stable-diffusion/counterfeitV30_25.safetensors
? 4.8G /docker/sdnext/mnt-volume/models/Stable-diffusion/stableDiffusion35_medium.safetensors
+ 6.5G /docker/sdnext/mnt-volume/models/Stable-diffusion/dreamshaperXL_v21TurboDPMSDE.safetensors
+ 6.5G /docker/sdnext/mnt-volume/models/Stable-diffusion/ponyDiffusionV6XL_v6TurboMerge.safetensors
+ 6.5G /docker/sdnext/mnt-volume/models/Stable-diffusion/ponyRealism_V23.safetensors
+ 6.5G /docker/sdnext/mnt-volume/models/Stable-diffusion/prefectPonyXL_v50.safetensors
+ 6.5G /docker/sdnext/mnt-volume/models/Stable-diffusion/TempestV0.1-Artistic.safetensors
+ 6.7G /docker/sdnext/mnt-volume/models/Stable-diffusion/juggernautXL_ragnarokBy.safetensors
- 13G /docker/sdnext/mnt-volume/models/Stable-diffusion/ponyDiffusionV6XL_v6StartWithThisOne.safetensors
? 14G /docker/sdnext/mnt-volume/models/Stable-diffusion/absynthEnhancedStable_35M20T5xxlFp16Clipl.safetensors
? 16G /docker/sdnext/mnt-volume/models/Stable-diffusion/flux_dev.safetensors
- 41G /docker/sdnext/mnt-volume/models/Stable-diffusion/sd35LargeGoogleFLAN_large3CLIPFLANFP16.safetensors

Random prompts used in this test
crazy frog on the moon
Asian girl with white rabbit in New York Central Park
Sports car in faraway winter village get towed by coca-cola truck
Президент України на центральній площі столиці (Ukrainian for "President of Ukraine on the central square of the capital")
Defaults (all SD.Next app parameters left at their default values)
- Batch count = 1
- Batch size = 1
- Guidance end = 1
- Guidance scale = 6
- Steps = 20
- Sampling method = default
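
The same defaults could also be sent to SD.Next programmatically instead of through the UI. The sketch below is only an illustration and was not used for these tests: it assumes SD.Next is reachable at `http://127.0.0.1:7860` and exposes the A1111-style `/sdapi/v1/txt2img` endpoint with the field names shown. The helper names `build_txt2img_payload` and `txt2img` are hypothetical, the 1024x1024 size is taken from the results table rather than from the defaults list, and "Guidance end" and the sampler are left out so the server keeps its own defaults.

```python
import json
import urllib.request


def build_txt2img_payload(prompt, width=1024, height=1024):
    """Build a request body that mirrors the defaults listed above.

    Field names follow the A1111-style API (an assumption, see lead-in).
    """
    return {
        "prompt": prompt,
        "steps": 20,        # Steps = 20
        "cfg_scale": 6,     # Guidance scale = 6
        "batch_size": 1,    # Batch size = 1
        "n_iter": 1,        # Batch count = 1
        "width": width,
        "height": height,
    }


def txt2img(prompt, base_url="http://127.0.0.1:7860"):
    """POST the payload and return the decoded JSON response.

    base_url is a placeholder; adjust it to the actual SD.Next host and port.
    """
    request = urllib.request.Request(
        f"{base_url}/sdapi/v1/txt2img",
        data=json.dumps(build_txt2img_payload(prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


if __name__ == "__main__":
    # Only prints the payload; call txt2img(...) when a server is running.
    print(json.dumps(build_txt2img_payload("crazy frog on the moon"), indent=2))
```

Keeping payload construction separate from the HTTP call makes it possible to check the parameters without a running server.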
| Model | crazy frog on the moon | Asian girl with white rabbit in New York Central Park | Sports car in faraway winter village get towed by coca-cola truck | Президент України на центральній площі столиці |
|---|---|---|---|---|
| dreamshaperXL_v21TurboDPMSDE, Model Size: 6.5G, Image Size: 1024x1024, CFG scale: 6, Model: dreamshaperXL_v21TurboDPMSDE, Model hash: 4496b36d48, App: SD.Next, Version: 12ebadc, Operations: txt2img, Pipeline: StableDiffusionXLPipeline | Time: 4m 10.85s | Time: 4m 5.84s | Time: 4m 5.55s | Time: 4m 5.27s |
| dreamshaperXL_v21TurboDPMSDE, Model Size: 6.5G, Image Size: 1024x1024 (OpenVINO) | Time: 1m 25.62s | Time: 1m 27.45s | Time: 1m 27.33s | Time: 1m 27.47s |
| ostris--Flex.1-alpha, Model Size: 25G, Image Size: 1024x1024 | Time: 6m 52.95s | Time: 6m 27.95s | Time: 6m 26.53s | Time: 6m 27.70s |
| TempestV0.1-Artistic, Model Size: 6.5G, Image Size: 1024x1024 | Time: 1m 45.31s | Time: 1m 46.13s | Time: 1m 45.31s | Time: 1m 47.05s |
| dreamshaper_8, Model Size: 2.0G, Image Size: 1024x1024 | Time: 5m 53.64s | | | |
| kandinsky-community--kandinsky-3, Model Size: 27G | Time: 22.38s | Time: 22.41s | Time: 22.96s | Time: 22.93s |
| segmind--SegMoE-SD-4x2-v0, Model Size: 3.7G, Image Size: 1024x1024 | time=868.29 (other high load) | Time: 5m 55.77s | Time: 6m 25.73s | Time: 6m 35.28s |
| segmind--tiny-sd, Model Size: 1.0G, Image Size: 1024x1024 | Time: 3m 25.01s | Time: 3m 25.15s | Time: 3m 24.95s | Time: 3m 26.57s |
| XCLiu--instaflow_0_9B_from_sd_1_5, Model Size: 5.2G, Image Size: 1024x1024 | Time: 13m 47.34s | Time: 13m 48.29s | Time: 13m 54.21s | Time: 13m 54.96s |
| juggernautXL_ragnarokBy, Model Size: 6.7G, Image Size: 1024x1024 | Time: 1m 45.56s | Time: 1m 45.57s | Time: 1m 45.27s | Time: 1m 45.51s |
| juggernaut_reborn, Model Size: 2.0G, Image Size: 1024x1024 | Time: 7m 51.51s | Time: 7m 12.42s | Time: 6m 32.72s | Time: 6m 12.55s |
| meinamix_v12Final, Model Size: 2.0G, Image Size: 1024x1024 | Time: 7m 5.59s | Time: 5m 50.81s | Time: 6m 18.95s | Time: 14m 30.83s (background tasks) |
| ponyRealism_V23, Model Size: 6.5G, Image Size: 1024x1024 | Time: 1m 48.85s | Time: 1m 45.83s | Time: 1m 45.75s | Time: 1m 45.75s, with negative prompt (no nudes) |
| prefectPonyXL_v50, Model Size: 6.5G, Image Size: 1024x1024 | Time: 1m 45.26s | Time: 1m 45.37s | Time: 1m 45.31s | Time: 1m 45.46s |
| ponyDiffusionV6XL_v6TurboMerge, Model Size: 6.5G, Image Size: 1024x1024 | Time: 1m 46.17s | Time: 1m 46.72s | Time: 1m 46.04s | Time: 4m 14.59s |
| epicrealism_naturalSin, Model Size: 2.0G, Image Size: 1024x1024 | Time: 5m 49.91s (20 steps) | Time: 5m 50.01s (20 steps, CFG=6) | Time: 8m 34.48s (30 steps, CFG=7.5) | Time: 8m 37.26s (30 steps, CFG=7.5) |
| counterfeitV30_25, Model Size: 4.0G, Image Size: 1024x1024 | Time: 5m 50.00s | Time: 5m 49.99s | Time: 10m 58.72s | |
| Efficient-Large-Model--Sana_Sprint_1.6B_1024px_diffusers, Model Size: 9.1G | - | - | - | - |
| kandinsky-community--kandinsky-3, Model Size: 27G | | - | - | - |
| PixArt-alpha--PixArt-XL-2-512x512, Model Size: 21G, Image Size: 512x512 | - | - | - | - |
| amused--amused-512, Model Size: 3.3G, Image Size: 512x512 | 'list' object has no attribute 'device' | 'list' object has no attribute 'device' | 'list' object has no attribute 'device' | 'list' object has no attribute 'device' |
| playgroundai--playground-v2-256px-base, Model Size: 13G, Image Size: 1024x1024, 512x512, 256x256 | | | | |
| disney-pixal-cartoon | Time: 6m 6.92s | | | |
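
To compare rows of the results table numerically, the `Time:` cells can be converted to seconds and averaged. The following is a minimal sketch, not part of the original test setup: the helper names `to_seconds` and `mean_seconds` are hypothetical, and the parser takes the first `Xm Y.YYs` or `Y.YYs` pattern in a cell and ignores cells that hold `-` or an error message. The sample values are the two dreamshaperXL_v21TurboDPMSDE rows (default pipeline and OpenVINO).

```python
import re
from statistics import mean
from typing import Iterable, Optional

# Optional minutes part ("4m "), then seconds with a decimal point ("5.84s").
_TIME_RE = re.compile(r"(?:(\d+)m\s*)?(\d+\.\d+)s?")


def to_seconds(cell: str) -> Optional[float]:
    """Return the first time found in a table cell as seconds, or None."""
    match = _TIME_RE.search(cell)
    if match is None:
        return None
    minutes = int(match.group(1) or 0)
    return minutes * 60 + float(match.group(2))


def mean_seconds(cells: Iterable[str]) -> Optional[float]:
    """Average all parseable times in a row; None if nothing parses."""
    values = [s for s in map(to_seconds, cells) if s is not None]
    return mean(values) if values else None


default_run = ["Time: 4m 10.85s", "Time: 4m 5.84s", "Time: 4m 5.55s", "Time: 4m 5.27s"]
openvino_run = ["Time: 1m 25.62s", "Time: 1m 27.45s", "Time: 1m 27.33s", "Time: 1m 27.47s"]

speedup = mean_seconds(default_run) / mean_seconds(openvino_run)
print(f"default:  {mean_seconds(default_run):.1f}s per image")
print(f"OpenVINO: {mean_seconds(openvino_run):.1f}s per image")
print(f"speedup:  {speedup:.2f}x")
```

For these two rows the OpenVINO run comes out roughly 2.8 times faster than the default pipeline. The same helpers can be used to get a per-step time, for example by dividing `to_seconds(...)` by the step count noted in the epicrealism_naturalSin cells.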
SD version info from Wikipedia
https://en.wikipedia.org/wiki/Stable_Diffusion
| Version number | Release date | Parameters | Notes |
|---|---|---|---|
| 1.1, 1.2, 1.3, 1.4 | August 2022 | | All released by CompVis. There is no "version 1.0". 1.1 gave rise to 1.2, and 1.2 gave rise to both 1.3 and 1.4. |
| 1.5 | October 2022 | 983M | Initialized with the weights of 1.2, not 1.4. Released by RunwayML. |
| 2.0 | November 2022 | | Retrained from scratch on a filtered dataset. |
| 2.1 | December 2022 | | Initialized with the weights of 2.0. |
| XL 1.0 | July 2023 | 3.5B | The XL 1.0 base model has 3.5 billion parameters, making it around 3.5x larger than previous versions. |
| XL Turbo | November 2023 | | Distilled from XL 1.0 to run in fewer diffusion steps. |
| 3.0 | February 2024 (early preview) | 800M to 8B | A family of models. |
| 3.5 | October 2024 | 2.5B to 8B | A family of models with Large (8 billion parameters), Large Turbo (distilled from SD 3.5 Large), and Medium (2.5 billion parameters). |
SD.Next docs: Models
https://vladmandic.github.io/sdnext-docs/Models/
| Publisher | Model | Version | Size | Diffusion Architecture | Model Params | Text Encoder(s) | TE Params | Auto Encoder | Other |
|---|---|---|---|---|---|---|---|---|---|
| StabilityAI | Stable Diffusion | 1.5 | 2.28GB | UNet | 0.86B | CLiP ViT-L | 0.12B | VAE | |
| StabilityAI | Stable Diffusion | 2.1 | 2.58GB | UNet | 0.86B | CLiP ViT-H | 0.34B | VAE | |
| StabilityAI | Stable Diffusion | XL | 6.94GB | UNet | 2.56B | CLiP ViT-L + ViT-G | 0.12B + 0.69B | VAE | |
| StabilityAI | Stable Diffusion | 3.0 Medium | 15.14GB | MMDiT | 2.0B | CLiP ViT-L + ViT-G + T5-XXL | 0.12B + 0.69B + 4.76B | 16ch VAE | |
| StabilityAI | Stable Diffusion | 3.5 Medium | 15.89GB | MMDiT | 2.25B | CLiP ViT-L + ViT-G + T5-XXL | 0.12B + 0.69B + 4.76B | 16ch VAE | |
| StabilityAI | Stable Diffusion | 3.5 Large | 26.98GB | MMDiT | 8.05B | CLiP ViT-L + ViT-G + T5-XXL | 0.12B + 0.69B + 4.76B | 16ch VAE | |
| StabilityAI | Stable Cascade | Medium | 11.82GB | Multi-stage UNet | 1.56B + 3.6B | CLiP ViT-G | 0.69B | 42x VQE | |
| StabilityAI | Stable Cascade | Lite | 4.97GB | Multi-stage UNet | 0.7B + 1.0B | CLiP ViT-G | 0.69B | 42x VQE | |
| Black Forest Labs | Flux | 1 Dev/Schnell | 32.93GB | MMDiT | 11.9B | CLiP ViT-L + T5-XXL | 0.12B + 4.76B | 16ch VAE | |
| Ostris | Flex | 1 Alpha | 25.65GB | MMDiT | 4.0B | CLiP ViT-L + T5-XXL | 0.12B + 2.95B | 16ch VAE | |
| NVLabs | Sana | 1.5 1.6B | 9.49GB | MMDiT | 1.60B | Gemma2 | 2.61B | DC-AE | |
| NVLabs | Sana | 1.5 4.8B | 15.58GB | MMDiT | 4.72B | Gemma2 | 2.61B | DC-AE | |
| NVLabs | Sana | 1.0 1600M | 12.63GB | MMDiT | 1.60B | Gemma2 | 2.61B | DC-AE | |
| NVLabs | Sana | 1.0 600M | 7.51GB | MMDiT | 0.59B | Gemma2 | 2.61B | DC-AE | |
| FAL | AuraFlow | 0.3 | 31.90GB | MMDiT | 6.8B | UMT5 | 12.1B | VAE | |
| AlphaVLLM | Lumina | Next SFT | 8.67GB | DiT | 1.7B | Gemma | 2.5B | VAE | |
| AlphaVLLM | Lumina | 2 | 20.75GB | DiT | 2.61B | Gemma-2 | 2.61B | 16ch VAE | |
| PixArt | Alpha | XL 2 | 21.3GB | DiT | 0.61B | T5-XXL | 4.76B | VAE | |
| PixArt | Sigma | XL 2 | 21.3GB | DiT | 0.61B | T5-XXL | 4.76B | VAE | |
| Segmind | SSD-1B | N/A | 8.72GB | UNet | 1.33B | CLiP ViT-L + ViT-G | 0.12B + 0.69B | VAE | |
| Segmind | Vega | N/A | 6.43GB | UNet | 0.75B | CLiP ViT-L + ViT-G | 0.12B + 0.69B | VAE | |
| Segmind | Tiny | N/A | 1.03GB | UNet | 0.32B | CLiP ViT-L | 0.12B | VAE | |
| Kwai | Kolors | N/A | 17.40GB | UNet | 2.58B | ChatGLM | 6.24B | VAE | |
| PlaygroundAI | Playground | 1.0 | 4.95GB | UNet | 0.86B | CLiP ViT-L | 0.12B | VAE | |
| PlaygroundAI | Playground | 2.x | 13.35GB | UNet | 2.56B | CLiP ViT-L + ViT-G | 0.12B + 0.69B | VAE | |
| Tencent | HunyuanDiT | 1.2 | 14.09GB | DiT | 1.5B | BERT + T5-XL | 3.52B + 1.67B | VAE | |
| Warp AI | Wuerstchen | N/A | 12.16GB | Multi-stage UNet | 1.0B + 1.05B | CLiP ViT-L + ViT-G | 0.12B + 0.69B | 42x VQE | |
| Kandinsky | Kandinsky | 2.2 | 5.15GB | UNet | 1.25B | CLiP ViT-G | 0.69B | VQ | |
| Kandinsky | Kandinsky | 3.0 | 27.72GB | UNet | 3.05B | T5-XXXL | 8.72B | VQ | |
| Thudm | CogView | 3 Plus | 24.96GB | DiT | 2.85B | T5-XXL | 4.76B | VAE | |
| Thudm | CogView | 4 | 30.39GB | DiT | 6.37B | GLM-4 | 9.40B | VAE | |
| IDKiro | SDXS | N/A | 2.05GB | UNet | 0.32B | CLiP ViT-L | 0.12B | VAE | |
| Open-MUSE | aMUSEd | 256 | 3.41GB | ViT | 0.60B | CLiP ViT-L | 0.12B | VQ | |
| Koala | Koala | 700M | 6.58GB | UNet | 0.78B | CLiP ViT-L + ViT-G | 0.12B + 0.69B | VAE | |
| Thu-ML | UniDiffuser | v1 | 5.37GB | U-ViT | 0.95B | CLiP ViT-L + CLiP ViT-B | 0.12B + 0.16B | VAE | |
| Salesforce | BLIP-Diffusion | N/A | 7.23GB | UNet | 0.86B | CLiP ViT-L + BLiP-2 | 0.12B + 0.49B | VAE | |
| DeepFloyd | IF | M | 12.79GB | Multi-stage UNet | 0.37B + 0.46B | T5-XXL | 4.76B | Pixel | |
| DeepFloyd | IF | L | 15.48GB | Multi-stage UNet | 0.61B + 0.93B | T5-XXL | 4.76B | Pixel | |
| MeissonFlow | Meissonic | N/A | 3.64GB | DiT | 1.18B | CLiP ViT-H | 0.35B | VQ | |
| VectorSpaceLab | OmniGen | v1 | 15.47GB | Transformer | 3.76B | None | 0 | VAE | Phi-3 |
| HiDream-AI | HiDream | I2 Fast/Dev/Full | 42.71GB + 15.69GB | MMDiT | 17.10B | CLiP ViT-L + ViT-G + T5-XXL + Llama-3.1-8B | 0.12B + 0.69B + 2.95B + 4.54B | 16ch VAE | |
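
The Size column can be roughly cross-checked against the parameter columns. The sketch below is a back-of-the-envelope estimate, not something stated in the SD.Next docs: it assumes weights are stored at 2 bytes per parameter (16-bit), counts 1 GB as 10^9 bytes, and ignores the autoencoder and any other files in the download. The function name `estimate_size_gb` is hypothetical.

```python
def estimate_size_gb(params_billions, bytes_per_param=2):
    """Estimate on-disk size in GB from parameter counts given in billions.

    Billions of parameters times bytes per parameter gives GB directly
    when 1 GB is taken as 10**9 bytes (an assumption, see lead-in).
    """
    return sum(params_billions) * bytes_per_param


# (diffusion model params, text encoder params...) taken from the table above.
rows = {
    "Stable Diffusion XL (listed 6.94GB)": [2.56, 0.12, 0.69],
    "Stable Diffusion 3.5 Large (listed 26.98GB)": [8.05, 0.12, 0.69, 4.76],
    "Flux 1 Dev/Schnell (listed 32.93GB)": [11.9, 0.12, 4.76],
}

for name, params in rows.items():
    print(f"{name}: estimated {estimate_size_gb(params):.2f}GB")

# PixArt Alpha XL 2 is listed at 21.3GB, which is closer to a 4-byte estimate.
print(f"PixArt Alpha at 4 bytes/param: {estimate_size_gb([0.61, 4.76], 4):.2f}GB")
```

For the three rows above the estimate lands within about 1GB of the listed size. The PixArt figure would fit 32-bit weights, although that is only a guess based on the arithmetic.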