Models that work
| Model | Size | Time | Notes |
|---|---|---|---|
| dreamshaperXL_v21TurboDPMSDE | 6.5G | 1.5m | |
| TempestV0.1-Artistic | 6.5G | 2m | |
| dreamshaper_8 | 2.0G | ||
| ostris--Flex.1-alpha | 25G | 7m | |
| kandinsky-community--kandinsky-3 | 27G | 22m | 512x512 -ok, 1024 -not |
| segmind--SegMoE-SD-4x2-v0 | 3.7G | | |
| XCLiu--instaflow_0_9B_from_sd_1_5 | 5.2G | ||
| juggernautXL_ragnarokBy | 6.7G | 2m | |
| juggernaut_reborn | 2.0G | not finished | |
| meinamix_v12Final | 2.0G | 6m | need more than 20 steps |
| ponyRealism_V23 | 6.5G | 2m | |
| prefectPonyXL_v50 | 6.5G | 2m | |
| epicrealism_naturalSin | 2.0G | 2m | need more than 20 steps |
| counterfeitV30_25 | 4.0G | 5m | |
| ByteDance/Hyper-SD | | | |

Output of `du -xsh /docker/sdnext/mnt-volume/models/Diffusers/* | sort -n`, run as root@server6 in ~/ollama-cpu-docker:

| Mark | Size | Path |
|---|---|---|
| | 1.0G | /docker/sdnext/mnt-volume/models/Diffusers/models--segmind--tiny-sd |
| | 3.3G | /docker/sdnext/mnt-volume/models/Diffusers/models--amused--amused-512 |
| + | 3.7G | /docker/sdnext/mnt-volume/models/Diffusers/models--segmind--SegMoE-SD-4x2-v0 |
| | 5.2G | /docker/sdnext/mnt-volume/models/Diffusers/models--stablediffusionapi--anything-v5 |
| | 5.2G | /docker/sdnext/mnt-volume/models/Diffusers/models--stablediffusionapi--disney-pixal-cartoon |
| + | 5.2G | /docker/sdnext/mnt-volume/models/Diffusers/models--XCLiu--instaflow_0_9B_from_sd_1_5 |
| - | 9.1G | /docker/sdnext/mnt-volume/models/Diffusers/models--Efficient-Large-Model--Sana_Sprint_1.6B_1024px_diffusers |
| | 13G | /docker/sdnext/mnt-volume/models/Diffusers/models--playgroundai--playground-v2-256px-base |
| - | 14G | /docker/sdnext/mnt-volume/models/Diffusers/models--callgg--bagel-fp8 |
| - | 21G | /docker/sdnext/mnt-volume/models/Diffusers/models--PixArt-alpha--PixArt-XL-2-512x512 |
| + | 25G | /docker/sdnext/mnt-volume/models/Diffusers/models--ostris--Flex.1-alpha |
| - | 26G | /docker/sdnext/mnt-volume/models/Diffusers/models--ByteDance--Hyper-SD |
| - | 26G | /docker/sdnext/mnt-volume/models/Diffusers/models--stabilityai--stable-diffusion-3.5-large |
| - | 26G | /docker/sdnext/mnt-volume/models/Diffusers/models--stabilityai--stable-diffusion-3.5-large-turbo |
| + | 27G | /docker/sdnext/mnt-volume/models/Diffusers/models--kandinsky-community--kandinsky-3 |
| ? | 41G | /docker/sdnext/mnt-volume/models/Diffusers/models--ByteDance--InfiniteYou |
| ? | 47G | /docker/sdnext/mnt-volume/models/Diffusers/models--ByteDance--SDXL-Lightning |
| ? | 66G | /docker/sdnext/mnt-volume/models/Diffusers/models--calcuis--sd3.5-large-gguf |

Output of `du -xsh /docker/sdnext/mnt-volume/models/Stable-diffusion/*.safetensors | sort -n`, same host and directory:

| Mark | Size | Path |
|---|---|---|
| + | 2.0G | /docker/sdnext/mnt-volume/models/Stable-diffusion/dreamshaper_8.safetensors |
| + | 2.0G | /docker/sdnext/mnt-volume/models/Stable-diffusion/epicrealism_naturalSin.safetensors |
| + | 2.0G | /docker/sdnext/mnt-volume/models/Stable-diffusion/juggernaut_reborn.safetensors |
| + | 2.0G | /docker/sdnext/mnt-volume/models/Stable-diffusion/meinamix_v12Final.safetensors |
| + | 4.0G | /docker/sdnext/mnt-volume/models/Stable-diffusion/counterfeitV30_25.safetensors |
| ? | 4.8G | /docker/sdnext/mnt-volume/models/Stable-diffusion/stableDiffusion35_medium.safetensors |
| + | 6.5G | /docker/sdnext/mnt-volume/models/Stable-diffusion/dreamshaperXL_v21TurboDPMSDE.safetensors |
| + | 6.5G | /docker/sdnext/mnt-volume/models/Stable-diffusion/ponyDiffusionV6XL_v6TurboMerge.safetensors |
| + | 6.5G | /docker/sdnext/mnt-volume/models/Stable-diffusion/ponyRealism_V23.safetensors |
| + | 6.5G | /docker/sdnext/mnt-volume/models/Stable-diffusion/prefectPonyXL_v50.safetensors |
| + | 6.5G | /docker/sdnext/mnt-volume/models/Stable-diffusion/TempestV0.1-Artistic.safetensors |
| + | 6.7G | /docker/sdnext/mnt-volume/models/Stable-diffusion/juggernautXL_ragnarokBy.safetensors |
| - | 13G | /docker/sdnext/mnt-volume/models/Stable-diffusion/ponyDiffusionV6XL_v6StartWithThisOne.safetensors |
| ? | 14G | /docker/sdnext/mnt-volume/models/Stable-diffusion/absynthEnhancedStable_35M20T5xxlFp16Clipl.safetensors |
| ? | 16G | /docker/sdnext/mnt-volume/models/Stable-diffusion/flux_dev.safetensors |
| - | 41G | /docker/sdnext/mnt-volume/models/Stable-diffusion/sd35LargeGoogleFLAN_large3CLIPFLANFP16.safetensors |
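The listings above are sorted with `sort -n`, which compares only the leading number and ignores the unit suffix. That happens to work here because every entry is in G. A minimal Python sketch for totalling such a listing per mark is given below. It treats the +, - and ? marks as opaque labels (their meaning is not spelled out in these notes), and the helper names `parse_du_line` and `total_by_mark` are made up for this example.

```python
import re

# Multipliers for the binary unit suffixes printed by `du -h`.
UNITS = {"K": 2**10, "M": 2**20, "G": 2**30, "T": 2**40}

# Optional mark (+, - or ?), then a size such as 6.5G, then the path.
LINE_RE = re.compile(r"^\s*([+\-?])?\s*([\d.]+)([KMGT])\s+(\S+)\s*$")


def parse_du_line(line):
    """Return (mark, size_in_bytes, path) for one annotated `du -sh` line."""
    match = LINE_RE.match(line)
    if match is None:
        raise ValueError(f"unrecognised line: {line!r}")
    mark, number, unit, path = match.groups()
    return mark or "", float(number) * UNITS[unit], path


def total_by_mark(lines):
    """Sum the sizes (in GiB) of all lines that share the same mark."""
    totals = {}
    for line in lines:
        mark, size, _path = parse_du_line(line)
        totals[mark] = totals.get(mark, 0.0) + size / 2**30
    return totals


# A few lines copied from the second listing above.
sample = [
    "+ 2.0G /docker/sdnext/mnt-volume/models/Stable-diffusion/dreamshaper_8.safetensors",
    "+ 6.5G /docker/sdnext/mnt-volume/models/Stable-diffusion/ponyRealism_V23.safetensors",
    "? 16G /docker/sdnext/mnt-volume/models/Stable-diffusion/flux_dev.safetensors",
    "- 41G /docker/sdnext/mnt-volume/models/Stable-diffusion/sd35LargeGoogleFLAN_large3CLIPFLANFP16.safetensors",
]

for mark, gib in sorted(total_by_mark(sample).items()):
    print(f"{mark or '(none)'}: {gib:.1f} GiB")
```

If sizes in M and G ever get mixed in one listing, GNU `sort -h` sorts human-readable sizes correctly where `sort -n` does not.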
crazy frog on the moon
Asian girl with white rabbit in New York Central Park
Sports car in faraway winter village get towed by coca-cola truck
Президент України на центральній площі столиці (Ukrainian: "President of Ukraine on the central square of the capital")
| Model | crazy frog on the moon | Asian girl with white rabbit in New York Central Park | Sports car in faraway winter village get towed by coca-cola truck | Президент України на центральній площі столиці (President of Ukraine on the central square of the capital) |
|---|---|---|---|---|
| dreamshaperXL_v21TurboDPMSDE, Model Size: 6.5G, Image Size: 1024x1024, CFG scale: 6, Model: dreamshaperXL_v21TurboDPMSDE, Model hash: 4496b36d48, App: SD.Next, Version: 12ebadc, Operations: txt2img, Pipeline: StableDiffusionXLPipeline | 4m 10.85s | 4m 5.84s | 4m 5.55s | 4m 5.27s |
| dreamshaperXL_v21TurboDPMSDE, Model Size: 6.5G, Image Size: 1024x1024 (OpenVINO) | 1m 25.62s | 1m 27.45s | 1m 27.33s | 1m 27.47s |
| ostris--Flex.1-alpha, Model Size: 25G, Image Size: 1024x1024 | 6m 52.95s | 6m 27.95s | 6m 26.53s | |
| TempestV0.1-Artistic, Model Size: 6.5G, Image Size: 1024x1024 | 1m 45.31s | 1m 46.13s | 1m 45.31s | 1m 47.05s |
| dreamshaper_8, Model Size: 2.0G, Image Size: 1024x1024 | 5m 53.64s | | | |
| kandinsky-community--kandinsky-3, Model Size: 27G | 22.38s | 22.41s | 22.96s | 22.93s |
| segmind--SegMoE-SD-4x2-v0, Model Size: 3.7G, Image Size: 1024x1024 | time=868.29 (other high load) | 5m 55.77s | 6m 25.73s | 6m 35.28s |
| XCLiu--instaflow_0_9B_from_sd_1_5, Model Size: 5.2G, Image Size: 1024x1024 | 13m 47.34s | 13m 48.29s | 13m 54.21s | 13m 54.96s |
| juggernautXL_ragnarokBy, Model Size: 6.7G, Image Size: 1024x1024 | 1m 45.56s | 1m 45.57s | 1m 45.27s | 1m 45.51s |
| juggernaut_reborn, Model Size: 2.0G, Image Size: 1024x1024 | 7m 51.51s | 7m 12.42s | 6m 32.72s | 6m 12.55s |
| meinamix_v12Final, Model Size: 2G, Image Size: 1024x1024 | 7m 5.59s | 5m 50.81s | 6m 18.95s | 14m 30.83s (background tasks) |
| ponyRealism_V23, Model Size: 6.2G, Image Size: 1024x1024 | 1m 48.85s | 1m 45.83s | 1m 45.75s | 1m 45.75s +neg. prompt (no nudes) |
| prefectPonyXL_v50, Model Size: 6.2G, Image Size: 1024x1024 | 1m 45.26s | 1m 45.37s | 1m 45.31s | 1m 45.46s |
| ponyDiffusionV6XL_v6TurboMerge, Model Size: 6.5G, Image Size: 1024x1024 | 1m 46.17s | 1m 46.72s | 1m 46.04s | 4m 14.59s |
| epicrealism_naturalSin, Model Size: 2G, Image Size: 1024x1024 | 5m 49.91s (20 steps) | 5m 50.01s (20 steps, CFG=6) | 8m 34.48s (30 steps, CFG=7.5) | 8m 37.26s (30 steps, CFG=7.5) |
| counterfeitV30_25, Model Size: 4G, Image Size: 1024x1024 | 5m 50.00s | 5m 49.99s | 10m 58.72s | |
| ostris--Flex.1-alpha, Model Size: 25G, Image Size: 1024x1024 | - | - | - | - |
| Efficient-Large-Model--Sana_Sprint_1.6B_1024px_diffusers, Model Size: 9.1G | - | - | - | - |
| kandinsky-community--kandinsky-3, Model Size: 27G | | - | - | - |
| PixArt-alpha--PixArt-XL-2-512x512, Model Size: 21G, Image Size: 512x512 | - | - | - | - |
| amused--amused-512, Model Size: 3.3G, Image Size: 512x512 | 'list' object has no attribute 'device' | 'list' object has no attribute 'device' | 'list' object has no attribute 'device' | 'list' object has no attribute 'device' |
| playgroundai--playground-v2-256px-base, Model Size: 13G, Image Size: 1024x1024, 512x512, 256x256 | | | | |
| disney-pixal-cartoon | 6m 6.92s | | | |
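To compare rows of the timing table, the `Xm Y.YYs` strings have to be turned into seconds first. The sketch below does that and averages the two dreamshaperXL_v21TurboDPMSDE rows (default pipeline and OpenVINO). The function name `to_seconds` and the variable names are made up for this example; the timings are copied from the table.

```python
import re
from statistics import mean

# Optional "<minutes>m", then seconds with an optional trailing "s".
DURATION_RE = re.compile(r"(?:(\d+)m\s*)?(\d+(?:\.\d+)?)s?")


def to_seconds(text):
    """Convert strings such as '4m 5.84s', '22.38s' or '4m 10.85' to seconds."""
    match = DURATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised duration: {text!r}")
    minutes, seconds = match.groups()
    return int(minutes or 0) * 60 + float(seconds)


# dreamshaperXL_v21TurboDPMSDE at 1024x1024, one value per test prompt.
default_runs = ["4m 10.85s", "4m 5.84s", "4m 5.55s", "4m 5.27s"]
openvino_runs = ["1m 25.62s", "1m 27.45s", "1m 27.33s", "1m 27.47s"]

default_mean = mean(to_seconds(t) for t in default_runs)
openvino_mean = mean(to_seconds(t) for t in openvino_runs)
speedup = default_mean / openvino_mean

print(f"default:  {default_mean:.1f}s")
print(f"OpenVINO: {openvino_mean:.1f}s")
print(f"speedup:  {speedup:.2f}x")
```

On these four prompts the OpenVINO runs come out roughly 2.8x faster than the default pipeline. Cells carrying extra remarks, such as "(background tasks)", need the remark stripped before parsing.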
SD version info from Wikipedia
https://en.wikipedia.org/wiki/Stable_Diffusion
| Version number | Release date | Parameters | Notes |
|---|---|---|---|
| 1.1, 1.2, 1.3, 1.4 | August 2022 | | All released by CompVis. There is no "version 1.0". 1.1 gave rise to 1.2, and 1.2 gave rise to both 1.3 and 1.4. |
| 1.5 | October 2022 | 983M | Initialized with the weights of 1.2, not 1.4. Released by RunwayML. |
| 2.0 | November 2022 | | Retrained from scratch on a filtered dataset. |
| 2.1 | December 2022 | | Initialized with the weights of 2.0. |
| XL 1.0 | July 2023 | 3.5B | The XL 1.0 base model has 3.5 billion parameters, making it around 3.5x larger than previous versions. |
| XL Turbo | November 2023 | | Distilled from XL 1.0 to run in fewer diffusion steps. |
| 3.0 | February 2024 (early preview) | 800M to 8B | A family of models. |
| 3.5 | October 2024 | 2.5B to 8B | A family of models with Large (8 billion parameters), Large Turbo (distilled from SD 3.5 Large), and Medium (2.5 billion parameters). |
SD.Next Doc Models
https://vladmandic.github.io/sdnext-docs/Models/
| Publisher | Model | Version | Size | Diffusion Architecture | Model Params | Text Encoder(s) | TE Params | Auto Encoder | Other |
|---|---|---|---|---|---|---|---|---|---|
| StabilityAI | Stable Diffusion | 1.5 | 2.28GB | UNet | 0.86B | CLiP ViT-L | 0.12B | VAE | |
| StabilityAI | Stable Diffusion | 2.1 | 2.58GB | UNet | 0.86B | CLiP ViT-H | 0.34B | VAE | |
| StabilityAI | Stable Diffusion | XL | 6.94GB | UNet | 2.56B | CLiP ViT-L + ViT+G | 0.12B + 0.69B | VAE | |
| StabilityAI | Stable Diffusion | 3.0 Medium | 15.14GB | MMDiT | 2.0B | CLiP ViT-L + ViT+G + T5-XXL | 0.12B + 0.69B + 4.76B | 16ch VAE | |
| StabilityAI | Stable Diffusion | 3.5 Medium | 15.89GB | MMDiT | 2.25B | CLiP ViT-L + ViT+G + T5-XXL | 0.12B + 0.69B + 4.76B | 16ch VAE | |
| StabilityAI | Stable Diffusion | 3.5 Large | 26.98GB | MMDiT | 8.05B | CLiP ViT-L + ViT+G + T5-XXL | 0.12B + 0.69B + 4.76B | 16ch VAE | |
| StabilityAI | Stable Cascade | Medium | 11.82GB | Multi-stage UNet | 1.56B + 3.6B | CLiP ViT-G | 0.69B | 42x VQE | |
| StabilityAI | Stable Cascade | Lite | 4.97GB | Multi-stage UNet | 0.7B + 1.0B | CLiP ViT-G | 0.69B | 42x VQE | |
| Black Forest Labs | Flux | 1 Dev/Schnell | 32.93GB | MMDiT | 11.9B | CLiP ViT-L + T5-XXL | 0.12B + 4.76B | 16ch VAE | |
| Ostris | Flex | 1 Alpha | 25.65GB | MMDiT | 4.0B | CLiP ViT-L + T5-XXL | 0.12B + 2.95B | 16ch VAE | |
| NVLabs | Sana | 1.5 1.6B | 9.49GB | MMDiT | 1.60B | Gemma2 | 2.61B | DC-AE | |
| NVLabs | Sana | 1.5 4.8B | 15.58GB | MMDiT | 4.72B | Gemma2 | 2.61B | DC-AE | |
| NVLabs | Sana | 1.0 1600M | 12.63GB | MMDiT | 1.60B | Gemma2 | 2.61B | DC-AE | |
| NVLabs | Sana | 1.0 600M | 7.51GB | MMDiT | 0.59B | Gemma2 | 2.61B | DC-AE | |
| FAL | AuraFlow | 0.3 | 31.90GB | MMDiT | 6.8B | UMT5 | 12.1B | VAE | |
| AlphaVLLM | Lumina | Next SFT | 8.67GB | DiT | 1.7B | Gemma | 2.5B | VAE | |
| AlphaVLLM | Lumina | 2 | 20.75GB | DiT | 2.61B | Gemma-2 | 2.61B | 16ch VAE | |
| PixArt | Alpha | XL 2 | 21.3GB | DiT | 0.61B | T5-XXL | 4.76B | VAE | |
| PixArt | Sigma | XL 2 | 21.3GB | DiT | 0.61B | T5-XXL | 4.76B | VAE | |
| Segmind | SSD-1B | N/A | 8.72GB | UNet | 1.33B | CLiP ViT-L + ViT+G | 0.12B + 0.69B | VAE | |
| Segmind | Vega | N/A | 6.43GB | UNet | 0.75B | CLiP ViT-L + ViT+G | 0.12B + 0.69B | VAE | |
| Segmind | Tiny | N/A | 1.03GB | UNet | 0.32B | CLiP ViT-L | 0.12B | VAE | |
| Kwai | Kolors | N/A | 17.40GB | UNet | 2.58B | ChatGLM | 6.24B | VAE | |
| PlaygroundAI | Playground | 1.0 | 4.95GB | UNet | 0.86B | CLiP ViT-L | 0.12B | VAE | |
| PlaygroundAI | Playground | 2.x | 13.35GB | UNet | 2.56B | CLiP ViT-L + ViT+G | 0.12B + 0.69B | VAE | |
| Tencent | HunyuanDiT | 1.2 | 14.09GB | DiT | 1.5B | BERT + T5-XL | 3.52B + 1.67B | VAE | |
| Warp AI | Wuerstchen | N/A | 12.16GB | Multi-stage UNet | 1.0B + 1.05B | CLiP ViT-L + ViT+G | 0.12B + 0.69B | 42x VQE | |
| Kandinsky | Kandinsky | 2.2 | 5.15GB | UNet | 1.25B | CLiP ViT-G | 0.69B | VQ | |
| Kandinsky | Kandinsky | 3.0 | 27.72GB | UNet | 3.05B | T5-XXXL | 8.72B | VQ | |
| Thudm | CogView | 3 Plus | 24.96GB | DiT | 2.85B | T5-XXL | 4.76B | VAE | |
| Thudm | CogView | 4 | 30.39GB | DiT | 6.37B | GLM-4 | 9.40B | VAE | |
| IDKiro | SDXS | N/A | 2.05GB | UNet | 0.32B | CLiP ViT-L | 0.12B | VAE | |
| Open-MUSE | aMUSEd | 256 | 3.41GB | ViT | 0.60B | CLiP ViT-L | 0.12B | VQ | |
| Koala | Koala | 700M | 6.58GB | UNet | 0.78B | CLiP ViT-L + ViT+G | 0.12B + 0.69B | VAE | |
| Thu-ML | UniDiffuser | v1 | 5.37GB | U-ViT | 0.95B | CLiP ViT-L + CLiP ViT-B | 0.12B + 0.16B | VAE | |
| Salesforce | BLIP-Diffusion | N/A | 7.23GB | UNet | 0.86B | CLiP ViT-L + BLiP-2 | 0.12B + 0.49B | VAE | |
| DeepFloyd | IF | M | 12.79GB | Multi-stage UNet | 0.37B + 0.46B | T5-XXL | 4.76B | Pixel | |
| DeepFloyd | IF | L | 15.48GB | Multi-stage UNet | 0.61B + 0.93B | T5-XXL | 4.76B | Pixel | |
| MeissonFlow | Meissonic | N/A | 3.64GB | DiT | 1.18B | CLiP ViT-H | 0.35B | VQ | |
| VectorSpaceLab | OmniGen | v1 | 15.47GB | Transformer | 3.76B | None | 0 | VAE | Phi-3 |
| HiDream-AI | HiDream | I2 Fast/Dev/Full | 42.71 GB + 15.69 | MMDiT | 17.10B | CLiP ViT-L + ViT+G + T5-XXL + LLama-3.1-8B | 0.12B + 0.69B + 2.95B + 4.54B | 16ch VAE | |
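The Size column can be roughly cross-checked against the Model Params and TE Params columns. The sketch below assumes 2 bytes per parameter (fp16/bf16 weights), ignores the autoencoder and any file overhead, and reports decimal GB. These are assumptions made for this example, not something the table states. The function name `estimate_gb` is made up.

```python
def estimate_gb(param_counts_billion, bytes_per_param=2):
    """Rough weight size in decimal GB for a list of parameter counts (in billions).

    Assumes every parameter is stored in `bytes_per_param` bytes
    (2 for fp16/bf16) and ignores the autoencoder and file overhead.
    """
    total_params = sum(param_counts_billion) * 1e9
    return total_params * bytes_per_param / 1e9


# (diffusion model params + text encoder params, listed size) from the table above.
models = {
    "Stable Diffusion 1.5": ([0.86, 0.12], "2.28GB"),
    "Stable Diffusion XL": ([2.56, 0.12, 0.69], "6.94GB"),
    "Flux 1 Dev/Schnell": ([11.9, 0.12, 4.76], "32.93GB"),
}

for name, (params, listed) in models.items():
    print(f"{name}: estimated {estimate_gb(params):.2f} GB, listed {listed}")
```

For Stable Diffusion 1.5 this gives about 1.96 GB against the listed 2.28GB, and for Stable Diffusion XL about 6.74 GB against 6.94GB. The estimate is only a sanity check; rows whose listed size is far larger than the estimate probably store weights at higher precision or bundle extra components.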