Info
https://huggingface.co/tencent/HunyuanImage-2.1
...
| Code Block |
|---|
# Examples of supported resolutions and aspect ratios for HunyuanImage-2.1:
# 16:9 -> width=2560, height=1536
# 4:3 -> width=2304, height=1792
# 1:1 -> width=2048, height=2048
# 3:4 -> width=1792, height=2304
# 9:16 -> width=1536, height=2560
# Please use one of the above width/height pairs for best results.
width=2048,
height=2048,
use_reprompt=False, # Enable prompt enhancement (which may result in higher GPU memory usage)
use_refiner=True, # Enable refiner model
# For the distilled model, use 8 steps for faster inference.
# For the non-distilled model, use 50 steps for better quality.
num_inference_steps=8 if "distilled" in model_name else 50,
guidance_scale=3.25 if "distilled" in model_name else 3.5,
shift=4 if "distilled" in model_name else 5,
seed=649151,
|
1024 vs 2048
| 1024 | 2048 |
|---|---|
Prompt: A cute, cartoon-style anthropomorphic penguin plush toy with fluffy fur, standing in a painting studio, wearing a red knitted scarf and a red beret with the word “Hunyuan Image” on it, holding a paintbrush with a focused expression as it paints an oil painting of the Mona Lisa, rendered in a photorealistic photographic style. Number 17B is handwritten over the image in the top left corner. Parameters: Steps: 8| Size: 1024x1024| Seed: 32| CFG scale: 3.25| App: SD.Next| Version: 187943c| Pipeline: HunyuanImagePipeline| Operations: txt2img| Model: HunyuanImage-2.1-Distilled-Diffusers Time: 2m 0.26s | total 148.30 pipeline 120.19 callback 18.66 te 7.46 vae 1.55 move 0.36 | GPU 52678 MB 41% | RAM 84.28 GB 67% | Prompt: A cute, cartoon-style anthropomorphic penguin plush toy with fluffy fur, standing in a painting studio, wearing a red knitted scarf and a red beret with the word “Hunyuan Image” on it, holding a paintbrush with a focused expression as it paints an oil painting of the Mona Lisa, rendered in a photorealistic photographic style. Number 17B is handwritten over the image in the top left corner. Parameters: Steps: 8| Size: 2048x2048| Seed: 32| CFG scale: 3.25| App: SD.Next| Version: 187943c| Pipeline: HunyuanImagePipeline| Operations: txt2img| Model: HunyuanImage-2.1-Distilled-Diffusers Time: 5m 52.39s | total 359.75 pipeline 352.28 te 4.51 vae 1.45 callback 1.38 | GPU 59348 MB 46% | RAM 90.14 GB 72% |
Test 0 - Different seed variations
Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling
...
Prompt: Generate a photo of a woman's legs, with her feet crossed and wearing white high-heeled shoes with ribbons tied around her ankles. The shoes should have a pointed toe and a stiletto heel. The woman's legs should be smooth and tanned, with a slight sheen to them. The background should be a light gray color. The photo should be taken from a low angle, looking up at the woman's legs. The ribbons should be tied in a bow shape around the ankles. The shoes should have a red sole. The woman's legs should be slightly bent at the knee.
1024px
| CFG3.5, STEP 50 | Seed: 1620085323 | Seed:1931701040 | Seed:4075624134 | Seed:2736029172 |
|---|---|---|---|---|
bookshop girl | ||||
| hand and face | ||||
| legs and shoes |
2048px
| CFG3.5, STEP 50 | Seed: 1620085323 | Seed:1931701040 | Seed:4075624134 | Seed:2736029172 |
|---|---|---|---|---|
bookshop girl | ||||
| hand and face | ||||
| legs and shoes |
Test 1 - Bookshop
Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling
...
| 4 | 8 | 16 | 32 | 64 | |
|---|---|---|---|---|---|
CFG1 CFG2 CFG3 CFG4 CFG5 CFG6 CFG8 |
Test 2 - Face and hand
Prompt: Create a close-up photograph of a woman's face and hand, with her hand raised to her chin. She is wearing a white blazer and has a gold ring on her finger. Her nails are neatly manicured and her hair is pulled back into a low bun. She is smiling and has a radiant expression on her face. The background is a plain light gray color. The overall mood of the photo is elegant and sophisticated. The photo should have a soft, natural light and a slight warmth to it. The woman's hair is dark brown and pulled back into a low bun, with a few loose strands framing her face.
| 8 | 16 | 20 | 32 | |
|---|---|---|---|---|
CFG3 |
Test 3 - Legs
Prompt: Generate a photo of a woman's legs, with her feet crossed and wearing white high-heeled shoes with ribbons tied around her ankles. The shoes should have a pointed toe and a stiletto heel. The woman's legs should be smooth and tanned, with a slight sheen to them. The background should be a light gray color. The photo should be taken from a low angle, looking up at the woman's legs. The ribbons should be tied in a bow shape around the ankles. The shoes should have a red sole. The woman's legs should be slightly bent at the knee.
| 8 | 16 | 20 | 32 | |
|---|---|---|---|---|
CFG3 |
Test 4 - Other model Covers
512px
1024px
2048px
System info
| Code Block |
|---|
Sat Oct 25 12:53:29 2025 app: sdnext.git updated: 2025-10-24 hash: 88ac83839 url: https://github.com/liutyi/sdnext.git/tree/pytorch arch: x86_64 cpu: x86_64 system: Linux release: 6.14.0-33-generic python: 3.12.3 python: 3.12.3 Torch: 2.9.0+xpu device: Intel(R) Arc(TM) Graphics (1) ipex: ram: free:119.7 used:5.63 total:125.33 xformers: diffusers: 0.36.0.dev0 transformers: 4.57.1 active: xpu dtype: torch.bfloat16 vae: torch.bfloat16 unet: torch.bfloat16 base: Diffusers/hunyuanvideo-community/HunyuanImage-2.1-Diffusers [7e7b7a177d] refiner: none vae: none te: none unet: none Backend: ipex Pipeline: native Memory optimization: none Cross-attention: Scaled-Dot-Product |
Config
| Code Block |
|---|
"huggingface_token": "hf_..FraU", "diffusers_version": "7536f647e4144c7acaf9e140893ff7edb85bf9a3", "sd_model_checkpoint": "hunyuanvideo-community/HunyuanImage-2.1-Diffusers", "sd_checkpoint_hash": null, "diffusers_to_gpu": true, "device_map": "gpu", "model_wan_stage": "combined", "diffusers_offload_mode": "none", "ui_request_timeout": 300000, "show_progress_type": "Simple" |
Model info
hunyuanvideo-community/HunyuanImage-2.1-Diffusers [7e7b7a177d]
...

