Info

https://huggingface.co/tencent/HunyuanImage-2.1

...

Code Block
    # Examples of supported resolutions and aspect ratios for HunyuanImage-2.1:
    # 16:9  -> width=2560, height=1536
    # 4:3   -> width=2304, height=1792
    # 1:1   -> width=2048, height=2048
    # 3:4   -> width=1792, height=2304
    # 9:16  -> width=1536, height=2560
    # Please use one of the above width/height pairs for best results.
    width=2048,
    height=2048,
    use_reprompt=False,  # Prompt enhancement; disabled here (enabling it may increase GPU memory usage)
    use_refiner=True,    # Enable the refiner model
    # For the distilled model, use 8 steps for faster inference.
    # For the non-distilled model, use 50 steps for better quality.
    num_inference_steps=8 if "distilled" in model_name else 50,
    guidance_scale=3.25 if "distilled" in model_name else 3.5,
    shift=4 if "distilled" in model_name else 5,
    seed=649151,
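
The settings above can be gathered into one helper so that the resolution table and the distilled/non-distilled switches live in a single place. This is only a minimal sketch: `SUPPORTED_RESOLUTIONS` and `build_generation_kwargs` are hypothetical names, not part of any library, and the model-name strings in the usage lines are placeholders chosen for illustration.

```python
# Width/height pairs copied from the comment block above.
SUPPORTED_RESOLUTIONS = {
    "16:9": (2560, 1536),
    "4:3": (2304, 1792),
    "1:1": (2048, 2048),
    "3:4": (1792, 2304),
    "9:16": (1536, 2560),
}


def build_generation_kwargs(model_name: str, aspect_ratio: str = "1:1", seed: int = 649151) -> dict:
    """Hypothetical helper: collect the keyword arguments shown above into a dict."""
    if aspect_ratio not in SUPPORTED_RESOLUTIONS:
        raise ValueError(f"unsupported aspect ratio: {aspect_ratio!r}")
    width, height = SUPPORTED_RESOLUTIONS[aspect_ratio]
    distilled = "distilled" in model_name
    return {
        "width": width,
        "height": height,
        "use_reprompt": False,
        "use_refiner": True,
        # 8 steps for the distilled model, 50 for the non-distilled one.
        "num_inference_steps": 8 if distilled else 50,
        "guidance_scale": 3.25 if distilled else 3.5,
        "shift": 4 if distilled else 5,
        "seed": seed,
    }


if __name__ == "__main__":
    # Placeholder model names; only the substring "distilled" matters here.
    print(build_generation_kwargs("example-model-distilled", "16:9"))
    print(build_generation_kwargs("example-model"))
```

Rejecting unknown aspect ratios up front keeps a run from silently using a width/height pair outside the list above.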

1024 vs 2048

1024 | 2048

Prompt: A cute, cartoon-style anthropomorphic penguin plush toy with fluffy fur, standing in a painting studio, wearing a red knitted scarf and a red beret with the word “Hunyuan Image” on it, holding a paintbrush with a focused expression as it paints an oil painting of the Mona Lisa, rendered in a photorealistic photographic style. Number 17B is handwritten over the image in the top left corner.

Parameters: Steps: 50| Size: 1024x1024| Seed: 32| CFG scale: 3.5| App: SD.Next| Version: 88ac838| Pipeline: HunyuanImagePipeline| Operations: txt2img| Model: HunyuanImage-2.1-Diffusers

Time: 23m 2.80s | total 1507.64 pipeline 1382.77 callback 117.19 te 6.66 vae 0.99 | GPU 52508 MB 41% | RAM 62.13 GB 50%

Prompt: A cute, cartoon-style anthropomorphic penguin plush toy with fluffy fur, standing in a painting studio, wearing a red knitted scarf and a red beret with the word “Hunyuan Image” on it, holding a paintbrush with a focused expression as it paints an oil painting of the Mona Lisa, rendered in a photorealistic photographic style. Number 17B is handwritten over the image in the top left corner.

Parameters: Steps: 50| Size: 2048x2048| Seed: 32| CFG scale: 3.5| App: SD.Next| Version: 88ac838| Pipeline: HunyuanImagePipeline| Operations: txt2img| Model: HunyuanImage-2.1-Diffusers

Time: 66m 58.09s | total 4316.31 pipeline 4017.98 callback 285.59 te 9.64 vae 2.63 move 0.37 | GPU 58322 MB 45% | RAM 70.51 GB 56%
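
To compare the two runs numerically, the "Time:" lines can be parsed and the 2048px figures divided by the 1024px ones. The sketch below assumes the line layout seen in the two lines above (a stage list between the first and second `|`, then a `GPU <n> MB` field). `parse_time_line` is a hypothetical helper written for this page, not an SD.Next function.

```python
import re

# The two "Time:" lines reported above, copied verbatim.
TIME_1024 = ("Time: 23m 2.80s | total 1507.64 pipeline 1382.77 callback 117.19 "
             "te 6.66 vae 0.99 | GPU 52508 MB 41% | RAM 62.13 GB 50%")
TIME_2048 = ("Time: 66m 58.09s | total 4316.31 pipeline 4017.98 callback 285.59 "
             "te 9.64 vae 2.63 move 0.37 | GPU 58322 MB 45% | RAM 70.51 GB 56%")


def parse_time_line(line: str) -> dict:
    """Extract per-stage timings (seconds) and the reported GPU memory (MB)."""
    parts = [p.strip() for p in line.split("|")]
    # parts[1] looks like "total 1507.64 pipeline 1382.77 ..."
    stages = {name: float(value)
              for name, value in re.findall(r"([a-z]+) ([\d.]+)", parts[1])}
    gpu_mb = int(re.search(r"GPU (\d+) MB", line).group(1))
    return {"stages": stages, "gpu_mb": gpu_mb}


if __name__ == "__main__":
    small = parse_time_line(TIME_1024)
    large = parse_time_line(TIME_2048)
    for stage in ("total", "pipeline"):
        ratio = large["stages"][stage] / small["stages"][stage]
        print(f"{stage}: 2048px takes {ratio:.2f}x the 1024px time")
    print("extra GPU memory (MB):", large["gpu_mb"] - small["gpu_mb"])
```

On these two lines the total-time ratio comes out a little under 3x for 4x the pixel count. This is a single pair of runs, so it should be read as an indication only.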

Test 0 - Different seed variations

Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling

...

Prompt: Generate a photo of a woman's legs, with her feet crossed and wearing white high-heeled shoes with ribbons tied around her ankles. The shoes should have a pointed toe and a stiletto heel. The woman's legs should be smooth and tanned, with a slight sheen to them. The background should be a light gray color. The photo should be taken from a low angle, looking up at the woman's legs. The ribbons should be tied in a bow shape around the ankles. The shoes should have a red sole. The woman's legs should be slightly bent at the knee.


1024px

CFG 3.5, 50 steps | Seed: 1620085323 | Seed: 1931701040 | Seed: 4075624134 | Seed: 2736029172

bookshop girl

1024px

hand and face

legs and shoes

2048px

CFG 3.5, 50 steps | Seed: 1620085323 | Seed: 1931701040 | Seed: 4075624134 | Seed: 2736029172

bookshop girl






hand and face



legs and shoes




Test 1 - Bookshop

Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling

...


Steps: 4 | 8 | 16 | 32 | 64

CFG1






CFG2






CFG3






CFG4






CFG5






CFG6






CFG8






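
The Test 1 table is a sweep over guidance scale (rows) and step count (columns). A minimal sketch of enumerating such a grid follows. It assumes the fused column header "48163264" reads as 4, 8, 16, 32 and 64 steps. `sweep` is a hypothetical helper, and the dict keys merely mirror the parameter names used in the code block near the top of the page. The seed used for this test is not stated on the page, so none is set here.

```python
from itertools import product

# Rows of the Test 1 table (CFG scale) and its columns (step counts, as assumed above).
CFG_SCALES = [1, 2, 3, 4, 5, 6, 8]
STEP_COUNTS = [4, 8, 16, 32, 64]


def sweep(cfg_scales, step_counts):
    """Yield one settings dict per table cell, row by row."""
    for cfg, steps in product(cfg_scales, step_counts):
        yield {"guidance_scale": cfg, "num_inference_steps": steps}


if __name__ == "__main__":
    cells = list(sweep(CFG_SCALES, STEP_COUNTS))
    print(len(cells), "combinations")
    print(cells[0], cells[-1])
```

Each yielded dict corresponds to one image cell, so the same loop could drive whichever generation call is in use.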

Test 2 - Face and hand

Prompt: Create a close-up photograph of a woman's face and hand, with her hand raised to her chin. She is wearing a white blazer and has a gold ring on her finger. Her nails are neatly manicured and her hair is pulled back into a low bun. She is smiling and has a radiant expression on her face. The background is a plain light gray color. The overall mood of the photo is elegant and sophisticated. The photo should have a soft, natural light and a slight warmth to it. The woman's hair is dark brown and pulled back into a low bun, with a few loose strands framing her face.

...


Steps: 8 | 16 | 20 | 32

CFG1





CFG2





CFG3





CFG3.5





CFG4





CFG5





CFG8





Test 3 - Legs

Prompt: Generate a photo of a woman's legs, with her feet crossed and wearing white high-heeled shoes with ribbons tied around her ankles. The shoes should have a pointed toe and a stiletto heel. The woman's legs should be smooth and tanned, with a slight sheen to them. The background should be a light gray color. The photo should be taken from a low angle, looking up at the woman's legs. The ribbons should be tied in a bow shape around the ankles. The shoes should have a red sole. The woman's legs should be slightly bent at the knee.

...


Steps: 8 | 16 | 20 | 32

CFG1





CFG2





CFG3





CFG3.5





CFG4





CFG5





CFG8





Test 4 - Other model covers

1024px


System info


Code Block
Sat Oct 25 12:53:29 2025
app: sdnext.git updated: 2025-10-24 hash: 88ac83839 url: https://github.com/liutyi/sdnext.git/tree/pytorch
arch: x86_64 cpu: x86_64 system: Linux release: 6.14.0-33-generic
python: 3.12.3 python: 3.12.3 Torch: 2.9.0+xpu
device: Intel(R) Arc(TM) Graphics (1) ipex: 
ram: free:119.7 used:5.63 total:125.33
xformers: diffusers: 0.36.0.dev0 transformers: 4.57.1
active: xpu dtype: torch.bfloat16 vae: torch.bfloat16 unet: torch.bfloat16
base: hunyuanvideo-community/HunyuanImage-2.1-Diffusers refiner: none vae: none te: none unet: none
Backend: ipex Pipeline: native Memory optimization: none Cross-attention: Scaled-Dot-Product


Config

Code Block
  "huggingface_token": "hf_..FraU",
  "diffusers_version": "7536f647e4144c7acaf9e140893ff7edb85bf9a3",
  "sd_model_checkpoint": "hunyuanvideo-community/HunyuanImage-2.1-Diffusers",
  "sd_checkpoint_hash": null,
  "diffusers_to_gpu": true,
  "device_map": "gpu",
  "model_wan_stage": "combined",
  "diffusers_offload_mode": "none",
  "ui_request_timeout": 300000,
  "show_progress_type": "Simple"


Model info

Diffusers/lodestones/Chroma1-HD [ca9e916ceb]

...