Info

https://huggingface.co/Qwen/Qwen-Image
    num_inference_steps=50,
    true_cfg_scale=4.0,

Test 0 - Different seed variations

CFG6, STEP 20	Seed: 1620085323	Seed:1931701040	Seed:4075624134	Seed:2736029172
bookshop girl
hand and face
legs and shoes

Test 1 - Bookshop

Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling

	8	16	32	50	100
CFG1
CFG2
CFG3
CFG4
CFG5	skipped, cause CFG has no impact on the result
CFG6
CFG8

Test 2 - Face and hand - different attention guidance

Prompt: Create a close-up photograph of a woman's face and hand, with her hand raised to her chin. She is wearing a white blazer and has a gold ring on her finger. Her nails are neatly manicured and her hair is pulled back into a low bun. She is smiling and has a radiant expression on her face. The background is a plain light gray color. The overall mood of the photo is elegant and sophisticated. The photo should have a soft, natural light and a slight warmth to it. The woman's hair is dark brown and pulled back into a low bun, with a few loose strands framing her face.

	8	16	32	50
CFG2 AG 0
CFG2 AG 1
CFG2 AG 2
CFG2 AG 3
CFG2 AG 4
CFG2 AG 5

Test 3 - Legs

Prompt: Generate a photo of a woman's legs, with her feet crossed and wearing white high-heeled shoes with ribbons tied around her ankles. The shoes should have a pointed toe and a stiletto heel. The woman's legs should be smooth and tanned, with a slight sheen to them. The background should be a light gray color. The photo should be taken from a low angle, looking up at the woman's legs. The ribbons should be tied in a bow shape around the ankles. The shoes should have a red sole. The woman's legs should be slightly bent at the knee.

Time: 29m 43.41s | total 1784.61 pipeline 1783.38 te 1.17 | GPU 68698 MB 54% | RAM 3.21 GB 3%

	8	16	32	50
CFG2

Test 4 CivitAi profile cover generation

1600x400 profile imge for https://civitai.com/user/liutyi

DevOps > Test 40 - Qwen Image > 00073-2025-08-06-Qwen-Image-400x1600-Seed18009668-CFG6-STEP50.jpg

Prompt: image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "Qwen Image"

DevOps > Test 40 - Qwen Image > 00074-2025-08-06-Qwen-Image-400x1600-Seed1903536064-CFG4-STEP50.jpg

Prompt: with analog noise and glitches, CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand.

DevOps > Test 40 - Qwen Image > 00075-2025-08-06-Qwen-Image-400x1600-Seed2833724372-CFG4-STEP50.jpg

Prompt: with analog noise and glitches, CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "SD.Next + Qwen Image", upper right corner text "Stable Diffusion at home"

DevOps > Test 40 - Qwen Image > 00076-2025-08-06-Qwen-Image-400x1600-Seed823713865-CFG4-STEP50.jpg

Prompt: 80s cartoon style, CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is suprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "SD.Next + Qwen Image", uper right corner text "Stable Diffusion at home"

DevOps > Test 40 - Qwen Image > 00077-2025-08-06-Qwen-Image-400x1600-Seed1672567154-CFG4-STEP50.jpg

Prompt: 3d render style, CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "SD.Next + Qwen Image", uper right corner text "Stable Diffusion at home"

DevOps > Test 40 - Qwen Image > 00078-2025-08-06-Qwen-Image-400x1600-Seed3228978157-CFG4-STEP50.jpg

Prompt: Retro poster style, CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "SD.Next + Qwen Image", upper right corner text "Stable Diffusion at home"

DevOps > Test 40 - Qwen Image > 00079-2025-08-06-Qwen-Image-400x1600-Seed1179249187-CFG4-STEP50.jpg

Prompt: LEGO style, CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "SD.Next + Qwen Image", upper right corner text "Stable Diffusion at home"

DevOps > Test 40 - Qwen Image > 00080-2025-08-06-Qwen-Image-400x1600-Seed978455805-CFG4-STEP50.jpg

Prompt: CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "SD.Next + Qwen Image", upper right corner text "Stable Diffusion at home"

System info

Tue Aug  5 20:35:18 2025
app: sdnext.git updated: 2025-08-04hash: 1d37a254 url: https://github.com/vladmandic/sdnext.git/tree/dev
arch: x86_64 cpu: x86_64 system: Linux release: 6.14.0-27-generic
python: 3.12.3 Torch: 2.7.1+xpu
device: Intel(R) Arc(TM) Graphics (1) ipex: 
ram: free:122.16 used:3.17 total:125.33
xformers:  diffusers: 0.35.0.dev0 transformers: 4.54.1
active: xpu dtype: torch.bfloat16 vae: torch.bfloat16 unet: torch.bfloat16 
base: Qwen/Qwen-Image refiner: None vae: Automatic te: Default unet: Default
Backend: ipex Cross-attention: Scaled-Dot-Product

Config

{
  "sd_model_checkpoint": "Qwen/Qwen-Image",
  "diffusers_to_gpu": true,
  "device_map": "gpu",
  "diffusers_offload_mode": "none",
  "samples_filename_pattern": "[seq]-[date]-[model_name]-[height]x[width]-Seed[seed]-CFG[cfg]-STEP[steps]",
  "diffusers_version": "7ea065c5070a5278259e6f1effa9dccea232e62a",
  "sd_checkpoint_hash": null
}

Model info

Module	Class	Device	DType	Params	Modules	Config
vae	AutoencoderKLQwenImage	xpu:0	torch.bfloat16	None	126892531	260	FrozenDict({'base_dim': 96, 'z_dim': 16, 'dim_mult': [1, 2, 4, 4], 'num_res_blocks': 2, 'attn_scales': [], 'temperal_downsample': [False, True, True], 'dropout': 0.0, 'latents_mean': [-0.7571, -0.7089, -0.9113, 0.1075, -0.1745, 0.9653, -0.1517, 1.5508, 0.4134, -0.0715, 0.5517, -0.3632, -0.1922, -0.9497, 0.2503, -0.2921], 'latents_std': [2.8184, 1.4541, 2.3275, 2.6558, 1.2196, 1.7708, 2.6052, 2.0743, 3.2687, 2.1526, 2.8652, 1.5579, 1.6382, 1.1253, 2.8251, 1.916], '_class_name': 'AutoencoderKLQwenImage', '_diffusers_version': '0.34.0.dev0', '_name_or_path': '/mnt/models/Diffusers/models--Qwen--Qwen-Image/snapshots/4516c4d3058302ff35cd86c62ffa645d039fefad/vae'})
text_encoder	Qwen2_5_VLForConditionalGeneration	xpu:0	torch.bfloat16	None	8292166656	763	Qwen2_5_VLConfig { "architectures": [ "Qwen2_5_VLForConditionalGeneration" ], "attention_dropout": 0.0, "bos_token_id": 151643, "eos_token_id": 151645, "hidden_act": "silu", "hidden_size": 3584, "image_token_id": 151655, "initializer_range": 0.02, "intermediate_size": 18944, "max_position_embeddings": 128000, "max_window_layers": 28, "model_type": "qwen2_5_vl", "num_attention_heads": 28, "num_hidden_layers": 28, "num_key_value_heads": 4, "rms_norm_eps": 1e-06, "rope_scaling": { "mrope_section": [ 16, 24, 24 ], "rope_type": "default", "type": "default" }, "rope_theta": 1000000.0, "sliding_window": 32768, "text_config": { "architectures": [ "Qwen2_5_VLForConditionalGeneration" ], "attention_dropout": 0.0, "bos_token_id": 151643, "eos_token_id": 151645, "hidden_act": "silu", "hidden_size": 3584, "image_token_id": null, "initializer_range": 0.02, "intermediate_size": 18944, "layer_types": [ "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention", "full_attention" ], "max_position_embeddings": 128000, "max_window_layers": 28, "model_type": "qwen2_5_vl_text", "num_attention_heads": 28, "num_hidden_layers": 28, "num_key_value_heads": 4, "rms_norm_eps": 1e-06, "rope_scaling": { "mrope_section": [ 16, 24, 24 ], "rope_type": "default", "type": "default" }, "rope_theta": 1000000.0, "sliding_window": null, "torch_dtype": "bfloat16", "use_cache": true, "use_sliding_window": false, "video_token_id": null, "vision_end_token_id": 151653, "vision_start_token_id": 151652, "vision_token_id": 151654, "vocab_size": 152064 }, "tie_word_embeddings": false, "torch_dtype": "bfloat16", "transformers_version": "4.54.1", "use_cache": true, "use_sliding_window": false, "video_token_id": 151656, "vision_config": { "depth": 32, "fullatt_block_indexes": [ 7, 15, 23, 31 ], "hidden_act": "silu", "hidden_size": 1280, "in_channels": 3, "in_chans": 3, "initializer_range": 0.02, "intermediate_size": 3420, "model_type": "qwen2_5_vl", "num_heads": 16, "out_hidden_size": 3584, "patch_size": 14, "spatial_merge_size": 2, "spatial_patch_size": 14, "temporal_patch_size": 2, "tokens_per_second": 2, "torch_dtype": "bfloat16", "window_size": 112 }, "vision_end_token_id": 151653, "vision_start_token_id": 151652, "vision_token_id": 151654, "vocab_size": 152064 }
tokenizer	Qwen2Tokenizer	None	None	None	0	0	None
transformer	QwenImageTransformer2DModel	xpu:0	torch.bfloat16	None	20430401088	2297	FrozenDict({'patch_size': 2, 'in_channels': 64, 'out_channels': 16, 'num_layers': 60, 'attention_head_dim': 128, 'num_attention_heads': 24, 'joint_attention_dim': 3584, 'guidance_embeds': False, 'axes_dims_rope': [16, 56, 56], '_class_name': 'QwenImageTransformer2DModel', '_diffusers_version': '0.34.0.dev0', 'pooled_projection_dim': 768, '_name_or_path': 'Qwen/Qwen-Image'})
scheduler	FlowMatchEulerDiscreteScheduler	None	None	None	0	0	FrozenDict({'num_train_timesteps': 1000, 'shift': 1.0, 'use_dynamic_shifting': True, 'base_shift': 0.5, 'max_shift': 0.9, 'base_image_seq_len': 256, 'max_image_seq_len': 8192, 'invert_sigmas': False, 'shift_terminal': 0.02, 'use_karras_sigmas': False, 'use_exponential_sigmas': False, 'use_beta_sigmas': False, 'time_shift_type': 'exponential', 'stochastic_sampling': False, '_class_name': 'FlowMatchEulerDiscreteScheduler', '_diffusers_version': '0.34.0.dev0'})
_name_or_path	str	None	None	None	0	0	None
_class_name	str	None	None	None	0	0	None
_diffusers_version	str	None	None	None	0	0	None