App: https://github.com/vladmandic/sdnext/tree/master Version 2025-06-30 (ipex)
Model: https://huggingface.co/lodestones/Chroma
HW: MOREFINE S800, Intel Core Ultra 185H, 128 (2x64) GB DDR5 5600 CL46
Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling
Parameters: Steps: 50| Size: 768x768| Sampler: DPM2++ 2M FlowMatch| Seed: 2544969645| CFG scale: 7| Model: Chroma| App: SD.Next| Version: 0d7c025| Operations: txt2img| Pipeline: ChromaPipeline
Execution: Time: 21m 19.45s | total 1291.22 pipeline 1269.24 decode 6.51 offload 4.39 move 3.68 prompt 3.68 te 3.67 | GPU 27376 MB 21% | RAM 19.59 GB 16%
| STEPS: 2 | STEPS: 4 | STEPS: 8 | STEPS: 16 | STEPS: 20 | STEPS: 32 | STEPS: 50 | |
|---|---|---|---|---|---|---|---|
| CFG0 |
|
|
|
|
|
|
|
| CFG1 | same as CFG0 | same as CFG0 | same as CFG0 | same as CFG0 | same as CFG0 | same as CFG0 | same as CFG0 |
| CFG2 |
|
|
|
|
|
|
|
| CFG3 |
|
|
|
|
|
|
|
| CFG4 |
|
|
|
|
|
|
|
| CFG5 |
|
|
|
|
|
|
|
| CFG6 |
|
|
|
|
|
|
|
| CFG7 |
|
|
|
|
|
|
|
Prompt: Create a close-up photograph of a woman's face and hand, with her hand raised to her chin. She is wearing a white blazer and has a gold ring on her finger. Her nails are neatly manicured and her hair is pulled back into a low bun. She is smiling and has a radiant expression on her face. The background is a plain light gray color. The overall mood of the photo is elegant and sophisticated. The photo should have a soft, natural light and a slight warmth to it. The woman's hair is dark brown and pulled back into a low bun, with a few loose strands framing her face.
Parameters: Steps: 8| Size: 768x768| Sampler: DPM2++ 2M FlowMatch| Seed: 2544969645| Model: Chroma| App: SD.Next| Version: 0d7c025| Operations: txt2img| Pipeline: ChromaPipeline
Execution: Time: 1m 55.76s | total 127.52 pipeline 105.48 decode 6.58 offload 4.41 move 3.68 prompt 3.67 te 3.66 | GPU 27360 MB 21% | RAM 19.58 GB 16%
Prompt: Create a close-up photograph of a woman's face and hand, with her hand raised to her chin. She is wearing a white blazer and has a gold ring on her finger. Her nails are neatly manicured and her hair is pulled back into a low bun. She is smiling and has a radiant expression on her face. The background is a plain light gray color. The overall mood of the photo is elegant and sophisticated. The photo should have a soft, natural light and a slight warmth to it. The woman's hair is dark brown and pulled back into a low bun, with a few loose strands framing her face.
Parameters: Steps: 8| Size: 768x768| Sampler: DPM2++ 2M FlowMatch| Seed: 2544969645| CFG scale: 1.5| Model: Chroma| App: SD.Next| Version: 0d7c025| Operations: txt2img| Pipeline: ChromaPipeline
Execution: Time: 3m 33.19s | total 217.65 pipeline 206.56 decode 6.61 offload 4.44 | GPU 27360 MB 21% | RAM 19.59 GB 16%
| 8 | 16 | 20 | 32 | |
|---|---|---|---|---|
| CFG=1 |
|
|
| |
| CFG=1.5 |
|
|
| |
| CFG=2 |
|
|
| |
| CFG=4 |
|
|
| |
| CFG=8 |
|
|
|
Prompt: Generate a photo of a woman's legs, with her feet crossed and wearing white high-heeled shoes with ribbons tied around her ankles. The shoes should have a pointed toe and a stiletto heel. The woman's legs should be smooth and tanned, with a slight sheen to them. The background should be a light gray color. The photo should be taken from a low angle, looking up at the woman's legs. The ribbons should be tied in a bow shape around the ankles. The shoes should have a red sole. The woman's legs should be slightly bent at the knee.
| 8 | 16 | 20 | 32 | |
|---|---|---|---|---|
| CFG=1 | ||||
| CFG=1.5 | ||||
| CFG=2 | ||||
| CFG=4 | ||||
| CFG=8 |
app: sdnext.git updated: 2025-06-30 hash: 0d7c025a url: https://github.com/vladmandic/sdnext.git/tree/master arch: x86_64 cpu: x86_64 system: Linux release: 6.11.0-29-generic python: 3.12.3 Torch 2.7.1+xpu device: Intel(R) Arc(TM) Graphics (1) ipex: ram: free:122.36 used:2.97 total:125.33 xformers: diffusers: 0.35.0.dev0 transformers: 4.53.0 active: xpu dtype: torch.bfloat16 vae: torch.bfloat16 unet: torch.bfloat16 base: Diffusers/lodestones/Chroma [c98e7c057c] refiner: none vae: none te: none unet: none |
Model: Diffusers/lodestones/Chroma Type: chroma Class: ChromaPipeline Size: 0 bytes Modified: 2025-07-02 23:48:11 |
SD.Next dev 2025-06-29
Module | Class | Device | DType | Params | Modules | Config |
|---|---|---|---|---|---|---|
vae | AutoencoderKL | xpu:0 | torch.bfloat16 | 83819683 | 241 | FrozenDict({'in_channels': 3, 'out_channels': 3, 'down_block_types': ['DownEncoderBlock2D', 'DownEncoderBlock2D', 'DownEncoderBlock2D', 'DownEncoderBlock2D'], 'up_block_types': ['UpDecoderBlock2D', 'UpDecoderBlock2D', 'UpDecoderBlock2D', 'UpDecoderBlock2D'], 'block_out_channels': [128, 256, 512, 512], 'layers_per_block': 2, 'act_fn': 'silu', 'latent_channels': 16, 'norm_num_groups': 32, 'sample_size': 1024, 'scaling_factor': 0.3611, 'shift_factor': 0.1159, 'latents_mean': None, 'latents_std': None, 'force_upcast': True, 'use_quant_conv': False, 'use_post_quant_conv': False, 'mid_block_add_attention': True, '_class_name': 'AutoencoderKL', '_diffusers_version': '0.34.0.dev0', '_name_or_path': '/mnt/models/Diffusers/models--lodestones--Chroma/snapshots/c98e7c057c7e696b1c186c90ae0537b6dd7a81a7/vae'}) |
text_encoder | T5EncoderModel | xpu:0 | torch.bfloat16 | 4762310656 | 463 | T5Config { "architectures": [ "T5EncoderModel" ], "classifier_dropout": 0.0, "d_ff": 10240, "d_kv": 64, "d_model": 4096, "decoder_start_token_id": 0, "dense_act_fn": "gelu_new", "dropout_rate": 0.1, "eos_token_id": 1, "feed_forward_proj": "gated-gelu", "initializer_factor": 1.0, "is_encoder_decoder": true, "is_gated_act": true, "layer_norm_epsilon": 1e-06, "model_type": "t5", "num_decoder_layers": 24, "num_heads": 64, "num_layers": 24, "output_past": true, "pad_token_id": 0, "relative_attention_max_distance": 128, "relative_attention_num_buckets": 32, "tie_word_embeddings": false, "torch_dtype": "bfloat16", "transformers_version": "4.53.0", "use_cache": true, "vocab_size": 32128 } |
tokenizer | T5Tokenizer | None | None | 0 | 0 | None |
transformer | ChromaTransformer2DModel | xpu:0 | torch.bfloat16 | 8899983424 | 1144 | FrozenDict({'patch_size': 1, 'in_channels': 64, 'out_channels': None, 'num_layers': 19, 'num_single_layers': 38, 'attention_head_dim': 128, 'num_attention_heads': 24, 'joint_attention_dim': 4096, 'axes_dims_rope': [16, 56, 56], 'approximator_num_channels': 64, 'approximator_hidden_dim': 5120, 'approximator_layers': 5, '_class_name': 'ChromaTransformer2DModel', '_diffusers_version': '0.34.0.dev0', '_name_or_path': '/mnt/models/Diffusers/models--lodestones--Chroma/snapshots/c98e7c057c7e696b1c186c90ae0537b6dd7a81a7/transformer'}) |
scheduler | FlowMatchDPMSolverMultistepScheduler | None | None | 0 | 0 | FrozenDict({'num_train_timesteps': 1000, 'beta_start': 0.00085, 'beta_end': 0.012, 'beta_schedule': 'linear', 'trained_betas': None, 'solver_order': 2, 'algorithm_type': 'dpmsolver++2M', 'solver_type': 'midpoint', 'sigma_schedule': None, 'shift': 3, 'midpoint_ratio': 0.5, 's_noise': 1.0, 'use_noise_sampler': True, 'use_beta_sigmas': False, 'use_dynamic_shifting': False, 'base_shift': 0.5, 'max_shift': 1.15, 'base_image_seq_len': 256, 'max_image_seq_len': 4096, '_use_default_values': ['midpoint_ratio', 'max_image_seq_len', 'max_shift', 'base_image_seq_len', 'trained_betas', 's_noise', 'solver_type', 'base_shift']}) |
image_encoder | NoneType | None | None | 0 | 0 | None |
feature_extractor | NoneType | None | None | 0 | 0 | None |
_name_or_path | str | None | None | 0 | 0 | None |
_class_name | str | None | None | 0 | 0 | None |
_diffusers_version | str | None | None | 0 | 0 | None |