You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 17 Next »

Info

https://huggingface.co/lodestones/Chroma1-HD

        num_inference_steps=40,
        guidance_scale=3.0,
        negative_prompt =  ["low quality, ugly, unfinished, out of focus, deformed, disfigure, blurry, smudged, restricted palette, flat colors"]

Test 0 - Different seed variations

Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling

Negative: low quality, ugly, unfinished, out of focus

Parameters: Steps: 32| Size: 1024x1024| Seed: 1620085323| CFG scale: 3| App: SD.Next| Version: 5bd6111| Pipeline: ChromaPipeline| Operations: txt2img| Model: Chroma1-HD


Prompt: Create a close-up photograph of a woman's face and hand, with her hand raised to her chin. She is wearing a white blazer and has a gold ring on her finger. Her nails are neatly manicured and her hair is pulled back into a low bun. She is smiling and has a radiant expression on her face. The background is a plain light gray color. The overall mood of the photo is elegant and sophisticated. The photo should have a soft, natural light and a slight warmth to it. The woman's hair is dark brown and pulled back into a low bun, with a few loose strands framing her face.

Negative: low quality, ugly, unfinished, out of focus

Parameters: Steps: 32| Size: 1024x1024| Seed: 1931701040| CFG scale: 3| App: SD.Next| Version: 5bd6111| Pipeline: ChromaPipeline| Operations: txt2img| Model: Chroma1-HD


Prompt: Generate a photo of a woman's legs, with her feet crossed and wearing white high-heeled shoes with ribbons tied around her ankles. The shoes should have a pointed toe and a stiletto heel. The woman's legs should be smooth and tanned, with a slight sheen to them. The background should be a light gray color. The photo should be taken from a low angle, looking up at the woman's legs. The ribbons should be tied in a bow shape around the ankles. The shoes should have a red sole. The woman's legs should be slightly bent at the knee.

Negative: low quality, ugly, unfinished, out of focus

Parameters: Steps: 32| Size: 1024x1024| Seed: 4075624134| CFG scale: 3| App: SD.Next| Version: 5bd6111| Pipeline: ChromaPipeline| Operations: txt2img| Model: Chroma1-HD


CFG3, STEP 32Seed: 1620085323Seed:1931701040Seed:4075624134Seed:2736029172
bookshop girl

hand and face

legs and shoes

Test 1 - Bookshop

Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling

Parameters: Steps: 32| Size: 1024x1024| Seed: 1620085323| CFG scale: 3| App: SD.Next| Version: 5bd6111| Pipeline: ChromaPipeline| Operations: txt2img| Model: Chroma1-HD


48163264

CFG1


CFG2

CFG3

CFG4

CFG5

CFG6

CFG8


Test 2 - Face and hand

Prompt: Create a close-up photograph of a woman's face and hand, with her hand raised to her chin. She is wearing a white blazer and has a gold ring on her finger. Her nails are neatly manicured and her hair is pulled back into a low bun. She is smiling and has a radiant expression on her face. The background is a plain light gray color. The overall mood of the photo is elegant and sophisticated. The photo should have a soft, natural light and a slight warmth to it. The woman's hair is dark brown and pulled back into a low bun, with a few loose strands framing her face.



8162032

CFG1

CFG2

CFG3

CFG3.5

CFG4

CFG5

CFG8

Test 3 - Legs

w/o negative prompt

Prompt: Generate a photo of a woman's legs, with her feet crossed and wearing white high-heeled shoes with ribbons tied around her ankles. The shoes should have a pointed toe and a stiletto heel. The woman's legs should be smooth and tanned, with a slight sheen to them. The background should be a light gray color. The photo should be taken from a low angle, looking up at the woman's legs. The ribbons should be tied in a bow shape around the ankles. The shoes should have a red sole. The woman's legs should be slightly bent at the knee.

Negative prompt added *

Prompt: Generate a photo of a woman's legs, with her feet crossed and wearing white high-heeled shoes with ribbons tied around her ankles. The shoes should have a pointed toe and a stiletto heel. The woman's legs should be smooth and tanned, with a slight sheen to them. The background should be a light gray color. The photo should be taken from a low angle, looking up at the woman's legs. The ribbons should be tied in a bow shape around the ankles. The shoes should have a red sole. The woman's legs should be slightly bent at the knee.

Negative: low quality, ugly, unfinished, out of focus, deformed, disfigure, blurry, smudged, restricted palette, flat colors

Parameters: Steps: 20| Size: 1024x1024| Seed: 4075624134| CFG scale: 3| App: SD.Next| Version: a39404c| Pipeline: ChromaPipeline| Operations: txt2img| Model: Chroma1-HD

Time: 24m 43.33s | total 1391.66 pipeline 1125.90 callback 165.28 preview 86.17 decode 6.40 prompt 3.79 te 3.79 gc 0.29 | GPU 28974 MB 23% | RAM 35.74 GB 29%


8162032

CFG1

CFG2

CFG2.5

CFG3

CFG3*
+ neg
prmpt



CFG4

CFG5

CFG8

Test 4 - Other model Covers

with default negative prompt added

System info


Sun Oct 19 22:14:17 2025
app: sdnext.git updated: 2025-10-19 hash: 5bd6111f1 url: https://github.com/liutyi/sdnext.git/tree/pytorch
arch: x86_64 cpu: x86_64 system: Linux release: 6.14.0-33-generic
python: 3.12.3 python: 3.12.3 Torch: 2.9.0+xpu
device: Intel(R) Arc(TM) Graphics (1) ipex: 
ram: free:121.99 used:3.35 total:125.34
xformers: diffusers: 0.36.0.dev0 transformers: 4.57.1
active: xpu dtype: torch.bfloat16 vae: torch.bfloat16 unet: torch.bfloat16
base: Diffusers/lodestones/Chroma1-HD [ca9e916ceb] refiner: none vae: none te: none unet: none
Backend: ipex Pipeline: native Memory optimization: none Cross-attention: Scaled-Dot-Product


Config

{
  "diffusers_version": "23ebbb4bc81a17ebea17cb7cb94f301199e49a7f",
  "sd_model_checkpoint": "Diffusers/lodestones/Chroma1-HD [ca9e916ceb]",
  "sd_checkpoint_hash": null,
  "diffusers_to_gpu": true,
  "device_map": "gpu",
  "diffusers_offload_mode": "none",
  "ui_request_timeout": 300000
}


Model info

Diffusers/lodestones/Chroma1-HD [ca9e916ceb]

ModuleClassDeviceDtypeQuantParamsModulesConfig
vaeAutoencoderKLxpu:0torch.bfloat16None83819683241

FrozenDict({'in_channels': 3, 'out_channels': 3, 'down_block_types': ['DownEncoderBlock2D', 'DownEncoderBlock2D', 'DownEncoderBlock2D', 'DownEncoderBlock2D'], 'up_block_types': ['UpDecoderBlock2D', 'UpDecoderBlock2D', 'UpDecoderBlock2D', 'UpDecoderBlock2D'], 'block_out_channels': [128, 256, 512, 512], 'layers_per_block': 2, 'act_fn': 'silu', 'latent_channels': 16, 'norm_num_groups': 32, 'sample_size': 1024, 'scaling_factor': 0.3611, 'shift_factor': 0.1159, 'latents_mean': None, 'latents_std': None, 'force_upcast': True, 'use_quant_conv': False, 'use_post_quant_conv': False, 'mid_block_add_attention': True, '_class_name': 'AutoencoderKL', '_diffusers_version': '0.35.1', '_name_or_path': '/mnt/models/Diffusers/models--lodestones--Chroma1-HD/snapshots/ca9e916cebf4fa9dfca429caf2e3f724aad7094d/vae'})

text_encoderT5EncoderModelxpu:0torch.bfloat16None4762310656463

T5Config { "architectures": [ "T5EncoderModel" ], "classifier_dropout": 0.0, "d_ff": 10240, "d_kv": 64, "d_model": 4096, "decoder_start_token_id": 0, "dense_act_fn": "gelu_new", "dropout_rate": 0.1, "dtype": "bfloat16", "eos_token_id": 1, "feed_forward_proj": "gated-gelu", "initializer_factor": 1.0, "is_encoder_decoder": false, "is_gated_act": true, "layer_norm_epsilon": 1e-06, "model_type": "t5", "num_decoder_layers": 24, "num_heads": 64, "num_layers": 24, "output_past": true, "pad_token_id": 0, "relative_attention_max_distance": 128, "relative_attention_num_buckets": 32, "tie_word_embeddings": false, "transformers_version": "4.57.1", "use_cache": false, "vocab_size": 32128 }

tokenizerT5TokenizerNoneNoneNone00

None

transformerChromaTransformer2DModelxpu:0torch.bfloat16None88999834241144

FrozenDict({'patch_size': 1, 'in_channels': 64, 'out_channels': None, 'num_layers': 19, 'num_single_layers': 38, 'attention_head_dim': 128, 'num_attention_heads': 24, 'joint_attention_dim': 4096, 'axes_dims_rope': [16, 56, 56], 'approximator_num_channels': 64, 'approximator_hidden_dim': 5120, 'approximator_layers': 5, '_class_name': 'ChromaTransformer2DModel', '_diffusers_version': '0.35.1', 'guidance_embeds': False, 'pooled_projection_dim': 768, '_name_or_path': 'lodestones/Chroma1-HD'})

schedulerFlowMatchEulerDiscreteSchedulerNoneNoneNone00

FrozenDict({'num_train_timesteps': 1000, 'shift': 3.0, 'use_dynamic_shifting': False, 'base_shift': 0.5, 'max_shift': 1.15, 'base_image_seq_len': 256, 'max_image_seq_len': 4096, 'invert_sigmas': False, 'shift_terminal': None, 'use_karras_sigmas': False, 'use_exponential_sigmas': False, 'use_beta_sigmas': False, 'time_shift_type': 'exponential', 'stochastic_sampling': False, '_class_name': 'FlowMatchEulerDiscreteScheduler', '_diffusers_version': '0.35.1'})

image_encoderNoneTypeNoneNoneNone00

None

feature_extractorNoneTypeNoneNoneNone00

None


  • No labels