Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Info


Info

DRAFT ARTICLE


https://huggingface.co/tencent/SRPO

Test 0 - Different seed variations

Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling

...

CFG6, STEP 32Seed: 1620085323Seed:1931701040Seed:4075624134Seed:2736029172
bookshop girl

hand and face

legs and shoes

Test 1 - Bookshop

Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling


48162032

CFG1






CFG2






CFG3






CFG4






CFG5






CFG6






CFG8







Test 2 - Face and hand

Prompt: Create a close-up photograph of a woman's face and hand, with her hand raised to her chin. She is wearing a white blazer and has a gold ring on her finger. Her nails are neatly manicured and her hair is pulled back into a low bun. She is smiling and has a radiant expression on her face. The background is a plain light gray color. The overall mood of the photo is elegant and sophisticated. The photo should have a soft, natural light and a slight warmth to it. The woman's hair is dark brown and pulled back into a low bun, with a few loose strands framing her face.

...


8162032

CFG1





CFG2





CFG3





CFG3.5





CFG4





CFG5





CFG8





Test 3 - Legs

Prompt: Generate a photo of a woman's legs, with her feet crossed and wearing white high-heeled shoes with ribbons tied around her ankles. The shoes should have a pointed toe and a stiletto heel. The woman's legs should be smooth and tanned, with a slight sheen to them. The background should be a light gray color. The photo should be taken from a low angle, looking up at the woman's legs. The ribbons should be tied in a bow shape around the ankles. The shoes should have a red sole. The woman's legs should be slightly bent at the knee.

...


8162032

CFG1





CFG2





CFG3





CFG3.5





CFG4





CFG5





CFG8





Test 4 CivitAi profile cover generation

1600x400


Prompt: image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "Flux.1 Krea"

...

Parameters: Steps: 20| Size: 1600x400| Seed: 1620085323| CFG scale: 6| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev

Test 5 CFG 6 vs CFG 3.5

CFG3.5

CFG 6

System info


Code Block
Mon Sep 22 09:38:44 2025
app: sdnext.git updated: 2025-09-21 hash: 82504e96 url: https://github.com/vladmandic/sdnext.git/tree/dev
arch: x86_64 cpu: x86_64 system: Linux release: 6.14.0-29-generic python: 3.12.3

Torch: 2.7.1+xpu device: Intel(R) Arc(TM) Graphics (1) ipex: 2.7.10+xpu

ram: free:121.28 used:4.05 total:125.33
gpu: free:81.62 used:35.75 total:117.37
gpu-active: current:31.63 peak:33.68
gpu-allocated: current:31.63 peak:33.68
gpu-reserved: current:35.75 peak:35.75
gpu-inactive: current:0.13 peak:0.74
events: retries:0 oom:0
utilization: 0

xformers: diffusers: 0.36.0.dev0 transformers: 4.56.1
active: xpu dtype: torch.bfloat16 vae: torch.bfloat16 unet: torch.bfloat16
base: vladmandic/flux.1-dev-SRPO refiner: none vae: none te: none unet: none
Backend: ipex Cross-attention: Scaled-Dot-Product


Config

Code Block
{
  "theme_type": "Standard",
  "diffusers_version": "efb7a299af46d739dec6a57a5d2814165fba24b5",
  "diffusers_offload_mode": "none",
  "ui_request_timeout": 300000,
  "huggingface_token": "hf_..FraU",
  "samples_filename_pattern": "[date]-[seq]-[model_name]-[height]x[width]-Seed[seed]-CFG[cfg]-AG[pag]-STEP[steps]",
  "sd_model_checkpoint": "vladmandic/flux.1-dev-SRPO",
  "sd_checkpoint_hash": null,
  "hf_transfer_mode": "xet",
  "extra_networks_sort": "sort: no cards",
  "schedulers_solver_order": 1,
  "schedulers_sigma": "flowmatch",
  "schedulers_prediction_type": "flow_prediction",
  "schedulers_timestep_spacing": "leading",
  "schedulers_beta_schedule": "scaled",
  "cuda_compile_backend": "none",
  "diffusers_to_gpu": true,
  "device_map": "gpu",
  "queue_paused": true
}


Model info


ModuleClassDeviceDTypeParamsModulesConfig
vaeAutoencoderKLxpu:0torch.bfloat16None83819683241

FrozenDict({'in_channels': 3, 'out_channels': 3, 'down_block_types': ['DownEncoderBlock2D', 'DownEncoderBlock2D', 'DownEncoderBlock2D', 'DownEncoderBlock2D'], 'up_block_types': ['UpDecoderBlock2D', 'UpDecoderBlock2D', 'UpDecoderBlock2D', 'UpDecoderBlock2D'], 'block_out_channels': [128, 256, 512, 512], 'layers_per_block': 2, 'act_fn': 'silu', 'latent_channels': 16, 'norm_num_groups': 32, 'sample_size': 1024, 'scaling_factor': 0.3611, 'shift_factor': 0.1159, 'latents_mean': None, 'latents_std': None, 'force_upcast': True, 'use_quant_conv': False, 'use_post_quant_conv': False, 'mid_block_add_attention': True, '_class_name': 'AutoencoderKL', '_diffusers_version': '0.36.0.dev0', '_name_or_path': '/mnt/models/Diffusers/models--vladmandic--flux.1-dev-SRPO/snapshots/690eab4805ea624e20d1abbe4318afba7cccd6b1/vae'})

text_encoderCLIPTextModelxpu:0torch.bfloat16None123060480152

CLIPTextConfig { "architectures": [ "CLIPTextModel" ], "attention_dropout": 0.0, "bos_token_id": 0, "dropout": 0.0, "dtype": "bfloat16", "eos_token_id": 2, "hidden_act": "quick_gelu", "hidden_size": 768, "initializer_factor": 1.0, "initializer_range": 0.02, "intermediate_size": 3072, "layer_norm_eps": 1e-05, "max_position_embeddings": 77, "model_type": "clip_text_model", "num_attention_heads": 12, "num_hidden_layers": 12, "pad_token_id": 1, "projection_dim": 768, "transformers_version": "4.56.1", "vocab_size": 49408 }

text_encoder_2T5EncoderModelxpu:0torch.bfloat16None4762310656463

T5Config { "architectures": [ "T5EncoderModel" ], "classifier_dropout": 0.0, "d_ff": 10240, "d_kv": 64, "d_model": 4096, "decoder_start_token_id": 0, "dense_act_fn": "gelu_new", "dropout_rate": 0.1, "dtype": "bfloat16", "eos_token_id": 1, "feed_forward_proj": "gated-gelu", "initializer_factor": 1.0, "is_encoder_decoder": false, "is_gated_act": true, "layer_norm_epsilon": 1e-06, "model_type": "t5", "num_decoder_layers": 24, "num_heads": 64, "num_layers": 24, "output_past": true, "pad_token_id": 0, "relative_attention_max_distance": 128, "relative_attention_num_buckets": 32, "tie_word_embeddings": false, "transformers_version": "4.56.1", "use_cache": false, "vocab_size": 32128 }

tokenizerCLIPTokenizerNoneNoneNone00

None

tokenizer_2T5TokenizerFastNoneNoneNone00

None

transformerFluxTransformer2DModelxpu:0torch.bfloat16None119014083201279

FrozenDict({'patch_size': 1, 'in_channels': 64, 'out_channels': None, 'num_layers': 19, 'num_single_layers': 38, 'attention_head_dim': 128, 'num_attention_heads': 24, 'joint_attention_dim': 4096, 'pooled_projection_dim': 768, 'guidance_embeds': True, 'axes_dims_rope': [16, 56, 56], '_class_name': 'FluxTransformer2DModel', '_diffusers_version': '0.36.0.dev0', '_name_or_path': 'vladmandic/flux.1-dev-SRPO'})

schedulerFlowMatchEulerDiscreteSchedulerNoneNoneNone00

FrozenDict({'num_train_timesteps': 1000, 'shift': 3.0, 'use_dynamic_shifting': True, 'base_shift': 0.5, 'max_shift': 1.15, 'base_image_seq_len': 256, 'max_image_seq_len': 4096, 'invert_sigmas': False, 'shift_terminal': None, 'use_karras_sigmas': False, 'use_exponential_sigmas': False, 'use_beta_sigmas': False, 'time_shift_type': 'exponential', 'stochastic_sampling': False, '_class_name': 'FlowMatchEulerDiscreteScheduler', '_diffusers_version': '0.36.0.dev0'})

image_encoderNoneTypeNoneNoneNone00

None

feature_extractorNoneTypeNoneNoneNone00

None

...