You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 4 Next »

Info


DRAFT ARTICLE


https://huggingface.co/tencent/SRPO

Test 0 - Different seed variations

Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling

Parameters: Steps: 50| Size: 1024x1024| Seed: 1620085323| CFG scale: 6| App: SD.Next| Version: 82504e9| Pipeline: FluxPipeline| Operations: txt2img| Model: flux.1-dev-SRPO

Time: 27m 11.72s | total 1246.29 pipeline 1224.28 preview 7.57 decode 6.36 callback 3.69 prompt 2.02 te 2.01 gc 0.33 | GPU 34676 MB 29% | RAM 4.04 GB 3%


CFG6, STEP 32Seed: 1620085323Seed:1931701040Seed:4075624134Seed:2736029172
bookshop girl



hand and face



legs and shoes



Test 1 - Bookshop

Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling


48162032

CFG1






CFG2






CFG3






CFG4






CFG5






CFG6






CFG8







Test 2 - Face and hand

Prompt: Create a close-up photograph of a woman's face and hand, with her hand raised to her chin. She is wearing a white blazer and has a gold ring on her finger. Her nails are neatly manicured and her hair is pulled back into a low bun. She is smiling and has a radiant expression on her face. The background is a plain light gray color. The overall mood of the photo is elegant and sophisticated. The photo should have a soft, natural light and a slight warmth to it. The woman's hair is dark brown and pulled back into a low bun, with a few loose strands framing her face.


Parameters: Steps: 8| Size: 1024x1024| Seed: 1620085323| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev

Time: 3m 16.71s | total 269.85 pipeline 190.50 preview 69.13 decode 6.19 prompt 1.88 te 1.86 gc 0.26 | GPU 34674 MB 27% | RAM 2.93 GB 2%


Parameters: Steps: 20| Size: 1024x1024| Seed: 1620085323| CFG scale: 5| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev

Time: 8m 9.21s | total 608.23 pipeline 482.79 preview 114.79 decode 6.39 prompt 1.98 te 1.96 gc 0.26 | GPU 34672 MB 27% | RAM 2.96 GB 2%


Parameters: Steps: 32| Size: 1024x1024| Seed: 1620085323| CFG scale: 6| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev

Time: 12m 36.06s | total 1492.01 pipeline 749.41 preview 727.33 decode 6.62 prompt 4.19 te 4.17 | GPU 35284 MB 27% | RAM 3.03 GB 2%


8162032

CFG1





CFG2





CFG3





CFG3.5





CFG4





CFG5





CFG8





Test 3 - Legs

Prompt: Generate a photo of a woman's legs, with her feet crossed and wearing white high-heeled shoes with ribbons tied around her ankles. The shoes should have a pointed toe and a stiletto heel. The woman's legs should be smooth and tanned, with a slight sheen to them. The background should be a light gray color. The photo should be taken from a low angle, looking up at the woman's legs. The ribbons should be tied in a bow shape around the ankles. The shoes should have a red sole. The woman's legs should be slightly bent at the knee.



8162032

CFG1





CFG2





CFG3





CFG3.5





CFG4





CFG5





CFG8





Test 4 CivitAi profile cover generation

1600x400


Prompt: image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "Flux.1 Krea"

Parameters: Steps: 32| Size: 1600x400| Seed: 433989387| CFG scale: 3.5| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev


Prompt: image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "Flux.1 dev"

Parameters: Steps: 32| Size: 1600x400| Seed: 4014230639| CFG scale: 8| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev


Prompt: with analog noise and glitches, CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand.

Parameters: Steps: 32| Size: 1600x400| Seed: 2404111820| CFG scale: 3.5| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev


Prompt: with analog noise and glitches, CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "SD.Next + Flux.1 Krea dev", upper right corner text "Stable Diffusion at home"

Parameters: Steps: 32| Size: 1600x400| Seed: 109351024| CFG scale: 3.5| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev


Prompt: 80s cartoon style, CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is suprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "SD.Next + Flux.1 dev", uper right corner text "Stable Diffusion at home"

Parameters: Steps: 20| Size: 1600x400| Seed: 1620085323| CFG scale: 6| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev


Prompt: 3d render style, CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is suprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "SD.Next + Flux.1 dev", uper right corner text "Stable Diffusion at home"

Parameters: Steps: 20| Size: 1600x400| Seed: 1620085323| CFG scale: 6| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev


Prompt: Retro poster style, CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is suprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "SD.Next + Flux.1 dev", uper right corner text "Stable Diffusion at home"

Parameters: Steps: 20| Size: 1600x400| Seed: 1620085323| CFG scale: 6| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev


Prompt: LEGO style, CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "SD.Next + Flux.1 dev", upper right corner text "Stable Diffusion at home"

Parameters: Steps: 20| Size: 1600x400| Seed: 1620085323| CFG scale: 6| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev


Prompt: CivitAi cover image with 5 canvases and one android robot painting on the 3rd canvas. first two is done with some futuristic images, last two is blank. robot is surprised look back at the camera. paint brush is in the robot hand. Left bottom corner text "SD.Next + Flux.1 dev", upper right corner text "Stable Diffusion at home"

Parameters: Steps: 20| Size: 1600x400| Seed: 1620085323| CFG scale: 6| App: SD.Next| Version: e90ac68| Pipeline: FluxPipeline| Operations: txt2img| Model: FLUX.1-dev

Test 5 CFG 6 vs CFG 3.5

CFG3.5

CFG 6


System info


Mon Sep 22 09:38:44 2025
app: sdnext.git updated: 2025-09-21 hash: 82504e96 url: https://github.com/vladmandic/sdnext.git/tree/dev
arch: x86_64 cpu: x86_64 system: Linux release: 6.14.0-29-generic python: 3.12.3

Torch: 2.7.1+xpu device: Intel(R) Arc(TM) Graphics (1) ipex: 2.7.10+xpu

ram: free:121.28 used:4.05 total:125.33
gpu: free:81.62 used:35.75 total:117.37
gpu-active: current:31.63 peak:33.68
gpu-allocated: current:31.63 peak:33.68
gpu-reserved: current:35.75 peak:35.75
gpu-inactive: current:0.13 peak:0.74
events: retries:0 oom:0
utilization: 0

xformers: diffusers: 0.36.0.dev0 transformers: 4.56.1
active: xpu dtype: torch.bfloat16 vae: torch.bfloat16 unet: torch.bfloat16
base: vladmandic/flux.1-dev-SRPO refiner: none vae: none te: none unet: none
Backend: ipex Cross-attention: Scaled-Dot-Product


Config

{
  "theme_type": "Standard",
  "diffusers_version": "efb7a299af46d739dec6a57a5d2814165fba24b5",
  "diffusers_offload_mode": "none",
  "ui_request_timeout": 300000,
  "huggingface_token": "hf_..FraU",
  "samples_filename_pattern": "[date]-[seq]-[model_name]-[height]x[width]-Seed[seed]-CFG[cfg]-AG[pag]-STEP[steps]",
  "sd_model_checkpoint": "vladmandic/flux.1-dev-SRPO",
  "sd_checkpoint_hash": null,
  "hf_transfer_mode": "xet",
  "extra_networks_sort": "sort: no cards",
  "schedulers_solver_order": 1,
  "schedulers_sigma": "flowmatch",
  "schedulers_prediction_type": "flow_prediction",
  "schedulers_timestep_spacing": "leading",
  "schedulers_beta_schedule": "scaled",
  "cuda_compile_backend": "none",
  "diffusers_to_gpu": true,
  "device_map": "gpu",
  "queue_paused": true
}


Model info


ModuleClassDeviceDTypeParamsModulesConfig
vaeAutoencoderKLxpu:0torch.bfloat16None83819683241

FrozenDict({'in_channels': 3, 'out_channels': 3, 'down_block_types': ['DownEncoderBlock2D', 'DownEncoderBlock2D', 'DownEncoderBlock2D', 'DownEncoderBlock2D'], 'up_block_types': ['UpDecoderBlock2D', 'UpDecoderBlock2D', 'UpDecoderBlock2D', 'UpDecoderBlock2D'], 'block_out_channels': [128, 256, 512, 512], 'layers_per_block': 2, 'act_fn': 'silu', 'latent_channels': 16, 'norm_num_groups': 32, 'sample_size': 1024, 'scaling_factor': 0.3611, 'shift_factor': 0.1159, 'latents_mean': None, 'latents_std': None, 'force_upcast': True, 'use_quant_conv': False, 'use_post_quant_conv': False, 'mid_block_add_attention': True, '_class_name': 'AutoencoderKL', '_diffusers_version': '0.36.0.dev0', '_name_or_path': '/mnt/models/Diffusers/models--vladmandic--flux.1-dev-SRPO/snapshots/690eab4805ea624e20d1abbe4318afba7cccd6b1/vae'})

text_encoderCLIPTextModelxpu:0torch.bfloat16None123060480152

CLIPTextConfig { "architectures": [ "CLIPTextModel" ], "attention_dropout": 0.0, "bos_token_id": 0, "dropout": 0.0, "dtype": "bfloat16", "eos_token_id": 2, "hidden_act": "quick_gelu", "hidden_size": 768, "initializer_factor": 1.0, "initializer_range": 0.02, "intermediate_size": 3072, "layer_norm_eps": 1e-05, "max_position_embeddings": 77, "model_type": "clip_text_model", "num_attention_heads": 12, "num_hidden_layers": 12, "pad_token_id": 1, "projection_dim": 768, "transformers_version": "4.56.1", "vocab_size": 49408 }

text_encoder_2T5EncoderModelxpu:0torch.bfloat16None4762310656463

T5Config { "architectures": [ "T5EncoderModel" ], "classifier_dropout": 0.0, "d_ff": 10240, "d_kv": 64, "d_model": 4096, "decoder_start_token_id": 0, "dense_act_fn": "gelu_new", "dropout_rate": 0.1, "dtype": "bfloat16", "eos_token_id": 1, "feed_forward_proj": "gated-gelu", "initializer_factor": 1.0, "is_encoder_decoder": false, "is_gated_act": true, "layer_norm_epsilon": 1e-06, "model_type": "t5", "num_decoder_layers": 24, "num_heads": 64, "num_layers": 24, "output_past": true, "pad_token_id": 0, "relative_attention_max_distance": 128, "relative_attention_num_buckets": 32, "tie_word_embeddings": false, "transformers_version": "4.56.1", "use_cache": false, "vocab_size": 32128 }

tokenizerCLIPTokenizerNoneNoneNone00

None

tokenizer_2T5TokenizerFastNoneNoneNone00

None

transformerFluxTransformer2DModelxpu:0torch.bfloat16None119014083201279

FrozenDict({'patch_size': 1, 'in_channels': 64, 'out_channels': None, 'num_layers': 19, 'num_single_layers': 38, 'attention_head_dim': 128, 'num_attention_heads': 24, 'joint_attention_dim': 4096, 'pooled_projection_dim': 768, 'guidance_embeds': True, 'axes_dims_rope': [16, 56, 56], '_class_name': 'FluxTransformer2DModel', '_diffusers_version': '0.36.0.dev0', '_name_or_path': 'vladmandic/flux.1-dev-SRPO'})

schedulerFlowMatchEulerDiscreteSchedulerNoneNoneNone00

FrozenDict({'num_train_timesteps': 1000, 'shift': 3.0, 'use_dynamic_shifting': True, 'base_shift': 0.5, 'max_shift': 1.15, 'base_image_seq_len': 256, 'max_image_seq_len': 4096, 'invert_sigmas': False, 'shift_terminal': None, 'use_karras_sigmas': False, 'use_exponential_sigmas': False, 'use_beta_sigmas': False, 'time_shift_type': 'exponential', 'stochastic_sampling': False, '_class_name': 'FlowMatchEulerDiscreteScheduler', '_diffusers_version': '0.36.0.dev0'})

image_encoderNoneTypeNoneNoneNone00

None

feature_extractorNoneTypeNoneNoneNone00

None


  • No labels