Previous test was Test 21 - shuttle-3.1-aesthetic - steps and samplers
Info
https://huggingface.co/shuttleai/shuttle-jaguar
https://huggingface.co/shuttleai/shuttle-3-diffusion
https://huggingface.co/shuttleai/shuttle-3.1-aesthetic
height=1024,
width=1024,
guidance_scale=3.5,
num_inference_steps=4,Part 0 - cover and seeds
steps: 4, CFG: any, 1024px
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |
|---|---|---|---|---|---|---|---|---|---|---|
| jaguar | ||||||||||
| shuttle 3 | ||||||||||
| shuttle 3.1 |
Part 1 - Bookshop
Part 1.1. Seeds
Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling
Parameters: Steps: 4| Size: 1024x1024| Seed: 1620085323| CFG scale: 3.5| App: SD.Next| Version: 058613e| Pipeline: FluxPipeline| Operations: txt2img| Model: shuttle-jaguar
285H Time: 43.00s | total 46.60 pipeline 40.25 decode 2.70 preview 1.91 prompt 0.89 gc 0.62 | GPU 35750 MB 28% | RAM 75.0 GB 61%
Prompt: photorealistic girl in bookshop choosing the book in romantic stories shelf. smiling
Parameters: Steps: 4| Size: 1024x1024| Seed: 1620085323| CFG scale: 3.5| App: SD.Next| Version: 058613e| Pipeline: FluxPipeline| Operations: txt2img| Model: shuttle-3-diffusion
285H Time: 42.07s | total 46.10 pipeline 38.99 decode 3.03 preview 1.52 prompt 0.91 te 0.90 gc 0.51 | GPU 35750 MB 28% | RAM 72.26 GB 59%
| STEP 4 | Seed: 1620085323 | Seed:1931701040 | Seed:4075624134 | Seed:2736029172 |
|---|---|---|---|---|
| jaguar | ||||
| shuttle 3 | ||||
| shuttle 3.1 |
Test 1.2 - steps and schedulers
| Euler FlowMatch | Shuttle Jaguar | Shuttle 3 Diffusion | Shuttle 3.1 aesthetic |
|---|---|---|---|
| step 4 | |||
| step 8 | |||
| step 16 | |||
| DPM2 FlowMatch | Shuttle Jaguar | Shuttle 3 Diffusion | Shuttle 3.1 aesthetic |
| step 4 | |||
| step 8 | |||
| step 16 | |||
| DPM2++ SDE FlowMatch | Shuttle Jaguar | Shuttle 3 Diffusion | Shuttle 3.1 aesthetic |
| step 4 | |||
| step 8 | |||
| step 16 |
Part 2 - Face and hand
Prompt: Create a close-up photograph of a woman's face and hand, with her hand raised to her chin. She is wearing a white blazer and has a gold ring on her finger. Her nails are neatly manicured and her hair is pulled back into a low bun. She is smiling and has a radiant expression on her face. The background is a plain light gray color. The overall mood of the photo is elegant and sophisticated. The photo should have a soft, natural light and a slight warmth to it. The woman's hair is dark brown and pulled back into a low bun, with a few loose strands framing her face.
| Euler FlowMatch | Shuttle Jaguar | Shuttle 3 Diffusion | Shuttle 3.1 aesthetic |
|---|---|---|---|
| step 4 | |||
| step 8 | |||
| step 16 | |||
| Euler FlowMatch | Shuttle Jaguar | Shuttle 3 Diffusion | Shuttle 3.1 aesthetic |
| step 4 | |||
| step 8 | |||
| step 16 | |||
| Euler FlowMatch | Shuttle Jaguar | Shuttle 3 Diffusion | Shuttle 3.1 aesthetic |
| step 4 | |||
| step 8 | |||
| step 16 |
Part 3 - Legs and ribbon
Prompt: Generate a photo of a woman's legs, with her feet crossed and wearing white high-heeled shoes with ribbons tied around her ankles. The shoes should have a pointed toe and a stiletto heel. The woman's legs should be smooth and tanned, with a slight sheen to them. The background should be a light gray color. The photo should be taken from a low angle, looking up at the woman's legs. The ribbons should be tied in a bow shape around the ankles. The shoes should have a red sole. The woman's legs should be slightly bent at the knee.
| Euler FlowMatch | Shuttle Jaguar | Shuttle 3 Diffusion | Shuttle 3.1 aesthetic |
|---|---|---|---|
| step 4 | |||
| step 8 | |||
| step 16 | |||
| Euler FlowMatch | Shuttle Jaguar | Shuttle 3 Diffusion | Shuttle 3.1 aesthetic |
| step 4 | |||
| step 8 | |||
| step 16 | |||
| Euler FlowMatch | Shuttle Jaguar | Shuttle 3 Diffusion | Shuttle 3.1 aesthetic |
| step 4 | |||
| step 8 | |||
| step 16 |
System Info
Tue Dec 16 08:43:27 2025 app: sdnext.git updated: 2025-12-15 hash: 058613e49 url: https://github.com/liutyi/sdnext/tree/pytorch arch: x86_64 cpu: x86_64 system: Linux release: 6.17.0-8-generic python: 3.12.3 PyTorch 2.9.1+xpu device: Intel(R) Arc(TM) Graphics (1) ipex: ram: free:86.67 used:36.4 total:123.07 xformers: diffusers: 0.36.0.dev0 transformers: 4.57.3 active: xpu dtype: torch.bfloat16 vae: torch.bfloat16 unet: torch.bfloat16 base: shuttleai/shuttle-3-diffusion refiner: None vae: Automatic te: Default unet: Default ipex native none Scaled-Dot-Product
Model Data
| Module | Class | Device | Dtype | Quant | Params | Modules | Config |
|---|---|---|---|---|---|---|---|
| vae | AutoencoderKL | xpu:0 | torch.bfloat16 | None | 83819683 | 241 | FrozenDict({'in_channels': 3, 'out_channels': 3, 'down_block_types': ['DownEncoderBlock2D', 'DownEncoderBlock2D', 'DownEncoderBlock2D', 'DownEncoderBlock2D'], 'up_block_types': ['UpDecoderBlock2D', 'UpDecoderBlock2D', 'UpDecoderBlock2D', 'UpDecoderBlock2D'], 'block_out_channels': [128, 256, 512, 512], 'layers_per_block': 2, 'act_fn': 'silu', 'latent_channels': 16, 'norm_num_groups': 32, 'sample_size': 1024, 'scaling_factor': 0.3611, 'shift_factor': 0.1159, 'latents_mean': None, 'latents_std': None, 'force_upcast': True, 'use_quant_conv': False, 'use_post_quant_conv': False, 'mid_block_add_attention': True, '_class_name': 'AutoencoderKL', '_diffusers_version': '0.30.0.dev0', '_name_or_path': '/mnt/models/Diffusers/models--shuttleai--shuttle-3.1-aesthetic/snapshots/6e1883042a221853a4b8a89cd6b563bcfe784741/vae'}) |
| text_encoder | CLIPTextModel | xpu:0 | torch.bfloat16 | None | 123060480 | 152 | CLIPTextConfig { "architectures": [ "CLIPTextModel" ], "attention_dropout": 0.0, "bos_token_id": 0, "dropout": 0.0, "dtype": "bfloat16", "eos_token_id": 2, "hidden_act": "quick_gelu", "hidden_size": 768, "initializer_factor": 1.0, "initializer_range": 0.02, "intermediate_size": 3072, "layer_norm_eps": 1e-05, "max_position_embeddings": 77, "model_type": "clip_text_model", "num_attention_heads": 12, "num_hidden_layers": 12, "pad_token_id": 1, "projection_dim": 768, "transformers_version": "4.57.3", "vocab_size": 49408 } |
| text_encoder_2 | T5EncoderModel | cpu | torch.bfloat16 | None | 4762310656 | 463 | T5Config { "architectures": [ "T5EncoderModel" ], "classifier_dropout": 0.0, "d_ff": 10240, "d_kv": 64, "d_model": 4096, "decoder_start_token_id": 0, "dense_act_fn": "gelu_new", "dropout_rate": 0.1, "dtype": "bfloat16", "eos_token_id": 1, "feed_forward_proj": "gated-gelu", "initializer_factor": 1.0, "is_encoder_decoder": false, "is_gated_act": true, "layer_norm_epsilon": 1e-06, "model_type": "t5", "num_decoder_layers": 24, "num_heads": 64, "num_layers": 24, "output_past": true, "pad_token_id": 0, "relative_attention_max_distance": 128, "relative_attention_num_buckets": 32, "tie_word_embeddings": false, "transformers_version": "4.57.3", "use_cache": false, "vocab_size": 32128 } |
| tokenizer | CLIPTokenizer | None | None | None | 0 | 0 | None |
| tokenizer_2 | T5TokenizerFast | None | None | None | 0 | 0 | None |
| transformer | FluxTransformer2DModel | xpu:0 | torch.bfloat16 | None | 11891178560 | 1275 | FrozenDict({'patch_size': 1, 'in_channels': 64, 'out_channels': None, 'num_layers': 19, 'num_single_layers': 38, 'attention_head_dim': 128, 'num_attention_heads': 24, 'joint_attention_dim': 4096, 'pooled_projection_dim': 768, 'guidance_embeds': False, 'axes_dims_rope': [16, 56, 56], '_use_default_values': ['out_channels'], '_class_name': 'FluxTransformer2DModel', '_diffusers_version': '0.30.3', '_name_or_path': '/mnt/models/Diffusers/models--shuttleai--shuttle-3.1-aesthetic/snapshots/6e1883042a221853a4b8a89cd6b563bcfe784741/transformer'}) |
| scheduler | FlowMatchEulerDiscreteScheduler | None | None | None | 0 | 0 | FrozenDict({'num_train_timesteps': 1000, 'shift': 1.0, 'use_dynamic_shifting': False, 'base_shift': 0.5, 'max_shift': 1.15, 'base_image_seq_len': 256, 'max_image_seq_len': 4096, 'invert_sigmas': False, 'shift_terminal': None, 'use_karras_sigmas': False, 'use_exponential_sigmas': False, 'use_beta_sigmas': False, 'time_shift_type': 'exponential', 'stochastic_sampling': False, '_use_default_values': ['time_shift_type', 'invert_sigmas', 'use_beta_sigmas', 'use_karras_sigmas', 'shift_terminal', 'stochastic_sampling', 'use_exponential_sigmas'], '_class_name': 'FlowMatchEulerDiscreteScheduler', '_diffusers_version': '0.30.3'}) |
| image_encoder | NoneType | None | None | None | 0 | 0 | None |
| feature_extractor | NoneType | None | None | None | 0 | 0 | None |