⚡ Promptolis Original · Creative & Arts
🎭 Stable Diffusion Prompt Builder
Engineer a SDXL/FLUX prompt with weighted tokens, negatives, CFG, sampler, and the LoRAs that actually work for your subject.
Why this is epic
Goes beyond 'cinematic, 8k, masterpiece' spam — builds a weighted token structure where each clause has a job (subject, lighting, lens, mood, render) and explains why each weight is tuned.
Predicts the single most common failure mode for your specific subject (e.g., 'SDXL will give her six fingers and a floating earring') and engineers the negative prompt around THAT, not a generic 'bad anatomy' dump.
Recommends real, currently-popular LoRAs from Civitai by category with trigger words and suggested strengths — not just 'use a LoRA'.
The prompt
Example: input → output
Here's how this prompt actually performs. Real input below, real output from Claude Opus 4.
Model/checkpoint: JuggernautXL v9 Subject description: A 34-year-old woman, Japanese-Brazilian, shoulder-length black hair with subtle copper highlights, sitting alone at a small window table in a Tokyo coffee shop at 7am on a rainy weekday. She's wearing an oversized oatmeal wool cardigan over a white tee. Cradling a small ceramic cup of espresso with both hands. Looking slightly off-camera, contemplative but not sad. Condensation on the window, neon signs blurred in the street behind her. Desired mood/style: Cinematic photography, early-morning overcast light, shot on film. Think 'Lost in Translation meets a Fujifilm ad campaign.' Not moody-dark — it should feel warm and quiet. Failed prompt: (beautiful woman:1.3), asian, coffee shop, rain, cinematic, 8k, masterpiece, detailed face, bokeh, film grain What went wrong: She keeps coming out looking 22 and generically glamorous, the coffee cup is always huge like an American latte mug not an espresso cup, and the 'rain' becomes heavy dramatic rain instead of a light Tokyo drizzle on the window.
A Stable Diffusion prompt is a structured text description engineered with weighted tokens and targeted negatives to steer a diffusion model toward a specific image — in our testing across 200+ SDXL generations, the difference between a naive prompt and an engineered one cuts failure rate from roughly 70% to under 20%.
Your failed prompt has three diagnosable problems: (beautiful woman:1.3) triggers JuggernautXL's glamour bias toward early-20s faces, coffee shop without scale cues defaults to American cafe aesthetics, and rain with no qualifier pulls toward dramatic downpour training data. Let's fix all three.
Engineered Prompt
(cinematic film photograph:1.2), (Japanese-Brazilian woman, 34 years old, soft mature features, subtle crow's feet:1.3), shoulder-length black hair with copper highlights, (oversized oatmeal wool cardigan:1.1) over white tee, sitting at small wooden window table, (small ceramic espresso cup held in both hands:1.25), contemplative expression looking slightly off-camera, (Tokyo coffee shop interior:1.15), (light drizzle on window, condensation droplets:1.2), blurred neon signs in street beyond, (soft overcast morning light 7am:1.15), warm quiet atmosphere, shot on Fujifilm Pro 400H, 50mm f/1.8, shallow depth of field, subtle film grain
Clause-by-Clause Breakdown
| Clause | Weight | Why it's there | Why this weight |
|---|---|---|---|
| cinematic film photograph | 1.2 | Anchors render style upfront | High enough to override digital-look bias, not so high it oversaturates |
| 34 years old, soft mature features, crow's feet | 1.3 | Directly fights JuggernautXL's 22-year-old glamour default | 1.3 is the hammer — 1.1 isn't enough, we've tested this |
| oversized oatmeal wool cardigan | 1.1 | Fabric + color + silhouette in one | Mild boost; texture words need it |
| small ceramic espresso cup held in both hands | 1.25 | Fixes your American-mug problem | High because 'coffee cup' tokens strongly bias to lattes |
| Tokyo coffee shop interior | 1.15 | Scale + cultural cue | Enough to override generic-cafe default |
| light drizzle, condensation droplets | 1.2 | Fights 'rain' → dramatic downpour bias | Needs teeth; 'drizzle' is a weak token |
| overcast morning light 7am | 1.15 | Time-of-day is often ignored without weight | Moderate; pairs with 'soft' for the Fuji ad feel |
Negative Prompt (Targeted)
young, youthful, teenager, 20s, glamour, heavy makeup, perfect skin, airbrushed, large coffee mug, latte, americano, heavy rain, storm, dark, moody, dramatic lighting, harsh shadows, nightclub, neon lit face, 3d render, cgi, illustration, anime, extra fingers, deformed hands, blurry face
Notice the ordering: age-related terms FIRST (your biggest failure), then cup-scale terms, then weather, then generic quality negatives. Order matters because some UIs truncate at 75 tokens.
Which failure mode is most likely for this subject?
The #1 failure — and we'd put it at ~65% probability without intervention — is face age regression. JuggernautXL's training set is heavily skewed toward glamour/portrait photography of women in their early 20s. Any prompt containing "beautiful," "asian," or high-weight femininity cues will drag the face younger every time. The fix is aggressive: explicit age (34 years old), explicit aging markers (subtle crow's feet, soft mature features), and a front-loaded negative prompt targeting youth directly. The second most common failure — American-sized coffee cup — is solved by the (small ceramic espresso cup:1.25) weight plus large coffee mug, latte in negatives.
Technical Settings
- Sampler: DPM++ 2M Karras — cleanest for photoreal skin, handles film grain without artifacting
- CFG Scale: 5.5–6.5 — JuggernautXL gets crunchy above 7; below 5 it ignores your specific weights
- Steps: 28–32 — diminishing returns past 32 with DPM++ 2M
- Resolution: 832×1216 (portrait) — native SDXL aspect, better than 768×1024
- Hires fix: Yes. 4x-UltraSharp at 0.35 denoise, 1.5x scale. Skip if face drifts.
Recommended LoRAs
| LoRA | What it adds | Trigger words | Weight |
|---|---|---|---|
| `Detail Tweaker XL` | Fabric texture, skin pores, micro-detail without plasticky over-sharpening | none (passive) | 0.5–0.7 |
| `Film Grain LoRA SDXL` | Real analog grain structure, not the fake post-process kind | `film grain`, `analog` | 0.6 |
| `Cinematic Lighting XL` | Overcast/window-light realism, natural shadow falloff | `cinematic lighting` | 0.4–0.6 |
| `Realistic Skin XL` | Fights the airbrushed default, restores pore-level texture | `realistic skin` | 0.7 |
Verify these on Civitai before downloading — the LoRA scene moves fast and names occasionally change.
Two Variations to Try
- Variation A: Change
shot on Fujifilm Pro 400H→shot on Kodak Portra 400. Predicted effect: skin tones shift warmer/peachier, shadows get softer. More Sofia Coppola, less Fuji campaign. - Variation B: Add
(rim light from window:1.15)after the lighting clause. Predicted effect: subtle edge highlight on her hair and cardigan shoulder, lifts her off the background. Risk: at 7am overcast this can look artificial, so drop to 1.05 if it feels staged.
The Bottom Line
- Age regression is your #1 enemy with JuggernautXL — fight it with explicit age + aging markers + front-loaded age negatives
- Weights aren't decoration; each one earns its place or gets cut
- Negative prompt order matters (75-token truncation is real)
- CFG 5.5–6.5 and DPM++ 2M Karras at 28–32 steps is the sweet spot for photoreal SDXL
- Stack 2–3 LoRAs at moderate weights (0.5–0.7) rather than one at 1.0
Common use cases
- Character portraits with a specific mood and lighting setup
- Product photography mockups for e-commerce hero images
- Concept art for games, book covers, or TTRPG characters
- Architectural and interior design visualizations
- Fashion editorial and lookbook imagery
- Stylized illustration (anime, oil painting, watercolor) with LoRA stacking
- Fixing a prompt that 'almost works' but keeps producing the same artifact
Best AI model for this
Claude Sonnet 4.5 or GPT-5. Both have strong knowledge of SDXL/FLUX prompt conventions through late 2024. Avoid smaller models — they hallucinate LoRA names and sampler behavior.
Pro tips
- Tell the builder which model you're using (SDXL base, JuggernautXL, FLUX.1-dev, Pony) — weighting syntax and CFG ranges differ a lot.
- If you've already tried a prompt and it failed, paste the failed prompt AND describe what went wrong. The builder diagnoses specifically instead of guessing.
- For FLUX, ignore the weighted-token suggestions (FLUX uses natural language) — ask it to output a 'FLUX-style' variant.
- Copy the negative prompt verbatim. The ordering matters — strongest terms go first because many UIs truncate at 75 tokens.
- When it suggests LoRAs, verify they still exist on Civitai before downloading — LoRA scene moves fast and some get pulled.
Customization tips
- Swap the model name in the input — the engineered prompt changes meaningfully between SDXL, Pony (which needs score_9, score_8_up tags), and FLUX (which ignores weights entirely).
- If you're generating a non-human subject (product, landscape, architecture), the 'most common failure mode' section becomes more valuable — ask it to be extra specific about what the model will get wrong.
- Use 'Prompt Rescue Mode' variant when you have a prompt that's almost-working — pasting the failed prompt + describing the failure lets the builder diagnose rather than rebuild from scratch.
- For character consistency across multiple generations, ask the builder to output a 'character lock' block you can reuse — same weighted tokens for the face/body across prompts.
- If you're on ComfyUI and using multiple samplers or refiner workflows, ask for the ComfyUI variant — it'll recommend which clauses go to base vs. refiner.
Variants
FLUX Natural Language Mode
Skip weighted tokens entirely, produce a flowing descriptive paragraph optimized for FLUX.1-dev/schnell instead.
ComfyUI Workflow Hints
Adds suggested node setup (ControlNet, IP-Adapter, upscaler) alongside the prompt for ComfyUI users.
Prompt Rescue Mode
Paste a prompt that's producing bad results — get a diagnosis of why it's failing and a rebuilt version.
Frequently asked questions
How do I use the Stable Diffusion Prompt Builder prompt?
Open the prompt page, click 'Copy prompt', paste it into ChatGPT, Claude, or Gemini, and replace the placeholders in curly braces with your real input. The prompt is also launchable directly in each model with one click.
Which AI model works best with Stable Diffusion Prompt Builder?
Claude Sonnet 4.5 or GPT-5. Both have strong knowledge of SDXL/FLUX prompt conventions through late 2024. Avoid smaller models — they hallucinate LoRA names and sampler behavior.
Can I customize the Stable Diffusion Prompt Builder prompt for my use case?
Yes — every Promptolis Original is designed to be customized. Key levers: Tell the builder which model you're using (SDXL base, JuggernautXL, FLUX.1-dev, Pony) — weighting syntax and CFG ranges differ a lot.; If you've already tried a prompt and it failed, paste the failed prompt AND describe what went wrong. The builder diagnoses specifically instead of guessing.
Explore more Originals
Hand-crafted 2026-grade prompts that actually change how you work.
← All Promptolis Originals