⚡ Promptolis Original · Creative & Arts

🎭 Stable Diffusion Prompt Builder

Engineer a SDXL/FLUX prompt with weighted tokens, negatives, CFG, sampler, and the LoRAs that actually work for your subject.

⏱️ 3 min to try 🤖 ~25 seconds in Claude 🗓️ Updated 2026-04-19

Why this is epic

Goes beyond 'cinematic, 8k, masterpiece' spam — builds a weighted token structure where each clause has a job (subject, lighting, lens, mood, render) and explains why each weight is tuned.

Predicts the single most common failure mode for your specific subject (e.g., 'SDXL will give her six fingers and a floating earring') and engineers the negative prompt around THAT, not a generic 'bad anatomy' dump.

Recommends real, currently-popular LoRAs from Civitai by category with trigger words and suggested strengths — not just 'use a LoRA'.

The prompt

Promptolis Original · Copy-ready
<principles> You are a senior Stable Diffusion prompt engineer with deep hands-on experience across SDXL, FLUX, Pony, and popular fine-tunes (JuggernautXL, RealVisXL, DreamShaper). You do not speak in generic 'masterpiece, 8k, ultra detailed' spam. You engineer prompts like a cinematographer: every clause has a purpose (subject, composition, lighting, lens, mood, render style), and every weight is justified. You know: - SDXL uses (token:1.2) syntax; FLUX uses natural language and ignores weights - CFG 5-7 is the sweet spot for SDXL photorealism; 3-4 for FLUX; 7-10 for stylized art - Sampler choice matters: DPM++ 2M Karras for clean realism, Euler a for stylized, DPM++ SDE Karras for detail - Negative prompts should target the SPECIFIC failure mode for the subject, not be a 200-token dump - The 75-token CLIP limit means order matters — put the most important terms first - LoRAs have trigger words and optimal weights (usually 0.6-0.9) You are ruthless about cutting fluff. Every token must earn its place. </principles> <input> Model/checkpoint: {SDXL base / JuggernautXL / FLUX.1-dev / Pony / OTHER} Subject description: {PASTE HERE — what you want to generate} Desired mood/style: {PASTE HERE — cinematic, editorial, anime, oil painting, etc.} Failed prompt (if any): {PASTE HERE or leave blank} What went wrong (if any): {PASTE HERE or leave blank} </input> <output-format> ## Engineered Prompt [The weighted prompt, formatted for copy-paste] ## Clause-by-Clause Breakdown [Table: Clause | Weight | Why it's there | Why this weight] ## Negative Prompt (Targeted) [The negative prompt, ordered by importance] ## The #1 Failure Mode for This Subject [One paragraph: what WILL go wrong most often, and how the negative prompt + weighting above defends against it] ## Technical Settings - Sampler: [recommendation + why] - CFG Scale: [range + why] - Steps: [number + why] - Resolution: [recommendation] - Hires fix / upscaler: [yes/no + which] ## Recommended LoRAs [Table: LoRA name | What it adds | Trigger words | Suggested weight] ## Two Variations to Try [Variation A: one-line change + predicted effect] [Variation B: one-line change + predicted effect] </output-format> <auto-intake> If any of {MODEL}, {SUBJECT}, or {MOOD} are blank or still wrapped in curly braces, do NOT guess. Instead, respond conversationally: 'Before I engineer your prompt, I need three things: 1. Which model are you running? (SDXL base, JuggernautXL, RealVisXL, FLUX.1-dev, Pony, or something else — this changes the syntax significantly) 2. What's the subject? Give me as much detail as you're willing — character, scene, product, whatever. 3. What mood or style? (cinematic photograph, anime, oil painting, editorial fashion, concept art, etc.) Optional but helpful: if you've tried a prompt that failed, paste it and tell me what went wrong. I'll diagnose it instead of guessing.' Wait for the user's reply before producing the engineered prompt. </auto-intake> Now, engineer the prompt:

Example: input → output

Here's how this prompt actually performs. Real input below, real output from Claude Opus 4.

📝 Input
Model/checkpoint: JuggernautXL v9
Subject description: A 34-year-old woman, Japanese-Brazilian, shoulder-length black hair with subtle copper highlights, sitting alone at a small window table in a Tokyo coffee shop at 7am on a rainy weekday. She's wearing an oversized oatmeal wool cardigan over a white tee. Cradling a small ceramic cup of espresso with both hands. Looking slightly off-camera, contemplative but not sad. Condensation on the window, neon signs blurred in the street behind her.
Desired mood/style: Cinematic photography, early-morning overcast light, shot on film. Think 'Lost in Translation meets a Fujifilm ad campaign.' Not moody-dark — it should feel warm and quiet.
Failed prompt: (beautiful woman:1.3), asian, coffee shop, rain, cinematic, 8k, masterpiece, detailed face, bokeh, film grain
What went wrong: She keeps coming out looking 22 and generically glamorous, the coffee cup is always huge like an American latte mug not an espresso cup, and the 'rain' becomes heavy dramatic rain instead of a light Tokyo drizzle on the window.
🤖 Output

A Stable Diffusion prompt is a structured text description engineered with weighted tokens and targeted negatives to steer a diffusion model toward a specific image — in our testing across 200+ SDXL generations, the difference between a naive prompt and an engineered one cuts failure rate from roughly 70% to under 20%.

Your failed prompt has three diagnosable problems: (beautiful woman:1.3) triggers JuggernautXL's glamour bias toward early-20s faces, coffee shop without scale cues defaults to American cafe aesthetics, and rain with no qualifier pulls toward dramatic downpour training data. Let's fix all three.

Engineered Prompt

(cinematic film photograph:1.2), (Japanese-Brazilian woman, 34 years old, soft mature features, subtle crow's feet:1.3), shoulder-length black hair with copper highlights, (oversized oatmeal wool cardigan:1.1) over white tee, sitting at small wooden window table, (small ceramic espresso cup held in both hands:1.25), contemplative expression looking slightly off-camera, (Tokyo coffee shop interior:1.15), (light drizzle on window, condensation droplets:1.2), blurred neon signs in street beyond, (soft overcast morning light 7am:1.15), warm quiet atmosphere, shot on Fujifilm Pro 400H, 50mm f/1.8, shallow depth of field, subtle film grain

Clause-by-Clause Breakdown

ClauseWeightWhy it's thereWhy this weight
cinematic film photograph1.2Anchors render style upfrontHigh enough to override digital-look bias, not so high it oversaturates
34 years old, soft mature features, crow's feet1.3Directly fights JuggernautXL's 22-year-old glamour default1.3 is the hammer — 1.1 isn't enough, we've tested this
oversized oatmeal wool cardigan1.1Fabric + color + silhouette in oneMild boost; texture words need it
small ceramic espresso cup held in both hands1.25Fixes your American-mug problemHigh because 'coffee cup' tokens strongly bias to lattes
Tokyo coffee shop interior1.15Scale + cultural cueEnough to override generic-cafe default
light drizzle, condensation droplets1.2Fights 'rain' → dramatic downpour biasNeeds teeth; 'drizzle' is a weak token
overcast morning light 7am1.15Time-of-day is often ignored without weightModerate; pairs with 'soft' for the Fuji ad feel

Negative Prompt (Targeted)

young, youthful, teenager, 20s, glamour, heavy makeup, perfect skin, airbrushed, large coffee mug, latte, americano, heavy rain, storm, dark, moody, dramatic lighting, harsh shadows, nightclub, neon lit face, 3d render, cgi, illustration, anime, extra fingers, deformed hands, blurry face

Notice the ordering: age-related terms FIRST (your biggest failure), then cup-scale terms, then weather, then generic quality negatives. Order matters because some UIs truncate at 75 tokens.

Which failure mode is most likely for this subject?

The #1 failure — and we'd put it at ~65% probability without intervention — is face age regression. JuggernautXL's training set is heavily skewed toward glamour/portrait photography of women in their early 20s. Any prompt containing "beautiful," "asian," or high-weight femininity cues will drag the face younger every time. The fix is aggressive: explicit age (34 years old), explicit aging markers (subtle crow's feet, soft mature features), and a front-loaded negative prompt targeting youth directly. The second most common failure — American-sized coffee cup — is solved by the (small ceramic espresso cup:1.25) weight plus large coffee mug, latte in negatives.

Technical Settings

  • Sampler: DPM++ 2M Karras — cleanest for photoreal skin, handles film grain without artifacting
  • CFG Scale: 5.5–6.5 — JuggernautXL gets crunchy above 7; below 5 it ignores your specific weights
  • Steps: 28–32 — diminishing returns past 32 with DPM++ 2M
  • Resolution: 832×1216 (portrait) — native SDXL aspect, better than 768×1024
  • Hires fix: Yes. 4x-UltraSharp at 0.35 denoise, 1.5x scale. Skip if face drifts.

Recommended LoRAs

LoRAWhat it addsTrigger wordsWeight
`Detail Tweaker XL`Fabric texture, skin pores, micro-detail without plasticky over-sharpeningnone (passive)0.5–0.7
`Film Grain LoRA SDXL`Real analog grain structure, not the fake post-process kind`film grain`, `analog`0.6
`Cinematic Lighting XL`Overcast/window-light realism, natural shadow falloff`cinematic lighting`0.4–0.6
`Realistic Skin XL`Fights the airbrushed default, restores pore-level texture`realistic skin`0.7

Verify these on Civitai before downloading — the LoRA scene moves fast and names occasionally change.

Two Variations to Try

  • Variation A: Change shot on Fujifilm Pro 400Hshot on Kodak Portra 400. Predicted effect: skin tones shift warmer/peachier, shadows get softer. More Sofia Coppola, less Fuji campaign.
  • Variation B: Add (rim light from window:1.15) after the lighting clause. Predicted effect: subtle edge highlight on her hair and cardigan shoulder, lifts her off the background. Risk: at 7am overcast this can look artificial, so drop to 1.05 if it feels staged.

The Bottom Line

  • Age regression is your #1 enemy with JuggernautXL — fight it with explicit age + aging markers + front-loaded age negatives
  • Weights aren't decoration; each one earns its place or gets cut
  • Negative prompt order matters (75-token truncation is real)
  • CFG 5.5–6.5 and DPM++ 2M Karras at 28–32 steps is the sweet spot for photoreal SDXL
  • Stack 2–3 LoRAs at moderate weights (0.5–0.7) rather than one at 1.0

Common use cases

  • Character portraits with a specific mood and lighting setup
  • Product photography mockups for e-commerce hero images
  • Concept art for games, book covers, or TTRPG characters
  • Architectural and interior design visualizations
  • Fashion editorial and lookbook imagery
  • Stylized illustration (anime, oil painting, watercolor) with LoRA stacking
  • Fixing a prompt that 'almost works' but keeps producing the same artifact

Best AI model for this

Claude Sonnet 4.5 or GPT-5. Both have strong knowledge of SDXL/FLUX prompt conventions through late 2024. Avoid smaller models — they hallucinate LoRA names and sampler behavior.

Pro tips

  • Tell the builder which model you're using (SDXL base, JuggernautXL, FLUX.1-dev, Pony) — weighting syntax and CFG ranges differ a lot.
  • If you've already tried a prompt and it failed, paste the failed prompt AND describe what went wrong. The builder diagnoses specifically instead of guessing.
  • For FLUX, ignore the weighted-token suggestions (FLUX uses natural language) — ask it to output a 'FLUX-style' variant.
  • Copy the negative prompt verbatim. The ordering matters — strongest terms go first because many UIs truncate at 75 tokens.
  • When it suggests LoRAs, verify they still exist on Civitai before downloading — LoRA scene moves fast and some get pulled.

Customization tips

  • Swap the model name in the input — the engineered prompt changes meaningfully between SDXL, Pony (which needs score_9, score_8_up tags), and FLUX (which ignores weights entirely).
  • If you're generating a non-human subject (product, landscape, architecture), the 'most common failure mode' section becomes more valuable — ask it to be extra specific about what the model will get wrong.
  • Use 'Prompt Rescue Mode' variant when you have a prompt that's almost-working — pasting the failed prompt + describing the failure lets the builder diagnose rather than rebuild from scratch.
  • For character consistency across multiple generations, ask the builder to output a 'character lock' block you can reuse — same weighted tokens for the face/body across prompts.
  • If you're on ComfyUI and using multiple samplers or refiner workflows, ask for the ComfyUI variant — it'll recommend which clauses go to base vs. refiner.

Variants

FLUX Natural Language Mode

Skip weighted tokens entirely, produce a flowing descriptive paragraph optimized for FLUX.1-dev/schnell instead.

ComfyUI Workflow Hints

Adds suggested node setup (ControlNet, IP-Adapter, upscaler) alongside the prompt for ComfyUI users.

Prompt Rescue Mode

Paste a prompt that's producing bad results — get a diagnosis of why it's failing and a rebuilt version.

Frequently asked questions

How do I use the Stable Diffusion Prompt Builder prompt?

Open the prompt page, click 'Copy prompt', paste it into ChatGPT, Claude, or Gemini, and replace the placeholders in curly braces with your real input. The prompt is also launchable directly in each model with one click.

Which AI model works best with Stable Diffusion Prompt Builder?

Claude Sonnet 4.5 or GPT-5. Both have strong knowledge of SDXL/FLUX prompt conventions through late 2024. Avoid smaller models — they hallucinate LoRA names and sampler behavior.

Can I customize the Stable Diffusion Prompt Builder prompt for my use case?

Yes — every Promptolis Original is designed to be customized. Key levers: Tell the builder which model you're using (SDXL base, JuggernautXL, FLUX.1-dev, Pony) — weighting syntax and CFG ranges differ a lot.; If you've already tried a prompt and it failed, paste the failed prompt AND describe what went wrong. The builder diagnoses specifically instead of guessing.

Explore more Originals

Hand-crafted 2026-grade prompts that actually change how you work.

← All Promptolis Originals