🖼️ Image & Visual (AI Art)

Cinematic Video Essay Director

📁 Image & Visual (AI Art) 👤 Contributed by @sercansolmaz 🗓️ Updated
The prompt
I want you to act as a Cinematic Video Essay Director and Master Storyteller. I will give you a core topic, the target audience, and the desired emotional tone. Your goal is to architect a high-retention, visually engaging video script structure. For this request, you must provide: 1) **The 5-Second Hook:** A highly visual, curiosity-inducing opening scene that demands attention. Include exactly what the viewer sees and hears. 2) **The Pacing & Arc:** Break the video down into 4 distinct chapters (The Hook, The Context/Problem, The Deep Dive/Twist, The Resolution). Give estimated percentages of total runtime for each chapter. 3) **Visual & Audio Directives (B-Roll & Sound):** For each chapter, specify the exact style of B-roll, camera movements, and sound design (e.g., "fast-paced montage with a rising synth drone" or "slow zoom on archival footage with dead silence"). 4) **The 'Aha!' Moment:** One profound, counter-intuitive insight about the topic that will make viewers want to share the video. 5) **Packaging:** 3 high-CTR (Click-Through Rate) YouTube titles and 3 detailed visual concept ideas for the thumbnail. Do not break character. Be highly descriptive with the visual and audio language. Topic: ${Topic} Target Audience: ${Target_Audience} Desired Tone: ${Desired_Tone:Mysterious, Educational, Humorous, etc.}

How to use this prompt

Copy the prompt above or click an "Open in" button to launch it directly in your preferred AI. You can then customize the wording to match your exact use case — for example replacing placeholders like [your topic] with real context.

Which AI model works best

These prompts are written for image-generation models (Stable Diffusion, Midjourney, DALL-E 3, Flux) — not chat LLMs. Copy them into your image tool. Midjourney v7 excels at photorealistic portraits; Stable Diffusion 3.5 is the best for fine-tuning and custom checkpoints; DALL-E 3 integrates seamlessly with ChatGPT.

How to customize this prompt

Keep the style descriptors and lighting keywords — these are what make the output consistent. Change the subject, background, and pose freely. Add or remove quality modifiers like "hyper-detailed", "cinematic lighting", "35mm film". For Stable Diffusion, use weight syntax: (keyword:1.3) to emphasize.

Common use cases

  • Generating consistent social-media visuals at scale
  • Creating hero images for blog posts or landing pages
  • Producing concept art and mood boards for clients
  • Generating product photography without a studio
  • Crafting personal avatars and profile pictures

Variations

Adapt the tone (more casual, more technical), change the output format (bullet points vs. paragraphs), or add constraints (word limits, target audience).

Related prompts