Real — Veo 3 Fashion Film Prompt Studio

Make it real.

Ready to use in:

A Veo 3 prompt studio built for fashion film first — editorials, runway, beauty, and luxury campaigns — and ready for short films, music videos, and visualizers too. The system combines prompt retrieval, curated aesthetic models, generation controls, and visual prompt engineering to improve controllability and consistency in AI-generated video.

Currently available as a public prototype with potential future commercialization.

Over the past year, I have spent a significant amount of time experimenting with Veo 3 and other AI video-generation models while creating fashion films, editorials, and runway concepts for my Bella PI YouTube channel.

▶️ Watch the videos: youtube.com/@BellaPi314 · For more examples, visit my website.

📖 Why I built this: a math-and-physics student, AI video, and the patterns behind the shots that worked — read the motivation →

Through hundreds of prompt iterations, I began documenting recurring patterns that consistently produced stronger visual results. This project is an attempt to organize those observations into a reusable prompt-engineering framework that helps transform rough ideas into structured cinematic briefs.

The platform combines prompt retrieval, visual metadata modeling, aesthetic classification, and structured prompt assembly to generate more coherent Veo 3 prompts — fashion editorials, luxury campaigns, runway films, and beauty close-ups first, with the same engine supporting short films, music videos, and visualizers.

Project Status: Active Prototype (Potential Future Product)

Try the prototype

Topics: veo3 · fashion-film · fashion-video · ai-fashion · fashion-editorial · runway · veo3-prompt-generator · cinematic-prompt-engineering · google-veo · gemini · text-to-video

Why AI Video Generation?

Generative AI is rapidly changing how visual content is created across advertising, entertainment, fashion, social media, and digital marketing. Companies increasingly use AI-generated video to prototype campaigns, explore creative concepts, build promotional content, and accelerate production workflows.

As video-generation models become more capable, the challenge is no longer simply generating content. It is directing the model toward a specific visual outcome.

This project explores how structured prompt engineering, visual metadata, and reusable aesthetic models can improve controllability and consistency in AI-generated video. By organizing successful visual patterns into reusable frameworks, the platform helps creators communicate visual intent more effectively and produce stronger cinematic results.

What It Does

Rewrites rough visual ideas into structured Veo 3 prompts.
Uses visual models such as Runway Couture, Beauty Editorial, Minimalist Atmosphere, Cinematic Storytelling, Architectural Muse, and more.
Adds Studio Space generation controls for dialogue mode, music context, and output type.
Retrieves curated prompt examples from the internal prompt library.
Assembles prompts into a repeatable visual-director brief format.
Plays rotating demo videos from hosted Cloudinary assets.
Supports local development with a private .env API key.
Supports deployment with a Render backend so the Gemini key stays private.

Tech Stack

Frontend: HTML, CSS, JavaScript, React, Babel, Tailwind CDN, Lucide-style icon usage, inline SVG brand icons.
Backend: Node.js with the built-in http module.
AI Provider: Google Gemini API through a backend proxy.
Deployment: GitHub repository connected to Render Web Service for the backend.
Media Hosting: Cloudinary for hosted demo videos.
Local Configuration: .env and .env.example.

The web app lives in index.html; the backend proxy lives in server.mjs. The prompt-engineering logic is centralized in core/ and reused by every surface (web app, MCP server, browser extension), so there is one source of truth.

Surfaces

The same Veo 3 director engine ships in several forms:

Web app — index.html + server.mjs (streaming output, Veo 3 controls).
MCP server — mcp-server/, usable in Claude Desktop, Claude Code, Cursor, etc.
Gemini CLI extension — gemini-extension/ (/veo3:improve, /veo3:models).
Chrome extension — chrome-extension/ (toolbar popup; generated from core/).
Claude skill — claude-skill/veo3-director/.

Architecture & security notes

The backend builds the Gemini prompt server-side from a constrained {idea, model, controls} payload (core/), so the proxy can't be abused as an open relay for arbitrary prompts.
CORS is locked to ALLOWED_ORIGINS (localhost always allowed) and the prompt endpoint is per-IP rate limited (RATE_LIMIT_MAX / RATE_LIMIT_WINDOW_MS).
Output streams over Server-Sent Events for progressive rendering.
Veo 3-native controls: aspect ratio, clip duration, and a negative prompt.

Project Files

index.html - Main single-page Studio Space app.
server.mjs - Node backend: validates input, builds prompts via core/, streams Gemini output.
core/ - Canonical director engine (director.mjs, models.json, library.json).
mcp-server/ - MCP server exposing the engine as tools.
gemini-extension/, chrome-extension/, claude-skill/ - Extension/skill packagings.
scripts/build-clients.mjs - Regenerates chrome-extension/data.js from core/.
scripts/test-core.mjs - Core engine tests (npm test).
.env.example - Template for Gemini, backend, security, and Cloudinary settings.
asset/ - Local demo video files.
cloudinary-videos.json - Cloudinary upload metadata.
scripts/upload-cloudinary.mjs - Helper script for uploading demo videos to Cloudinary.
package.json - Scripts: start, build:clients, mcp, test, upload:cloudinary.

Prompt Algorithm

The app uses a structured prompt architecture rather than a simple one-shot prompt.

Pipeline:

User Idea
↓
Visual Model Selection
↓
Dialogue Selection
↓
Music Selection
↓
Output Type
↓
Prompt Architecture Engine
↓
Veo 3 Prompt Output

The main algorithm happens in index.html inside the prompt generation flow.

When the user clicks UNLOCK YOUR CREATIVE POTENTIAL, the app:

Reads the rough user idea.
Reads the selected visual model.
Reads generation controls:
- dialogueMode
- musicMode
- outputType
Retrieves related examples from the prompt library.
Extracts visual keywords from the selected aesthetic model.
Builds a generationControls object:

{
  visualModel: "...",
  dialogueMode: "...",
  musicMode: "...",
  outputType: "..."
}

Creates a systemPrompt that defines the creative-director rules.
Creates userContent that includes the idea, selected model, keywords, examples, and controls.
Sends both values to the backend endpoint:

POST /api/improve-prompt

Displays the generated Veo 3 prompt in the output panel.

Current Control Logic

The dialogue, music, and output type controls are currently prompt-guided.

That means:

The selected values are passed into prompt assembly.
The system prompt explains how Gemini should interpret them.
The model uses those controls to shape the final prompt.

There is not yet a deeper rule-based validation layer that automatically checks and regenerates outputs when the model disobeys the selected controls.

Future improvement:

User Selections
↓
Control Policy Layer
↓
Prompt Assembly
↓
Model Generation
↓
Output Validation
↓
Optional Regeneration

This would make the controls behave more like a strict production pipeline.

Backend API Flow

server.mjs protects the API key and handles the Gemini request.

Backend flow:

Loads .env values.
Exposes GET /api/health and GET /api/options.
Exposes POST /api/improve-prompt.
Receives a constrained { idea, visualModel, dialogueMode, musicMode, outputType, aspectRatio, duration, negativePrompt, modifier } payload, validates it against the known option sets, and assembles the prompt server-side via core/.
Enforces a CORS allowlist (ALLOWED_ORIGINS, plus same-origin and localhost) and per-IP rate limiting.
Calls Gemini with the private API key, retrying temporary failures such as rate limits or service overload.
Streams the result over Server-Sent Events (or returns { text } for non-streaming requests).

The browser never sees the Gemini API key, and the endpoint cannot be used to relay arbitrary prompts.

Render Deployment Pipeline

This project is designed so the frontend can live publicly while the Gemini key stays private on Render.

Recommended production structure:

GitHub repo
↓
Render Web Service
↓
server.mjs backend
↓
Gemini API

If the frontend is hosted separately on GitHub Pages, the frontend should call the Render backend URL for prompt generation.

Example:

https://your-render-service.onrender.com/api/improve-prompt

Render setup:

Service Type: Web Service
Runtime: Node
Branch: main
Root Directory: leave empty
Build Command: npm install
Start Command: npm start

Render environment variables:

CREATIVE_API_KEY=your_gemini_api_key_here
CREATIVE_MODEL=gemini-2.5-flash
HOST=0.0.0.0

Render provides PORT automatically, so you usually do not need to set AVR_BACKEND_PORT on Render.

Important: do not set the build command to npn install. It must be:

npm install

Run Locally

You can open index.html directly for the interface, but prompt generation needs the backend running.

1. Install Node.js

Use Node.js 18 or newer.

2. Create `.env`

Copy .env.example to .env:

CREATIVE_API_KEY=your_gemini_api_key_here
CREATIVE_MODEL=gemini-2.5-flash
AVR_BACKEND_PORT=8787
HOST=127.0.0.1

CREATIVE_API_KEY is your direct Gemini API key. Replace the placeholder with your real key only in .env.

3. Start the Backend

npm start

By default, the backend runs at:

http://localhost:8787

Then open:

http://localhost:8787

Using Your Own API Key

My current public online demo uses a free API tier of Google Gemini Flash 2.5, which can sometimes be delayed by rate limits.

Running locally with your own key gives you more control and usually better performance. You can also plug the project into your own Gemini setup and use a more advanced model by changing CREATIVE_MODEL in .env.

The API key is stored only in .env locally or in Render environment variables in production. It should never be pasted into index.html.

Demo Videos

Demo videos are referenced from Cloudinary inside index.html. The local asset/ folder keeps the original video files.

To upload videos again, configure Cloudinary variables in .env:

CLOUDINARY_CLOUD_NAME=your_cloud_name
CLOUDINARY_API_KEY=your_cloudinary_api_key
CLOUDINARY_API_SECRET=your_cloudinary_api_secret
CLOUDINARY_FOLDER=veo3-prompt-improver

Then run:

npm run upload:cloudinary

This updates cloudinary-videos.json with hosted video information.

Security Notes

Do not commit .env.
Do not expose the Gemini API key in frontend code.
Store production secrets in Render environment variables.
Rotate any key immediately if it is accidentally committed or exposed.

Name		Name	Last commit message	Last commit date
Latest commit History 85 Commits
.github/workflows		.github/workflows
asset		asset
chrome-extension		chrome-extension
claude-skill		claude-skill
core		core
gemini-extension		gemini-extension
mcp-server		mcp-server
scripts		scripts
.env.example		.env.example
.gitignore		.gitignore
Agent.md		Agent.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MOTIVATION.md		MOTIVATION.md
PRODUCT.md		PRODUCT.md
README.md		README.md
apple-touch-icon.png		apple-touch-icon.png
cloudinary-videos.json		cloudinary-videos.json
favicon.png		favicon.png
index.html		index.html
logo-web.png		logo-web.png
logo.png		logo.png
logo.svg		logo.svg
package.json		package.json
server.mjs		server.mjs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Real — Veo 3 Fashion Film Prompt Studio

Why AI Video Generation?

What It Does

Tech Stack

Surfaces

Architecture & security notes

Project Files

Prompt Algorithm

Current Control Logic

Backend API Flow

Render Deployment Pipeline

Run Locally

1. Install Node.js

2. Create `.env`

3. Start the Backend

Using Your Own API Key

Demo Videos

Security Notes

Credits

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Real — Veo 3 Fashion Film Prompt Studio

Why AI Video Generation?

What It Does

Tech Stack

Surfaces

Architecture & security notes

Project Files

Prompt Algorithm

Current Control Logic

Backend API Flow

Render Deployment Pipeline

Run Locally

1. Install Node.js

2. Create .env

3. Start the Backend

Using Your Own API Key

Demo Videos

Security Notes

Credits

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

2. Create `.env`

Packages