Behind the Scenes·6 min read·2026年5月5日

Inside a Hybrid Shoot: When AI and Live Action Share the Same Frame

What actually happens on set when part of the scene is shot live and part is generated. A look inside our hybrid production workflow.

The phrase "AI in production" usually conjures fully synthetic content. But the most interesting work we're doing in 2026 isn't fully AI or fully live — it's hybrid, where real talent shot on a real set shares the frame with AI-generated environments and elements. Here's what that actually looks like in practice.

Planning the Seam

The hardest part of a hybrid shoot happens before anyone arrives on set. We have to decide, shot by shot, exactly where the line between live and generated falls — and plan both sides to meet seamlessly. This means precise pre-visualisation: AI-generated concept frames that double as a blueprint for what the live elements need to match. Lighting direction, camera height, lens choice, and colour temperature all have to be locked before the shoot, because the generated environment will be built to those specifications.

On Set: Shooting to Match

On the day, the live shoot is more disciplined than a conventional one. Talent performs against controlled backgrounds — often a clean cyclorama or a partial physical set — with lighting that exactly matches the planned AI environment. Camera moves are tracked and logged so the generated elements can be composited with correct perspective. The director is simultaneously directing a real performance and imagining the final composite that won't exist for days. It demands a particular kind of double vision.

Post: Where It Comes Together

In post-production, the generated environments are produced to match the locked specifications, then composited with the live footage. The colourist plays a crucial role here — unifying the live and generated elements into a single, believable image. This is where hybrid shoots live or die. A mismatch in lighting, grain, or colour science breaks the illusion instantly. Getting it right requires treating the generated and live elements as one coherent photographic system, not two layers stacked together.

Why Bother?

The payoff is creative and economic. Hybrid production lets us put real, emotionally resonant human performance into environments that would be impossible, dangerous, or prohibitively expensive to shoot practically — a product floating above a city at golden hour, talent walking through an environment that doesn't exist. We get the authenticity of real performance with the boundless scope of generated worlds, often at a fraction of the cost of building elaborate sets or travelling to exotic locations.

The Skill That Matters Most

Hybrid shoots have made one thing clear: the most valuable skill in this kind of production isn't prompting or compositing — it's the director's ability to hold the final image in their head and orchestrate both halves of it towards that vision. The technology provides the tools. The coherence comes from human judgement. As these workflows become standard, the directors who thrive will be the ones who can think fluidly across both the real and the generated — treating them not as separate disciplines but as one expanded canvas.

傾計一下