Most YouTube creators film too much and use too little. After a full day of shooting they end up with 4 hours of footage for a 12-minute video — and spend three times as long editing because they don't know which clips tell the story.

The solution isn't better footage. It's knowing what you need to film before you pick up the camera.

Why most beginner YouTubers over-film and under-use their footage

Without a shot plan, you film reactively — capturing whatever seems interesting. This creates two problems:

The 2x rule Plan to film 1.5–2x as many shots as you expect to use in the final edit. A 10-minute video with 40 cuts in the edit needs a shot list of about 60–80 planned shots.

Shot types: A-Roll, B-Roll, C-Roll, D-Roll

A-ROLL
Primary footage
You talking to camera. The spine of your video. Voiceover text goes here. Usually 30–50% of the final edit.
B-ROLL
Illustrative footage
Cutaway shots that illustrate what you're saying: close-ups, wide shots of locations, action sequences. Plays over A-Roll audio.
C-ROLL
Behind-the-scenes
Candid moments, the process of filming. Adds authenticity and personality. Common in vlogs and documentary content.
D-ROLL
Stock / archive
Footage you didn't film: stock video, screen recordings, graphics. Useful for context or concepts you can't film yourself.

Camera scale: wide, medium, close-up

Golden rule of coverage For every scene, film at least three different scales: wide, medium, close-up. This gives you the ability to cut between scales and create visual interest without needing more footage.

Camera movement for YouTube creators

How to build your shot plan from a script

The shot plan flows directly from your script. For each voiceover phrase:

  1. Ask: "what does the viewer need to see while they hear this?"
  2. Identify the primary shot — A-Roll for narration, B-Roll for illustrative content.
  3. Add coverage shots — 2–3 angles at different scales for each B-Roll moment.
  4. Mark camera movement — decide before you film whether each shot is static, panning, etc.

ClapLab does this automatically — it reads each script phrase and generates 2–5 shots with type, scale, and movement pre-specified.

A real shot plan example

For this script phrase: "I set off at 5am hoping to beat the crowds — but the trail was already busy."

Shot descriptionTypeScaleMovement
Me talking to camera — set off at 5amA-RollMediumStatic
Trail at dawn, establishing locationB-RollWidePan
Feet on trail, walking POVB-RollDetailPOV
Other hikers on trailB-RollMediumStatic
My face reacting to the crowdC-RollClose-upStatic

One script sentence generates five planned shots at three different scales. In the edit, you'll likely use 2–3 of these, cutting between them to match the voiceover pacing.

Using your shot plan on location

FAQ

What is A-Roll and B-Roll in YouTube videos? +

A-Roll is your primary footage — usually you speaking to camera. B-Roll is supplementary footage that illustrates what you're talking about: close-ups, location shots, action sequences. Most YouTube videos are about 40% A-Roll and 60% B-Roll.

Do you need a storyboard for YouTube videos? +

You don't need a drawn storyboard, but you do need a shot list — a written list of every shot you plan to film with notes on type, scale, and movement. Without it, you'll either over-film or miss key shots that make the edit fall apart.

How many shots do you need for a YouTube video? +

A typical 10-minute video requires 30–80 individual shots in the edit. To have enough coverage, plan 1.5–2x as many as you expect to use — about 50–100 shots in your list. ClapLab generates 2–5 shots per script phrase automatically.

Get a shot plan for your next YouTube video

ClapLab turns your idea into a complete script and shot plan in minutes. Free, no install needed.

Generate your shot plan →