What is Picture-in-Picture Creator?

Combine two videos by placing a smaller picture-in-picture window over a main video. Use it for reaction videos, webcam tutorials, gaming streams, or presentations with a speaker inset. Position and resize the overlay however you like.

FFmpeg runs entirely on your device through WebAssembly, so the videos never leave your machine. Drag the inset around a live preview to drop it at any pixel, or snap it to a corner in one tap. The optional border helps the inset stand out against busy backgrounds, the opacity slider lets you fade the inset to a ghost-overlay or watermark, the corner padding control sets how snug the inset sits against the edge, and the overlay volume slider mixes the overlay's audio into the main track when you want both voices to come through. An overlay start offset lets you begin from a later moment in the clip without trimming it first. Switch the overlay to a circle for a clean webcam look, mute the main track when only the narrator should be heard, and choose a 480p, 720p, or 1080p output to trade file size for quality.

How to use

  1. Upload your main (background) video and the overlay (PiP) video that will appear in a smaller window.
  2. Drag the inset on the live preview to place it anywhere, or tap a corner preset for a quick snap. Then set its shape — a rectangle or a webcam-style circle — and tune the overlay size, opacity, corner padding, optional border, and overlay audio volume. Set an overlay start offset to begin from a later point in the clip, mute the main video's audio when you only want the overlay's, and pick an output resolution.
  3. Click Create to render the combined video, then download the result.

When to use

  • Layering a webcam reaction over a clip you're commenting on for YouTube or TikTok.
  • Adding a presenter inset to a screen recording so a tutorial doesn't feel faceless.
  • Compositing gameplay footage with a small webcam feed for streaming recap videos.

Result

A gaming content creator overlays their 320x240 webcam footage in the bottom-right corner of their 1920x1080 gameplay recording, with a 2px white border around the webcam window.

FAQ

What video formats can I upload for the main and overlay clips?
MP4, WebM, MOV, and MKV all work. The two clips do not need to match in resolution or frame rate — the tool re-encodes both to a common H.264 MP4. The output uses the main video's duration as the timeline length.
Why does the output finish at the main video's length, not the overlay's?
The main clip drives the timeline. If the overlay is shorter, the last frame freezes for the remainder. If it is longer, anything past the main clip's end gets trimmed. Crop the overlay first if you need exact pacing.
How big can the input files be?
Practically capped by your device's RAM since processing happens locally. On 8 GB laptops, two 1080p clips of around 5 minutes each go through fine. Past 1 GB total input you'll see slow loading and possible tab crashes — split into shorter sections.
Will the audio from the overlay video be included?
Only the main video's audio plays by default, so the existing flow stays the same. Push the overlay volume slider above Muted to mix the overlay's audio into the main track at the level you choose. Reaction and commentary edits no longer need a separate audio editor when both clips carry sound.
Why does processing take a minute or two even for short clips?
FFmpeg has to decode both videos, composite each frame, and re-encode the result to H.264. Browsers run this single-threaded under WebAssembly, so a 1-minute 1080p output typically needs 60 to 120 seconds of CPU time. Close other heavy tabs to speed it up.

Related Tools