Showing your product clearly and naturally in a video is one of the most effective ways to capture attention, build trust, and drive conversions.
With Arcads, you can showcase your product in multiple formats—from hands-on demos to creator-style videos—without filming, logistics, or complex setup.
This article explains the best ways to present your product in a video, along with practical tips to make it look authentic, engaging, and conversion-ready.
Most models available in Arcads allow you to create videos that showcase a product. Each model has its own strengths in terms of video length, creative flexibility, and level of realism. Below is a quick summary to help you choose the right one.
AI Models Comparison
AI Model | Talking Actor | Sora 2 Pro | Veo 3.1 |
Max video length | Up to 1,500 characters (≈ 40–60s) | 12 seconds max | 10 seconds max |
Voice control | Full voice control | No voice control | No voice control |
Realism | ++ | +++ | ++ |
Reference image | Any image | Product images only (no people) | Any image |
Start frame support | Any image | Not supported | Any image |
End frame support | Not supported | Not supported | Any image |
Creative flexibility | Actor face-cam only | Unlimited | Unlimited |
Best for | UGC ads, testimonials, explainers, talking-head videos | High-end cinematic product shots, brand visuals, realistic scenes, premium storytelling | Fast-paced hooks, creative ad intros, experimental visuals, motion-heavy social content |
We’ll demonstrate how to advertise this product using each model, from the easiest setup to the most advanced workflows.
Reference image (get the best possible quality!)
Use Case #1: UGC-Style Video
Talking actor presenting the product to the camera
Model Type: Talking Actor (Audio-Driven)
This workflow uses the Audio-Driven Talking Actor model to create a simple, authentic UGC-style video where an actor presents the product directly to the camera.
Advantages
Very easy to use
100% product consistency throughout the video
Supports videos up to ~40 seconds (600 characters)
Full control over voice type and accent
Limitations
Static actor
Limited creative flexibility
Best suited for face-cam style presentation only
Step-by-Step Guide
Step 1: Create a Custom Actor Holding Your Product
Since the actors available in the library are not yet holding your product, you’ll first need to create a custom actor showing the product (for example, a Stanley cup).
To do this:
Generate an image in Arcads of an actor holding your product
Go to the Image Generation tool
Upload your product image as a reference image
Write a prompt describing the actor and how they are holding the product
This custom image will then be used to generate a consistent talking-actor video featuring your product.
prompt used : "a 30 years old woman in a kitchen, holding the product in the reference image and showing it to the camera"
Here is the result ->
5.Once the image is generated, click on 'Actor" to transform the image into a talking actor
6. Pick a voice of your choice
6. and write your script "what the actor should say"
RESULT :
Model Type 2: Video Models
Using Sora 2 Pro and Veo 3.1
This workflow uses advanced video generation models to create highly realistic and creative product videos.
Advantages
High level of visual realism
Full creative freedom (camera movement, scenes, environments)
Ideal for cinematic or dynamic product shots
Limitations
Prompts must be written from scratch
Requires prompt-writing techniques to maintain full product consistency
Shorter video formats (up to 12 seconds)
No control over voice type
VO3.1 (up to 10 seconds per clip)
Since Veo 3.1 allows you to use a specific image as a start frame, we will reuse the image generated earlier and click Video to transform it into a video using Veo 3.1.
Here is the prompt " she says : I've been using this Stanley cup nonstop. It keeps my water cold all day, and I love how comfortable the handle is. It just fits into my routine.
RESULT :
SORA 2 PRO (up to 12 seconds per clip)
Sora does not support start frames or reference images that include people. As a result, the video must be generated entirely from scratch using a prompt
Here is the prompt
"A realistic UGC-style video filmed in natural daylight.
Use the provided product reference
A young adult woman stands in a bright, modern kitchen. She holds the product in the reference image close to the camera.
She looks directly into the camera and speaks naturally with a calm, friendly tone. Her body and hands remain mostly still while she talks.
The product must remain exactly identical throughout the entire video: same shape, size, color, proportions, handle, and straw.
No deformation, no warping, no rotation, no logo changes, no added or removed elements.
The camera is handheld but stable, realistic smartphone quality, no cinematic lighting, no slow motion, no dramatic camera movement.
The background remains consistent and realistic, with no changes to the environment.
Dialogue (spoken naturally):
“I’ve been using this Stanley cup nonstop.
It keeps my water cold all day,
and I love how comfortable the handle is.
It just fits into my routine.”
Here is the result :
Use Case #2: Let's get to the next level of creativity ! (advanced)
Please note the below video are without editing, raw asset generated.
Since we want to create a more creative format, the talking actor model is not suitable for this use case. Instead, we need to use Sora 2 Pro or Veo 3.1, as these models allow detailed scene descriptions, actions, and camera movement that go beyond a simple talking-head face cam.
Video #1: Cheers Moment (Using the Initial Start Frame VEO3.1)
Since we want to reuse the actor from the initial image, we need to use Veo 3.1. Veo allows the use of images containing people as start frames, whereas Sora only supports product images and does not allow start frames with people.
Starting from the initial image, we have the actor join friends in a living room. Everyone is holding a Stanley cup, and they come together to cheer and celebrate.
By uploading the initial image as the start frame in Veo 3.1, we ensure actor and scene consistency before animating the action.
Here is the prompt -> : "Create a realistic, UGC-style video filmed in natural daylight, total duration no more than 10 seconds.
Scene 1 (0–3s):
The woman from the reference image smiles, holding her Stanley cup with a handle and straw. She turns and takes a few natural steps from the kitchen toward the living room.
Scene 2 (3–6s):
She joins two friends seated casually in a bright, cozy living room. They smile and laugh naturally. Each friend holds a Stanley cup of the exact same design, in different solid colors.
Scene 3 (6–9s):
All three raise their Stanley cups and gently clink them together once in a friendly cheers.
Final beat (9–10s):
They hold the cups briefly at chest height, smiling, relaxed, authentic.
Product consistency rules:
Every Stanley cup remains identical in shape, size, handle position, lid, and straw design. Only the colors vary.
No deformation, no warping, no logo changes, no added elements.
Camera & style:
Handheld smartphone camera, stable, realistic movement. No slow motion, no cinematic effects"
Result :
Video #2: Product showcase video (Using Sora 2 pro)
Here is the prompt -> : "Create a 12-second ultra-chaotic, fruit-powered, product-only video featuring a Stanley cup.
The Stanley cup must remain the exact same model at all times:
same shape, proportions, handle, lid, logo placement, material finish.
Color changes are allowed, but only as clean, intentional swaps between real Stanley cup colors.
No distortion, no redesign, no stylization.
No people. No hands. No text. No voiceover.
UPDATED CHAOS SEQUENCE:
0–0.5s (SCROLL STOPPER)
Smash cut: lime explodes → ice CRACK → Stanley cup slams into frame.
Heavy bass hit.
0.5–2s
Micro-cuts (0.1s each):
• strawberry slice spins
• condensation bead slides
• ice flashes
• metal surface catches light
2–4s (UPDATED SECTION — COLOR PLAY)
Ultra-fast match cuts synced to beat drops.
Each cut swaps the Stanley cup color cleanly:
• pink Stanley cup on citrus-orange background
• green Stanley cup on neon-lime background
• blue Stanley cup on electric-blue background
Each color appears for ~0.2–0.3s.
Cup stays perfectly centered, unchanged except color.
Hard, graphic transitions. Zero motion blur.
4–6s
Juice splash choreography:
orange juice arcs around the cup
water droplets suspended mid-air
hyper-crisp, high shutter speed
6–8s
Rapid rotation accents (physically realistic):
front → handle → lid → logo
Snap zooms on beat
8–10s
Fruit chaos burst:
watermelon chunks
grapefruit wedges
mint leaves
Ice sounds stutter rhythmically
10–11.5s
Hard slowdown for contrast:
single Stanley cup
condensation dripping
ice settling inside
11.5–12s
Extreme macro punch-in on ice + rim
Instant cut to black (loop-friendly).
SOUND DESIGN (VERY IMPORTANT):
• Aggressive ice cracks
• Juicy splash impacts
• Deep bass hits synced to cuts
• No vocals
• No narration
RULES:
• Same Stanley cup model throughout
• Color swaps only (no design changes)
• Realistic physics
• No extra branding
• No text
Result :
Video #3: ITW style video (Using Sora 2 pro)
Here is the prompt -> : "Create a 12-second TV interview–style segment set on a modern lifestyle talk show.
The Stanley cup must keep the exact same shape, proportions, handle, lid, and logo placement as the reference image.
Only the color can change between shots.
No deformation, no redesign.
The tone should feel professional, cool, and accessible to a young audience.
SCENE FLOW:
0–2s (OPENING SHOT)
Wide studio shot.
Host and guest seated on modern chairs.
Bright, clean set with large windows and soft sunlight.
Stanley cups placed naturally on a small table between them.
2–4s
Cut to medium shot of the guest smiling while talking.
A colorful Stanley cup visible in the foreground.
Natural, relaxed body language.
4–6s
Cut to host nodding and smiling.
Another Stanley cup in a different color on the table.
Studio lights create soft reflections on the cup.
6–8s
Clean cut to close-up beauty shot of the Stanley cup:
• condensation
• handle detail
• ice visible inside
Feels like a broadcast cutaway.
8–10s
Return to two-shot.
Both host and guest laughing lightly.
Bright, positive, easy energy.
10–12s (CLOSING SHOT)
Hero shot of the Stanley cup centered on the table.
Warm sunlight flare.
Hold for a beat.
AUDIO DIRECTION:
• Soft studio ambience
• Light background music
• Natural laughter
• No scripted dialogue required
• No voiceover
IMPORTANT RULES:
• Same Stanley cup model as reference (color can be changed)
• Color variations allowed only
• No added graphics
• No lower thirds
• No text overlays"
Result :
Video #4: TV SHOW (using Sora2 pro)
Here is the prompt :
Use the provided Stanley cup image as the only visual reference.
Create a 12-second breaking-news-style advertisement, formatted like a live TV news broadcast.
Overall style:
Realistic television news show. Clean studio lighting. Professional but modern. Dynamic pacing. Feels like an actual breaking news segment, not a parody.
Scene 1 (0–3s | News studio intro):
A professional news presenter sits at a TV news desk in a modern broadcast studio. A subtle “Breaking News” style graphic appears on a large screen behind them.
The presenter looks directly into the camera and speaks with urgency and clarity, as if announcing an important update.
Scene 2 (3–6s | Cut to big screen):
The camera pushes slightly toward the large studio screen behind the presenter.
On the screen, a second presenter appears, standing in a bright, lifestyle environment. The screen-within-the-screen looks realistic, like a live remote feed.
Scene 3 (6–9s | Interview moment):
On the large screen, the second presenter interviews a young adult who is holding a Stanley cup upright at chest height.
The interview feels casual and authentic. The Stanley cup is clearly visible, stable, and unchanged.
Scene 4 (9–12s | Product focus close):
Cut closer to the screen content: the person with the Stanley cup smiles naturally, holding it confidently as if being asked about it on live TV.
The moment freezes briefly on a strong, clear visual of the Stanley cup, ending the segment like a news highlight.
Product consistency rules (critical):
The Stanley cup must remain identical in shape, size, proportions, handle position, lid, and straw design throughout the entire video.
No deformation, no warping, no logo changes, no floating objects, no condensation, no ice, no liquid effects.
Camera & motion rules:
Professional broadcast camera feel in the studio. Smooth, controlled movements. No cinematic effects. Clean cuts between scenes.
Mood & tone:
Urgent but polished. Informative, modern, slightly playful through format only. Feels like real television breaking news.
Result :
Video #5: Pick your best clips, and make a final edited video!








