Arcads is an AI-powered content creation platform that lets you generate product images, talking actor videos, B-roll clips, and full ad campaigns — without a studio, crew, or production budget.
This guide covers every feature available on the platform so you always know what's possible and where to find it.
🛠️ Tools
Tools are standalone utilities you can apply to any video or image you've generated — or use independently. Find them under See More on your dashboard.
Add Captions
Automatically generate and overlay subtitles onto any video. Captions are synced to the audio and styleable to match your brand. Ideal for social content watched without sound.
Transcribe
Convert the spoken audio from any video into a text transcript. Useful for reviewing script delivery, creating manual subtitles, or repurposing video content into written formats.
Translate Video
Localize any video into a different language automatically — including lip sync matched to the new audio. Select your target language, choose an accent if available, and Arcads generates the translated version. No re-filming required.
Text to Speech
Generate a voiceover from a written script using Arcads' AI voice library. Choose from dozens of voices across multiple languages and accents, or use a cloned voice for brand consistency.
Speech to Speech
Transform an existing audio recording into a different AI voice while preserving the timing and delivery pattern. Useful for voice swaps without re-recording the script.
Change Voice
Replace the voice in an existing video with a different AI voice. Useful for swapping accents, gender, or tone on already-generated videos without touching the visuals.
Change Speed
Speed up or slow down any video clip. Helpful for fitting a clip to ad platform duration limits (e.g., speeding up a 20s video to fit a 15s format).
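The speed needed to hit a target duration is simple arithmetic: divide the original length by the target length. The helper below is a hypothetical illustration, not an Arcads function. It only shows how the 20s to 15s example works out.

```python
def speed_factor(original_seconds: float, target_seconds: float) -> float:
    """Return the playback-speed multiplier that fits a clip into target_seconds."""
    if original_seconds <= 0 or target_seconds <= 0:
        raise ValueError("durations must be positive")
    return original_seconds / target_seconds


# The example from the text: a 20s clip squeezed into a 15s slot.
factor = speed_factor(20, 15)
print(f"{factor:.2f}x")  # prints "1.33x"
```

A factor above 1 speeds the clip up and a factor below 1 slows it down. Large factors tend to make speech sound unnatural, so small adjustments are usually the safer choice.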
Remove Background
Instantly remove the background from any image or video. The result is a clean cutout ready to be placed on a custom background or used in compositing — no manual masking needed.
Skin Enhancer
Improves the realism and quality of AI-generated faces. Adds natural skin detail and reduces the "AI look" — particularly useful for close-up talking-head content.
Upscale
Increase the resolution of any video using AI. Useful when your source footage is lower quality and needs to meet platform resolution requirements before publishing.
Hook Repurposer
Found a proven viral hook? Feed it in with your product details and the tool recreates it tailored to your brand. Great for testing multiple opening angles on a single script.
Text Overlay
Add text graphics on top of any video — titles, captions, CTAs, or branding. Customizable font, position, and timing.
Trim Video
Cut a video to a specific time range to isolate the best portion of a clip. Standard non-destructive editing — original file is preserved.
Resize Video
Change the aspect ratio or dimensions of any video (e.g., 16:9 → 9:16 for Reels/TikTok, 1:1 for feed). Essential for multi-platform distribution from a single generation.
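To see what an aspect-ratio change costs in pixels, the sketch below computes the largest crop of a source frame that matches a target ratio. It assumes a plain center crop. This guide does not say whether Arcads crops, pads, or regenerates the frame, and the function name is hypothetical.

```python
def center_crop_size(width: int, height: int, ratio_w: int, ratio_h: int) -> tuple[int, int]:
    """Largest crop of a width x height frame that has aspect ratio ratio_w:ratio_h."""
    if min(width, height, ratio_w, ratio_h) <= 0:
        raise ValueError("all dimensions must be positive")
    if width * ratio_h > height * ratio_w:
        # Source is wider than the target: keep full height, narrow the width.
        return (height * ratio_w // ratio_h, height)
    # Source is as tall as or taller than the target: keep full width, shorten the height.
    return (width, width * ratio_h // ratio_w)


print(center_crop_size(1920, 1080, 9, 16))  # (607, 1080)
print(center_crop_size(1920, 1080, 1, 1))   # (1080, 1080)
```

A 1920x1080 landscape frame keeps only about a third of its width when cropped to 9:16, so it helps to keep the subject centered in footage you plan to resize.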
Merge Layers
Combine multiple video or image layers into a single composited output. Used for adding B-roll overlays, product shots, or graphic elements on top of actor footage.
Stitch Videos
Join multiple video clips sequentially into one continuous video. Use this to combine an intro hook, main body, and outro CTA into a single finished file.
Extend Video
Lengthen any generated video by adding seamless continuation frames using AI. Useful when a clip is just slightly too short for your target duration.
Extract Frame
Pull a single still frame from any video. Great for generating thumbnails, using a specific moment as a reference image, or creating static ads from a video.
Camera Angle
Adjust or simulate a different camera angle on an existing image or video using AI. Repositions the shot without re-generating the whole scene from scratch.
Video Edit — Kling o3
A full AI-powered video editing interface driven by the Kling o3 model. Describe what you want changed in plain language — add, remove, or replace elements in a video — and the model applies the edit.
Video Edit — Grok
AI video editing powered by xAI's Grok model. An alternative to Kling o3 for instruction-based edits, with a different visual style and output characteristics.
Animate Actor — Wan 2.2
Takes a static image of an actor and animates it into a short video using the Wan 2.2 model. Gives life to still photos with natural-looking movement — without a full Talking Actor pipeline.
Animate Actor — Kling 2.6 Pro Motion Control
Animates an actor image using Kling 2.6 Pro with specific, directed motion control parameters. More precise than standard animation — useful when you want the actor to move in a particular way.
Animate Actor — Kling 3.0 Pro Motion Control
The highest-quality motion control option for actor animation. Powered by Kling 3.0 Pro — best for premium outputs that need highly realistic, directed movement.
Fashion Try-On
Visualize any clothing item on an AI model without production, photography, or physical samples. Upload the garment and Arcads places it on a model of your choice. Ideal for fashion and apparel brands.
Split Into Scenes
Automatically detect scene changes and split a video into separate clips.
Create Music
Generate original music from a text prompt with adjustable duration and instrumental mode.
🖼️ Image Models
Arcads integrates multiple leading AI image generation models. Each has different strengths — choose based on your visual style, use case, and desired output quality.
GPT Image 1.5
OpenAI's image generation model integrated into Arcads. Reliable for generating product shots, lifestyle backgrounds, and general ad visuals from text prompts.
GPT Image 2
The newer, higher-quality version of OpenAI's image model. Produces sharper, more detailed outputs that follow prompt instructions more closely than GPT Image 1.5.
Nano-Banana Pro
Arcads' own proprietary image generation model. Fast and optimized for UGC-style ad imagery — great for quick iterations and high-volume image workflows.
Nano-Banana 2
The updated version of Nano-Banana with improved realism and visual consistency across generations. A step up in quality for the same fast, UGC-optimized workflow.
Seedream 4.5
A high-quality image generation model with strong creative and stylized output capabilities. Well-suited for editorial, aspirational, and lifestyle aesthetics.
Seedream 5 Lite
A faster, lighter variant of Seedream 5. Balances quality and speed — ideal for higher-volume workflows where you need good results quickly.
Grok Image
Image generation powered by xAI's Grok model. Offers a distinct visual style and output aesthetic compared to GPT or Seedream — useful for variety or when other models don't quite hit the look you want.
UGC Studio
A specialized image tool powered by the Soul engine. Generates authentic UGC-style images — selfie-aesthetic, natural lighting, lifestyle product contexts — rather than polished studio shots. Best for brands that want human, creator-feel imagery.
🎬 Video Models
These models generate B-roll clips, cinematic product shots, and movement-based video from your images or text prompts. Use them independently or combine with the Talking Actor pipeline for full ad productions.
Sora 2
OpenAI's Sora video generation model, integrated directly into Arcads. Generates high-quality videos from text prompts with strong scene consistency and photorealistic output.
Sora 2 Pro
The Pro variant of Sora 2. Delivers longer durations, higher-resolution output, and more cinematic quality. Best for hero shots and premium ad placements.
Veo 3.1
Google DeepMind's video generation model. Produces cinematic, high-fidelity video from text or image prompts. Known for strong temporal consistency and highly realistic motion — particularly good for lifestyle and environmental scenes.
Kling 2.6 Pro
A leading video generation model known for realistic motion and photorealism. A strong choice for lifestyle and product-focused video ads.
Kling 3.0 Pro
The flagship Kling model. Highest quality output in the Kling line — improved motion realism, better scene coherence, and more cinematic results. Recommended for publication-ready B-roll and product content.
Kling 3.0 4K
Kling 3.0 at 4K resolution. For content that requires the highest possible visual fidelity — premium ad placements, connected TV, or large-format display.
Kling 3 Pro Turbo
A speed-focused Kling 3 Pro variant. Generates 1080p videos quickly from text or a starting image.
Seedance 1.5
Optimized for fluid, dynamic video generation. Particularly good for product movement, lifestyle scenes, and short-form ad content. A reliable workhorse for B-roll production.
Seedance 2.0
The updated Seedance model with improved motion quality and better handling of complex scenes. Use this over 1.5 for final outputs.
Happy Horse
A specialized video generation model within Arcads optimized for stylized and animated-style content. Best for creative, non-photorealistic outputs where you want a distinct visual character.
Grok Video
xAI's Grok-powered video generation. An alternative model option for users who want a different output style and aesthetic from the rest of the lineup.
🎙️ Talking Actor Models
Talking Actor models animate an AI character to deliver a spoken script on camera — lip-synced to the audio, with expressions and movement. These are the core of Arcads' ad creation pipeline.
Arcads 1.0
The original Arcads talking head model. Reliable and credit-efficient — best for volume production where speed and cost matter more than maximum realism. Good starting point for testing scripts before committing to higher-quality renders.
Audio-Driven
Generates a talking actor video where lip sync is driven by a provided audio file rather than a text script. Ideal when you already have a recorded voiceover and want to attach it to an actor without re-generating audio.
Omnihuman 1.5
The most advanced Talking Actor model on Arcads. Produces highly realistic full-body motion, gestures, and facial expressions — not just lip sync. Closest to real human video. Best for polished, publication-ready ads.
💡 Not sure which model to use? Compare all three Talking Actor models here.
🎭 Actors & Voices
Actor Library
Browse a diverse pre-built library of AI actors across ages, ethnicities, styles, and settings. Filter by category to find the right fit for your brand. Any actor in the library can be animated with the Talking Actor models.
Custom Actor
Create your own AI actor from a single headshot image. Upload a photo, and Arcads generates an animatable actor in your chosen style — saved to your account for reuse across multiple videos.
Clone Yourself
Create a hyper-realistic digital twin of yourself — replicating both your face and your voice. Once cloned, generate unlimited videos without re-recording anything.
Voice Library & Voice Cloning
Choose from a library of AI voices across multiple languages and accents. Pro users can also clone any voice — upload a short audio sample and Arcads replicates it for consistent use across all your content.
Gestures
Add expressive hand movements and body language to any AI actor using preset gesture templates (thumbs up, pointing, open hands, and more). Makes actors appear more dynamic and engaging on screen.
📦 Product Features
Product Upload & Management
Import any product image into Arcads as the base for image generation, video creation, and product showcase content. Arcads detects the product automatically — no manual cropping required.
Show Your Product
Place your product directly into any generated video — held by an actor, displayed on a surface, or shown in a lifestyle context.
Show Your App
Generate content showing your app's UI alongside a talking actor or voiceover — no screen recording software or editing skills needed. Built for SaaS and mobile app companies.
Unboxing Videos
Generate unboxing-style product reveal videos from your product image — no physical box or filming required. Ideal for e-commerce product launch campaigns.
Actor Replacement
Swap the actor in an existing video with a different one while preserving the script, timing, and background. Great for A/B testing the same creative with different faces.
🎯 Presets
Presets are pre-configured workflows that combine multiple Arcads features into a single, guided creation flow. They're the fastest way to produce a specific type of content without configuring each step manually.
Sora 2 Actors
Combines Sora 2 video generation with AI actors in a pre-configured flow. Simplifies the process of creating high-quality actor-led video ads using Sora's capabilities.
Unboxing POV
Generates a first-person, point-of-view unboxing video. Great for e-commerce brands wanting product reveal content that feels authentic without physical filming.
Product Showcase
A preset focused on highlighting a product with close-ups, angles, and feature callouts. Optimized for e-commerce and DTC product ads where the product is the hero.
Camera Movement
Adds simulated camera motion (pan, zoom, dolly) to a static or generated video. Makes content feel more cinematic and professionally shot.
Gameplay Ad
A preset for generating mobile game advertising content. Typically overlays gameplay footage with a talking actor or hook-style intro to drive installs.
⚡ Workflows — Automate at Scale
Workflows let you connect multiple Arcads features into a single automated pipeline. Build once, generate at scale — instead of creating assets one by one, you run the whole sequence automatically.
What you can automate:
Take one script and generate videos across 10 actors in 2 languages (20 clips in one run)
Turn one winning ad into 20 UGC variations
Build a complete brand film from storyboard images using Start Frame & End Frame
Create a creator clone and scale unlimited content from it automatically
And many more
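The "10 actors in 2 languages" item is a cross product: one clip per actor and language pair. The sketch below uses placeholder actor names and language codes, and the job dictionaries are a made-up shape for illustration, not an Arcads format.

```python
from itertools import product

# Placeholder names: swap in the actors and languages you actually use.
actors = [f"actor_{i}" for i in range(1, 11)]  # 10 actors
languages = ["en", "es"]                       # 2 languages

# One job per (actor, language) pair.
jobs = [{"actor": a, "language": lang} for a, lang in product(actors, languages)]

print(len(jobs))  # 20
```

Adding a third language would raise the count to 30, which is worth checking against your credit balance before running a large batch.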
How it works:
Click New Workflow at the top left of the dashboard, or go directly to https://app.arcads.ai/flow
Add nodes (inputs, models, tools) and connect them in sequence
Run the workflow — everything generates automatically
Pre-built workflow templates are available for the most common use cases. Open one, swap in your assets, and run.
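As a mental model for nodes connected in sequence, the sketch below chains a few stand-in steps so that each one receives the previous step's output. It is only an analogy. The workflow builder is a visual editor, and every name here (`run_workflow`, the node labels, the string-building lambdas) is hypothetical.

```python
from typing import Any, Callable

# Each "node" is a named step that takes the previous step's output.
Node = tuple[str, Callable[[Any], Any]]


def run_workflow(nodes: list[Node], initial_input: Any) -> Any:
    """Run nodes in order, feeding each node's output into the next one."""
    value = initial_input
    for name, step in nodes:
        value = step(value)
        print(f"{name}: {value}")
    return value


# Stand-in steps that only build strings; real nodes would call models or tools.
workflow: list[Node] = [
    ("script", lambda text: text.strip()),
    ("voiceover", lambda script: f"audio({script})"),
    ("talking_actor", lambda audio: f"video({audio})"),
    ("captions", lambda video: f"captioned({video})"),
]

result = run_workflow(workflow, "  Try it today  ")
```

This covers only a single linear chain. A workflow that fans one script out to many actors would repeat the chain once per actor.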
💳 Credits
Generating content with the features listed above consumes credits. Credit usage varies by model: higher-quality models (Kling 3.0 Pro, Sora 2 Pro, etc.) use more credits per generation than faster ones.
The exception is image generation, which is free (up to 100 per day). Video and Talking Actor generations consume credits.
🌍 Languages
Arcads supports content creation across a wide range of languages by default. The Translate Video and Text to Speech features have recently extended that support. Pro accounts get additional voice customization through the ElevenLabs integration.
