Overview
The Video Studio API enables you to convert static avatar images into dynamic video clips with natural motion, facial animations, and optional audio synchronization. Perfect for creating engaging content, animated presentations, and character-driven media.Base URL
Core Endpoints
| Method | Endpoint | Description |
|---|---|---|
| POST | /videos/from-image | Generate video from avatar image |
| GET | /videos/{videoId} | Get video generation status |
| GET | /videos | List your videos |
| POST | /videos/{videoId}/add-audio | Add audio track with lip-sync |
| PATCH | /videos/{videoId} | Update video metadata |
| DELETE | /videos/{videoId} | Delete video |
Video Object
Pricing
Basic Video Studio
- Base: 30 credits (includes first 5 seconds)
- Additional: +6 credits per second after 5s
- Quality: 720p, 30fps
- Max Duration: 10 seconds
Reality Lab
- Base: 20 credits (includes first 5 seconds)
- Additional: +4 credits per second after 5s
- Quality: 1080p, 30/60fps
- Max Duration: 30 seconds
Quick Start
Processing Times
| Duration | Tier | Typical Processing |
|---|---|---|
| 3-5s | Basic | 20-35 seconds |
| 6-10s | Basic | 40-75 seconds |
| 3-5s | Reality Lab | 35-60 seconds |
| 10-15s | Reality Lab | 90-150 seconds |
| 15-30s | Reality Lab | 3-5 minutes |
Motion Styles
| Style | Description | Best For |
|---|---|---|
subtle | Gentle breathing, soft blinks | Professional portraits, calm scenes |
moderate | Natural head movements, expressions | Conversations, introductions |
dynamic | Full range motion, dramatic expressions | Action content, music videos |
Video Formats
- MP4 (H.264)
- WebM (VP9)
- MOV (ProRes)
Best compatibility - Recommended for most uses
- Codec: H.264
- Container: MP4
- Compatibility: All browsers, mobile devices
- File size: Medium
Error Responses
invalid_image- Avatar image not found or inaccessibleinvalid_duration- Duration out of allowed rangeinsufficient_credits- Not enough creditsprocessing_failed- Video generation failedavatar_not_completed- Source avatar still processing
Polling for Completion
Adding Audio with Lip-Sync
Webhooks
Subscribe to video completion events:video.processing- Video generation startedvideo.completed- Video ready for downloadvideo.failed- Generation failed
Best Practices
Optimize Costs
Optimize Costs
- Keep videos under 5 seconds to avoid per-second charges
- Use Basic tier for testing, Reality Lab for production
- Batch similar videos for workflow efficiency
- Reuse successful configurations
Input Quality
Input Quality
- Use high-resolution avatars (1024x1024+)
- Ensure clean, well-lit images
- Center subject in frame
- Avoid heavily cropped faces
Performance
Performance
- Generate during off-peak hours for faster processing
- Use webhooks instead of polling for long videos
- Download and cache completed videos
- Compress for web delivery if needed
Rate Limits
| Tier | Concurrent Generations | Daily Limit |
|---|---|---|
| Free | 1 | 10 videos |
| Pro | 3 | 100 videos |
| Enterprise | 10 | Unlimited |