Generate high-quality videos from text descriptions with Google’s Veo 3.1 Fast model. Supports multiple resolutions, durations, and optional audio generation.
Property Value Provider Google Model Veo 3.1 Fast Capability Text to Video Base Cost 200,000 micro-cents/second ($0.20/sec) Processing Time ~240 seconds
Request Body
Model slug. Use google/veo-3.1-fast/text-to-video for text-to-video generation.
Input parameters for text-to-video generation. Text description of the video to generate (max 4000 characters).
Video aspect ratio. Default: 16:9. Options: 16:9, 9:16.
Output resolution. Default: 720p. Options: 720p, 1080p, 4k.
Video duration in seconds. Default: 8. Options: 4, 6, 8.
Enable audio generation. Default: true. Options: true, false.
Seed for reproducible results.
HTTPS URL to receive a webhook notification when the job completes or fails.
Pricing
Base cost: 200,000 micro-cents per second ($0.20/sec)
finalCost = baseCost × duration × resolution × has_sound
Factor Option Multiplier Duration 44x 66x 88x Resolution 720p1x 1080p1x 4k1.5x Sound false1x true1.5x
Default cost: 8 seconds, 720p, with sound = 200,000 × 8 × 1 × 1.5 = 2,400,000 micro-cents ($2.40)
Response
Unique identifier for the submitted job.
Initial job status. Always "pending" on successful submission.
ISO 8601 timestamp of the estimated completion time.
The cost of the job in micro-cents.
Code Examples
curl -X POST https://api.muvi.video/v1/jobs/submit \
-H "Authorization: Bearer $PIXELBYTE_API_KEY " \
-H "Content-Type: application/json" \
-d '{
"model": "google/veo-3.1-fast/text-to-video",
"input": {
"prompt": "A golden retriever running through a sunlit meadow",
"aspect_ratio": "16:9",
"resolution": "1080p",
"duration": "8",
"has_sound": "true"
}
}'