Gemini Omni API Model Guide

TL;DR

Flash-oriented positioning optimized for quick generation, low latency, and high iteration speed.
Well suited to prototyping, social short-form content, and batch output without a heavyweight production setup.
Uses async video task workflows: submit first, then poll task status for results.

Core Capabilities

Low-latency output：Built for workflows that need fast feedback and frequent prompt changes.
Short-form batch generation：Fits short videos, multi-variant testing, and social creative production.
Unified video task contract：Reuses the shared ToAPIs video task interface for easier production handoff.

When to Use

When validating concepts, scripts, or motion directions quickly.
When producing social short-form video or multiple creative variants at scale.
When low-latency feedback is important for rapid prompt iteration.

When Not to Use

When polished showcase quality and maximum visual stability matter more than speed.
When you require synchronous immediate responses instead of async task workflows.

Runtime Behavior

Runs as an async video task flow, returning task state first rather than the final video file.
Best suited to short durations, quick feedback cycles, and repeated parameter tuning.
Fits server-side queues and production workflows via shared video-task polling patterns.

Minimal Request

{
  "model": "gemini_omni",
  "prompt": "A cinematic shot of a cat running through neon rain",
  "aspect_ratio": "16:9",
  "duration": 6,
  "resolution": "720P"
}

Minimal Response

{
  "id": "video_01JZEXAMPLE",
  "object": "generation.task",
  "model": "gemini_omni",
  "status": "queued",
  "created_at": 1779247407
}

Key Parameters

Parameter	Type	Required	Default	Range	Description
model	string	Yes	gemini_omni	-	Model identifier. Use the fixed value `gemini_omni`.
prompt	string	Yes	-	-	Main instruction describing the subject, action, scene, and visual style.
aspect_ratio	string	No	16:9	16:9	9:16
duration	integer	No	6	6	10
resolution	string	No	720P	720P	Output resolution. Currently only `720P` is supported; if omitted, the default value is used.

Common Errors

HTTP	Code	Trigger	Fix Action	Retry Policy
400	invalid_request_error	Request payload misses required fields or has invalid field types.	Validate model, mode, and input schema before request.	Retry only after fixing payload; avoid blind retries.
401	authentication_error	Missing Authorization header or invalid API key.	Verify Bearer token format and API key scope.	Retry after auth fix; rotate key if repeated failures continue.
429	rate_limit_exceeded	Request rate, concurrency, or current quota hits upstream rate limiting.	Apply exponential backoff first, then review request rate, concurrency, and quota usage.	Use exponential backoff (e.g., 1s/2s/4s) with jitter; if it persists, reduce submission pressure.
500	internal_error	Transient upstream error or internal processing failure.	Capture request id and trigger retry workflow.	Retry 2-3 times with short delays; escalate if persistent.

FAQ

What is Gemini Omni best for?
It is best suited to prototyping, social short-form video, batch generation, and fast creative iteration where low latency matters.
Image or video model error: invalid apitype: -1
This usually means the request was sent to the wrong endpoint. Image and video models typically do not use the chat endpoint. Instead, submit the documented HTTP task request and poll the task status endpoint for results. Check the actual request code, URL, and payload first.
An image or video task failed, but the user was still charged
Ask the user for the task log or screenshot first and check whether input or output token usage appears. If token accounting shows up, the request was likely sent through a chat endpoint instead of the proper media workflow. Image and video models usually run as async HTTP task APIs: submit the task first, then poll by task id according to the relevant docs.

Gemini Omni Full Guide (Markdown)

Gemini Omni API Model Guide

TL;DR

Core Capabilities

When to Use

When Not to Use

Runtime Behavior

Minimal Request

Minimal Response

Key Parameters

Common Errors

FAQ

Related APIs