About

Introduced at Google I/O 2026, Gemini Omni is Google DeepMind's unified multimodal AI video model, dubbed the 'Nano Banana for video'. It accepts any combination of text, image, audio, and video inputs and generates high-quality, physics-grounded video. Its standout feature is conversational, multi-turn video editing: through chat instructions, users can edit scenes, swap backgrounds, generate native audio, and keep characters consistent with AI avatars.

Features

Any-to-any multimodal video generation
Conversational multi-turn video editing
Grounded physics and realistic world understanding
Consistent AI avatars and character preservation
Native audio generation and background swapping
