Gemini Omni
by Unknown
About
Introduced at Google I/O 2026, Gemini Omni is Google DeepMind's unified multimodal AI video model, dubbed the 'Nano Banana for video'. It processes any combination of text, images, audio, and video inputs to generate high-quality, physics-grounded video content. Its standout feature is conversational multi-turn video editing, allowing users to seamlessly edit scenes, swap backgrounds, generate native audio, and maintain character consistency using AI avatars via simple chat instructions.
Features
- Any-to-Any multimodal video generation
- Conversational multi-turn video editing
- Grounded physics and realistic world understanding
- Consistent AI avatars and character preservation
- Native audio generation and background swapping
Supported Platforms
webmobile