At Google I/O 2026, we announced our latest models: Gemini Omni and the Gemini 3.5 family of models.
Gemini Omni is our new model that can create anything from any input, starting with video. With Omni, you can combine images, audio, video and text as input and generate high-quality videos grounded in Gemini’s real-world knowledge. You can also easily edit your videos through conversation.
Then there’s Gemini 3.5, our latest family of models combining frontier intelligence with action. This represents a major leap forward in building more capable, intelligent agents. We’re kicking off the series by releasing 3.5 Flash. It delivers frontier performance for agents and coding, excelling at complex long-horizon tasks that deliver real-world utility.
To give you a clearer understanding of Gemini Omni and Gemini 3.5 Flash, here are 11 demos of what they can help you do.
Gemini Omni
Edit your videos through conversation. One capability that makes Omni special is that it gives you an easier way to edit video — with natural language. Every instruction builds on the last. Your characters stay consistent, the physics hold up and the scene remembers what came before. That means you can transform the world around you. Change specific things, or change everything. Your video becomes the starting point for something you never could have filmed yourself.

