Hi everyone, it's really great to be here. Over the past year, AI capabilities have leaped forwards. We now have agents that can plan and act on our behalf, and Artificial General Intelligence is just a few years away. Today, I'm excited to share the progress we've made towards building AGI. Last year, I outlined our vision of extending Gemini's incredible multimodal capabilities to become a world model: AI that can understand and simulate the world. This is a crucial aspect of achieving AGI, and will be important for everything from building AI assistants to training robots. Now, we're taking the next big step.
I'm excited to announce Gemini Omni, our new model that can create anything from any input. It combines Gemini's intelligence with the best of our generative media models for a new level of world understanding, multimodality, and editing. Models like Veo, Nano Banana, and Genie are able to create extremely realistic videos, images, and interactive simulations. Although not perfect, they already demonstrate some impressive notions of intuitive physics.
And with Omni, we've now made even more progress. It's a step-change in simulating things like kinetic energy and gravity. Previous systems would have found these concepts difficult. Gemini's world knowledge and reasoning really shine in Omni. It can translate complex ideas into highly accurate videos. So for example, you can give it a simple prompt like "make a claymation explainer of protein folding" and get this:
(Narrator in claymation video): Proteins start as chains of amino acids. They fold into patterns like the alpha helix and flat sections called beta sheets, forming a perfect three-dimensional shape.
Demis Hassabis: But the initial generation is just the start. The creative process is rarely a single step, it's usually iterative. Just like Nano Banana redefined image editing, Omni gives you a more natural way to edit video with conversational language.
What's really cool is you can give it your own videos (for example, this selfie) and change reality in a really fun way. You can easily adjust the details and style, or even add elements, and the whole scene morphs to reflect your new idea. A simple circle turns into a black hole, or an evening stroll comes to life. Anything becomes a canvas for creating entirely new realities. Let's take a look at what Omni can do.
We're starting with video, but over time, Omni will be able to generate any output from any input. This was always our goal with Gemini, and why we built it to be multimodal from the very start. It was a hard path, but the foundation is now paying off. Today, we're launching the first model in the Omni family: Gemini Omni Flash. It's now available across our products, and you'll hear more about this later. We're excited about the progress we're making, and we'll be able to share more about Omni Pro soon.
We can't wait to see what you create.