Google Gemini Omni: New AI Agent Combines Text, Images & Video (2026)

A new Gemini Omni banner spotted in Google’s web build hints at a powerful multimodal AI agent with avatar support, and it may launch as soon as today’s Android show.

Contents

What is Gemini Omni?
AI Avatars and the “Likeness” feature
Could it launch at today’s Android show?
Why this matters

Google appears to be quietly building something significant inside Gemini. A newly discovered banner in Google’s web build reveals a feature called Gemini Omni, and based on what has been spotted, it looks like a full-fledged multimodal AI agent capable of working across text, images, and video simultaneously.

What is Gemini Omni?

Gemini Omni is shaping up to be more than just a chatbot upgrade. According to the leaked banner details, it will operate as an AI agent, meaning it can take actions, combine multiple media types, and work more autonomously than traditional AI assistants.

The three core capabilities revealed so far:

Text: conversational AI responses

Images: visual understanding and creation

Videos: video generation and editing

AI Avatars and the “Likeness” feature

One of the most intriguing aspects of Gemini Omni is its connection to AI Avatars, also known as the Likeness feature. This will allow users to insert themselves into different scenes, essentially placing a digital version of themselves inside AI-generated content.

Google has already announced that AI Avatars are coming to Gemini, and Gemini Omni is expected to be deeply integrated with this capability. The Likeness feature is anticipated to be strongly tied to mobile apps, working similarly to how the feature operated on Sora, OpenAI’s video generation platform.

“Gemini Omni will be an Agent that can combine text, images, and videos. Users will be able to add themselves to different scenes.”

Could it launch at today’s Android show?

The timing of this discovery is notable. The banner was spotted just ahead of Google’s Android show, raising the possibility that Gemini Omni could be officially announced or even launched during the event. While nothing is confirmed, the fact that it already appears in the live web build suggests it may be close to a public release.

Why this matters

If Gemini Omni lives up to what the leaked banner suggests, it would represent a major leap for Google’s AI ambitions. Most current AI tools handle text, image, or video in isolation. A true multimodal agent that combines all three while also letting users place themselves into generated scenes would put Google in direct competition with OpenAI’s Sora and other advanced generative AI platforms.

We will be watching the Android show closely. Stay tuned to thefyptt.com for updates as they happen.
