# Google 推出 Gemini Omni，一款專注於電影級影片生成與對話式編輯的多模態 AI 模型。

*genai · news · 2026-05-21 · The Economic Times*

## Key points

- Google 推出 Gemini Omni，一款用於電影級影片生成與對話式編輯的多模態 AI 模型。
- Gemini Omni 允許使用自然語言指令編輯影片，並在編輯過程中維持場景連貫性。
- 所有使用 Gemini Omni 生成的 AI 影片均包含 Google 的 SynthID 水印，以確保合成媒體的可追蹤性。
- Gemini Omni Flash 將初步在 Gemini 應用程式、Google Flow、YouTube Shorts 及 YouTube Create 上提供。

Synopsis Google has launched Gemini Omni, a multimodal AI model focused on cinematic video generation and conversational editing. The platform can create videos using text, images, audio and video prompts with improved realism and context awareness. Google is positioning Omni as its next big push in AI-powered content creation. Google has introduced Gemini Omni, a new family of multimodal AI models aimed at transforming how users create and edit video content, marking the company’s latest push to expand artificial intelligence beyond text-based assistants and into full-scale creative production workflows. The first model in the lineup, Gemini Omni Flash, is designed to generate cinematic videos using combinations of text, images, audio and video prompts. Unlike traditional AI video tools that largely rely on isolated prompts, Google says Omni can reason across multiple forms of input simultaneously to produce more coherent and context aware outputs. The launch comes as competition in generative AI rapidly intensifies, with companies racing to build platforms capable of handling increasingly complex creative and enterprise tasks. AI-generated video has emerged as one of the fastest-growing segments within the broader AI ecosystems, attracting interest from creators, marketers, studios and enterprises seeking faster and more scalable production pipelines. A key feature highlighted by Google is Omni’s conversational editing capability. Users can modify videos through natural language instructions such as changing environments, adjusting camera movements, adding visual effects or transforming artistic styles while maintaining continuity across scenes. The system also supports interactive editing allowing users to refine outputs across multiple prompts without restarting the workflow. Google says the model demonstrates stronger “world understanding", enabling more realistic rendering of motion, lighting and environmental interactions. The company claims the system better interprets concepts such as gravity, movement and spatial consistency, areas that have traditionally remained challenging for generative video models. Live Events Gemini Omni also builds on the momentum created by Google’s widely discussed AI image model “Nano Banana” officially known as Gemini Flash Image. The tool gained traction for its conversational image editing capabilities, allowing users to generate and modify visuals using natural language prompts while maintaining character consistency and realism. Industry observers view Gemini Omni as Google’s attempt to extend the same intuitive creative workflow from static images into full scale video generation and editing. The platform additionally supports reference-based generation, allowing users to upload sketches, images, existing footage or audio clips that can then be transformed into stylised or photorealistic videos. To address concerns around synthetic media and deepfakes, all the AI-generated videos created through Gemini Omni will include Google’s SynthID watermarking technology. Gemini Omni Flash will initially roll out across the Gemini app, Google Flow, YouTube Shorts and YouTube Create, with developer and enterprise API access expected later. As AI competition increasingly moves beyond chatbots and search into creative production, Gemini Omni signals Google’s ambition to become a major player in AI-powered media creation. With conversational video editing, multimodal generation and tighter integration across YouTube and Gemini products, the company is positioning AI not just as an assistant, but as a full-scale creative engine for the next phase of digital content creation. Nominate for ET AI Awards Disclaimer Statement: This content is authored by a 3rd party. The views expressed here are that of the respective authors/ entities and do not represent the views of Economic Times (ET). ET does not guarantee, vouch for or endorse any of its contents nor is responsible for them in any manner whatsoever. Please take all steps necessary to ascertain that any information and content provided is correct, updated, and verified. ET hereby disclaims any and all warranties, express or implied, relating to the report and any content therein. An unusual skill shortage that’s stalling govt’s PNG push Is Air India the cocktail turning bitter for Tatas? Can GIFT City open doors for Indians missing Korea, Taiwan AI rally? After Trump tariff, is Indian aviation next US target? How this man proved M&As need not be disasters even if tough Stock Radar: Laurus Labs stock hits a fresh record high in May 2026; will the rally continue? 1 2 3

**Companies:** Google
**Countries:** United States

[Read the full story on The Economic Times](https://economictimes.indiatimes.com/ai/ai-insights/google-unveils-gemini-omni-to-push-ai-beyond-chatbots-into-full-scale-video-creation/articleshow/131245633.cms)

---

Canonical: https://newsio.io/zh-TW/n/7d488674-36b0-43b5-97dc-600d9096fc01/google-gemini-omni-ai-gemini-omni-flash
Summarized by Newsio from The Economic Times. https://newsio.io/how-it-works