Models · TechCrunch AI ·
Google’s Gemini Omni turns images, audio, and text into video — and that’s just the start
Google's Gemini Omni is a new multimodal model that can reason across text, images, audio, and video to generate and edit videos through conversation, beginning with Omni Flash.