Models · TechCrunch AI ·

Google’s Gemini Omni turns images, audio, and text into video — and that’s just the start

Google’s Gemini Omni turns images, audio, and text into video — and that’s just the start

Google's Gemini Omni is a new multimodal model that can reason across text, images, audio, and video to generate and edit videos through conversation, beginning with Omni Flash.

Read the full story at TechCrunch AI →