Models · MarkTechPost

NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation Model Unifying Physical Reasoning, World Generation, and Action Generation

NVIDIA released Cosmos 3, an omnimodal world model that combines an autoregressive vision-language model reasoner with a diffusion-based generator for physical AI tasks. The system is designed to support physical reasoning, world generation, and action generation.
