Bỏ qua đến nội dung chính
Back to home
AI 1 min read

NVIDIA Cosmos 3 Tops Artificial Analysis Leaderboards for Open Weights Models

NVIDIA's Cosmos 3 has secured the #1 spot in both Text-to-Image and Image-to-Video categories on the Artificial Analysis leaderboards. It is a family of omnimodal world models for Physical AI, unifying language, image, video, audio, and action.

Tier 1 · sources 81% confidence Reviewed
Sources x.com

NVIDIA Cosmos 3 has officially taken the lead on the Artificial Analysis leaderboards for open weights models. The model family now ranks first in two critical categories: Text-to-Image and Image-to-Video.

Designed as "Omnimodal World Models" for Physical AI, Cosmos 3 represents a significant step forward in unifying various modalities. It integrates language, images, video, audio, and action sequences within a single architecture. This capability is expected to drive advancements in robotics and AI systems that require a deep understanding of the physical world and real-world interactions.