NVIDIA Announces Cosmos 3: Omnimodal World Models for Physical AI
NVIDIA Cosmos 3 is a unified world model capable of understanding and generating language, images, video, audio, and actions for robotics.
Sources x.com
Startup DAIMON Robotics has announced Daimon-Infinity, which it describes as the world's largest multimodal tactile dataset, with the aim of giving robots a sensitive sense of touch.
Wetour Robotics has introduced Spatial Intent Fusion, a solution that combines spatial, visual, and gesture data to control physical devices with latency under 100 ms.
At the Physical AI Hackathon, the 'Panda Master' project stood out by combining the ReachyMini robot, a GPT model, and an Agilex robotic arm to converse with users and draw 'fortunes' for them.