NVIDIA launches Vera Rubin platform — processing trillion-parameter models at 400 tokens per second
NVIDIA's new Vera Rubin platform, which combines NVL72 and Groq 3 LPX, makes it possible to run agentic workloads on massive mixture-of-experts (MoE) models without sacrificing latency.