Hugging Face has introduced a 30B-A3B reasoning model that reaches gold-medal-level results on the Physics (IPhO) and Mathematics (IMO/USAMO) Olympiad benchmarks.
Details
The model reaches gold-medal-level results on IPhO directly, and on IMO/USAMO with the help of a self-verification and refinement mechanism applied at test time. The result comes from a simple, unified "scaling recipe" for proof search, which lets the model solve complex problems that demand high-level logical reasoning.
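To make the idea of test-time self-verification and refinement concrete, here is a minimal sketch of a generic generate, verify, refine loop. It is not the model's actual recipe, which the announcement does not spell out. The function `verify_and_refine`, its parameters, and the toy stand-ins (`toy_generate`, `toy_verify`, `toy_refine`) are all hypothetical names chosen for illustration; in a real system each callable would wrap a call to a language model.

```python
from typing import Callable, Tuple


def verify_and_refine(
    problem: str,
    generate: Callable[[str], str],
    verify: Callable[[str, str], Tuple[bool, str]],
    refine: Callable[[str, str, str], str],
    max_rounds: int = 4,
) -> Tuple[str, bool, int]:
    """Generic self-verification loop (illustrative, not the published method).

    Returns (final_candidate, accepted, refinement_rounds_used).
    """
    candidate = generate(problem)
    for round_idx in range(max_rounds):
        accepted, critique = verify(problem, candidate)
        if accepted:
            return candidate, True, round_idx
        # Feed the verifier's critique back into the next attempt.
        candidate = refine(problem, candidate, critique)
    # Budget exhausted: check the last refinement once more.
    accepted, _ = verify(problem, candidate)
    return candidate, accepted, max_rounds


# Toy stand-ins so the sketch runs without any model (assumptions, not real APIs).
def toy_generate(problem: str) -> str:
    return "53"  # a deliberately wrong first attempt


def toy_verify(problem: str, candidate: str) -> Tuple[bool, str]:
    expected = sum(range(1, 11))  # the toy "problem": 1 + 2 + ... + 10
    value = int(candidate)
    if value == expected:
        return True, ""
    return False, "too low" if value < expected else "too high"


def toy_refine(problem: str, candidate: str, critique: str) -> str:
    step = 1 if critique == "too low" else -1
    return str(int(candidate) + step)


if __name__ == "__main__":
    print(verify_and_refine("Sum the integers 1..10", toy_generate, toy_verify, toy_refine))
```

The loop spends extra compute at inference time rather than adding parameters: a larger `max_rounds` buys more chances to repair an answer, at the cost of more verifier and refiner calls.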
Why It Matters
That large language models (LLMs) can now perform at the level of elite competitions such as the IMO and IPhO marks a major advance in AI's logical reasoning capabilities. For the AI research and development community in Vietnam, it is evidence that optimizing and scaling search can lead to breakthroughs without relying solely on ever-larger parameter counts.