AI Jun 1, 2026 1 min read

MiniMax M3: Open-weight 1M context model challenges proprietary leaders

MiniMax has released M3, an open-weight model supporting a 1 million token context window, aiming to compete with leaders like Gemini 1.5 Pro and GPT-4o.

Tier 1 · sources 81% confidence Reviewed

Sources the-decoder.com

MiniMax, a leading Chinese AI unicorn, has unveiled M3, a large language model (LLM) capable of processing context up to 1 million tokens. Notably, M3 is released as an open-weight model, providing developers and researchers with flexible access to model weights—a strategic contrast to proprietary giants like OpenAI's GPT-4o and Google's Gemini 1.5 Pro.

Context

M3 utilizes an advanced Mixture-of-Experts (MoE) architecture, designed to optimize computational efficiency while handling massive datasets. The 1-million-token context window allows the model to process entire legal repositories, dozens of books, or complete codebases in a single query. Previously, high-fidelity long-context processing was primarily the domain of Google's Gemini 1.5 Pro.

Why it matters

The release of M3 highlights the rapid progress of Chinese AI firms in closing the gap with Silicon Valley. By opting for an open-weight strategy, MiniMax is positioning itself as a key alternative for developers who require massive context windows without being tethered to Big Tech ecosystems. Internal benchmarks suggest that M3 maintains high retrieval accuracy (needle-in-a-haystack) across its entire context range, potentially revolutionizing Retrieval-Augmented Generation (RAG) workflows for complex tasks.