MiniMax M3 is here: an open-weight model that combines a one-million-token context window, native multimodality, and coding performance that benchmarks above GPT-5.5 and Gemini 3.1 Pro. The weights will be publicly available. The impact on the AI market’s pricing and competitive dynamics is significant.
Key Takeaways
- MiniMax M3 is the first open-weight model to combine 1M-token context, native multimodality, and top-tier coding performance simultaneously.
- Its MiniMax Sparse Attention architecture cuts compute to one-twentieth and delivers 9x faster processing for long inputs.
- On SWE-Bench Pro, M3 scores 59%, placing it above GPT-5.5 and Gemini 3.1 Pro, just behind Claude Opus 4.7.
What MiniMax M3 Actually Delivers
The combination MiniMax claims for M3 does not currently exist in the open-weight ecosystem. A one-million-token context window. Native multimodality (text and image input at minimum). Coding benchmarks that compete with leading proprietary models. Until now, getting any one of these three attributes in an open-weight model meant sacrificing the others.
The technical foundation is a new attention variant called MiniMax Sparse Attention (MSA). Standard attention mechanisms compute relationships between every pair of tokens in a context, making the compute cost prohibitive for very long contexts. MSA calculates attention only for selected segments, reducing compute to one-twentieth of the standard method. Input processing for long sequences speeds up by a factor of nine.
On SWE-Bench Pro, the benchmark for autonomous software engineering tasks, M3 scores 59 percent. GPT-5.5 and Gemini 3.1 Pro score below that. Claude Opus 4.7 scores above. For a model whose weights will be freely available to anyone, this positioning is unprecedented.
The model is immediately available via API. Weights will be published shortly, allowing any organization to deploy M3 on its own infrastructure without relying on MiniMax’s servers. This distinction matters: an open-weight model can be fine-tuned, modified, and deployed in environments where external API dependencies are unacceptable.
The practical implications of a one-million-token context at this performance level are substantial. Complete codebase analysis. Processing full legal or scientific documents without chunking. Extended conversational memory over very long sessions. These are workflows that enterprise agent deployments have been waiting for through several development cycles.
The Geopolitical and Competitive Dimension
MiniMax is not a new entrant. M3 follows M2.5 and M2.7 in an accelerating release cadence. But M3 represents a scale shift: this is not a model competitive on one dimension, it is a model that simultaneously rivals proprietary leaders across multiple dimensions in a single open-weight package.
The timing is pointed. The AI oversight order signed by Trump this week was built largely on the argument that US AI regulation would hamper American competitiveness against China. M3 illustrates precisely the competitive landscape that argument invokes: a Chinese lab publishing a high-performance open-weight model that matches or beats the best American proprietary systems and makes it freely available globally.
The open-weight strategy is itself a competitive weapon. By publishing weights, MiniMax enables global adoption without friction, particularly in regions where access to US APIs is restricted, expensive, or regulated. The ecosystem of derivative applications built on M3 over the coming months represents value that proprietary model providers cannot control or recapture.
For AI companies approaching public markets, the rise of high-quality open-weight models is a valuation variable worth watching. If reference-class models become commodities, the premium applied to proprietary closed models compresses. The question is no longer “can open-weight be good?” but “at what point is it good enough to replace a paid API for most use cases?”
DeepSeek R1 produced a similar shock in early 2025. M3 follows the same logic but with a wider surface: coding, long context, and multimodality in a single open-weight model. The dynamic of price compression and standard elevation in AI has shown no sign of slowing.
Also on Horizon:
- Trump’s New AI Oversight Order: What Changed
- Alphabet Raises $80 Billion to Finance Its AI Infrastructure
- Claude Mythos Secures Critical Infrastructure in 15 Nations
Short and Medium-Term Implications
In the short term, M3 gives developers a credible open-weight option for use cases previously limited to expensive proprietary models. Long-context workflows and autonomous coding tasks benefit immediately. For organizations running high-volume AI workloads, self-hosting M3 instead of paying API fees represents a meaningful cost reduction.
Over the next six months, the publication of M3’s weights will fuel a wave of fine-tuning and sector-specific adaptation. Specialized models for law, medicine, finance, and industrial code built on M3 as a foundation will emerge across the community. This ecosystem of derivative models is the real strategic value of an open-weight release. It compounds in ways the initial benchmark numbers do not capture.
The open question is real-world performance at full context length. Benchmarks measure performance on specific tasks. Production use of one million tokens (sustained coherence, stable attention across the full window, consistent performance) is a different challenge. Community testing in the coming weeks will be the real evaluation.
The open-weight competition between US players (Meta with Llama) and Chinese labs (MiniMax, DeepSeek) is intensifying. In an environment where US regulatory oversight of advanced models is pulling back, the free publication of high-performance models by non-US actors raises governance questions that neither Washington nor Brussels has fully resolved.
Follow the story on Horizon.


