MiniMax M2.7: Open-Source AI Agent Matches Top Proprietary Models

MiniMax's M2.7 is a free, open-source AI agent model that rivals GPT-5.3 and Claude Opus on real-world coding tasks. Here's what it means.

AI Tutorials · · 2 min read

Quick answer

MiniMax has open-sourced M2.7, a self-evolving AI agent model that scores 56.22% on SWE-Pro and 57.0% on Terminal Bench 2 — matching proprietary models from OpenAI and Anthropic on real-world software engineering tasks. The weights are freely available on Hugging Face.

Chinese AI company MiniMax has open-sourced M2.7, a new AI agent model that matches the best proprietary models on real-world software engineering tasks — and you can download it right now.

What Makes M2.7 Different

Most AI coding benchmarks test models on tidy algorithm puzzles. M2.7 was built for messier, more realistic work: debugging production code, analysing server logs, reviewing code for security flaws, and managing machine learning workflows.

On SWE-Pro — a benchmark that mirrors real software engineering scenarios — M2.7 scores 56.22%, matching OpenAI’s GPT-5.3-Codex. On Terminal Bench 2, which tests command-line problem solving, it hits 57.0%. Among open-source models, it holds the top ELO rating on GDPval-AA, trailing only Claude Opus 4.6, Sonnet 4.6, and GPT-5.4 overall.

The Self-Evolving Angle

Here is the part that caught researchers’ attention. M2.7 is the first model to actively participate in its own training. During development, MiniMax let the model run over 100 autonomous rounds of scaffold optimisation — essentially letting it fine-tune how it approaches tasks. The result was a 30% performance improvement driven by the model itself, not just by human engineers tweaking parameters.

MiniMax says this points toward a future where models can continuously improve with less human intervention.

A Licensing Wrinkle

There is one thing to watch. Shortly after the initial release, MiniMax quietly changed the license terms, drawing criticism from the open-source community. If you plan to use M2.7 commercially, check the current license on Hugging Face before building anything on top of it.

What This Means for You

If you use AI for coding — whether through Claude Code, Cursor, or other tools — M2.7 is worth paying attention to. An open-source model performing at the level of GPT-5.3 means more competition, lower prices, and more options for developers who want to run models locally without sending code to the cloud.

You can try M2.7 through Ollama for local use or via API providers like OpenRouter. For context on how it compares to other coding AI tools, see our Claude Code guide. And if you are just getting started with AI-assisted development, visit our beginner’s guide.

Frequently asked questions

What is MiniMax M2.7?
MiniMax M2.7 is an open-source AI agent model designed for real-world software engineering tasks like debugging, code review, and log analysis. It matches the performance of leading proprietary models like GPT-5.3-Codex and Claude Opus on key benchmarks.
Is MiniMax M2.7 free to use?
Yes, the model weights are available on Hugging Face and Ollama. You can run it locally or access it through API providers like OpenRouter. However, MiniMax changed the license after initial release, so check the current terms before commercial use.
What does self-evolving mean for an AI model?
MiniMax M2.7 participated in over 100 rounds of its own training optimisation, autonomously improving its scaffolding and performance by 30%. It is the first model to actively contribute to its own development process.
How does MiniMax M2.7 compare to Claude and GPT?
On SWE-Pro, M2.7 scores 56.22% — nearly matching Claude Opus 4.6 and on par with GPT-5.3-Codex. On Terminal Bench 2 it scores 57.0%. Among open-source models, it holds the highest ELO rating on GDPval-AA.

Want to keep learning?

Explore our guided learning paths or try building something with AI right now.

Enjoyed this article?

Subscribe for more AI insights delivered to your inbox every week.

No spam. Unsubscribe anytime.