🎄 Holiday Special: Get +25% FREE CREDITS on all top-ups until Jan 5th. Add Credits
logologo
ModelsChatBenchmarksDocsBlogChangelogAbout Us
Back
minimax/minimax-m2.1

MiniMax: MiniMax M2.1

minimax/minimax-m2.1

MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency.

Compared to its predecessor, M2.1 delivers cleaner, more concise outputs and faster perceived response times. It shows leading multilingual coding performance across major systems and application languages, achieving 49.4% on Multi-SWE-Bench and 72.5% on SWE-Bench Multilingual, and serves as a versatile agent “brain” for IDEs, coding tools, and general-purpose assistance.

To avoid degrading this model's performance, MiniMax highly recommends preserving reasoning between turns.

ByminimaxInput typeOutput typePublish time2025-12-22

Recent activity on MiniMax M2.1

Tokens processed per day

Throughput

(tokens/s)
ProvidersMin (tokens/s)Max (tokens/s)Avg (tokens/s)
MiniMax24.0452.5434.99

First Token Latency

(ms)
ProvidersMin (ms)Max (ms)Avg (ms)
MiniMax196429272504.77
Providers for MiniMax M2.1
ZenMux Provider to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.
MiniMax

MiniMax

Latency
3.58
s
Throughput
35.73
tps
Uptime
100.00
%
Recent uptime
Jan 05,2026 - 7 PM100.00%
Price
Input
$ 0.3/ M tokens
Output
$ 1.2/ M tokens
Cache read
$ 0.03/ M tokens
Cache write 5m
-
Cache write 1h
-
Cache write
$ 0.375/ M tokens
Web search
-
Image
-
reasoning
-
Video
-
Audio
-
Audio & Video
-
Model limitation
Context
204.80K
Max output
131.07K
Supported Parameters
max_completion_tokens
temperature
top_p
frequency_penalty
-
presence_penalty
-
seed
-
logit_bias
-
logprobs
-
top_logprobs
-
response_format
stop
-
tools
tool_choice
parallel_tool_calls
-
Model Protocol Compatibility
OpenAI Chat Completions
OpenAI Responses
-
Anthropic Messages
Google VertexAI
-

Sample code and API for MiniMax M2.1

ZenMux normalizes requests and responses across providers for you.
OpenAI: Python-SDK
OpenAI: cURL
Anthropic: Python-SDK
Anthropic: cURL
python
from openai import OpenAI  
  
client = OpenAI(  
  base_url="https://zenmux.ai/api/v1",  
  api_key="<ZENMUX_API_KEY>",  
)  
  
# Chat Completion  
completion = client.chat.completions.create(  
  model="minimax/minimax-m2.1",  
  messages=[  
    {  
      "role": "user",  
      "content": "What is the meaning of life?"  
    }  
  ]  
)  
print(completion.choices[0].message.content)