MiniMax: MiniMax M2.1

minimax/minimax-m2.1

MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency.

Compared to its predecessor, M2.1 delivers cleaner, more concise outputs and faster perceived response times. It shows leading multilingual coding performance across major systems and application languages, achieving 49.4% on Multi-SWE-Bench and 72.5% on SWE-Bench Multilingual, and serves as a versatile agent “brain” for IDEs, coding tools, and general-purpose assistance.

To avoid degrading this model's performance, MiniMax highly recommends preserving reasoning between turns.
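
One way to follow this recommendation is to append the full assistant message to the conversation history instead of copying only its text. The sketch below is a minimal illustration using plain dictionaries. The helper `append_assistant_turn` is hypothetical, and the `reasoning_content` key is an assumption about how reasoning is returned; check the response you actually receive for the real field name and whether the endpoint accepts it back.

```python
from copy import deepcopy


def append_assistant_turn(history, assistant_message):
    """Return a new history with the complete assistant message appended.

    The whole message dict is copied, so any reasoning field that came
    back with the reply is carried into the next request.
    """
    return history + [deepcopy(assistant_message)]


# Hypothetical assistant reply. The "reasoning_content" key is an assumption,
# not a confirmed field name.
assistant_message = {
    "role": "assistant",
    "content": "Use a dict keyed by user id.",
    "reasoning_content": "The lookup is by id, so a hash map fits.",
}

history = [{"role": "user", "content": "How should I store user sessions?"}]
history = append_assistant_turn(history, assistant_message)
history.append({"role": "user", "content": "Show me the code."})
```

The resulting `history` list is what would be sent as `messages` on the next turn.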

By minimax. Publish time: 2025-12-22.

Recent activity on MiniMax M2.1

Tokens processed per day

Throughput (tokens/s)

Providers	Min (tokens/s)	Max (tokens/s)	Avg (tokens/s)
MiniMax	24.04	52.54	34.99

First Token Latency (ms)

Providers	Min (ms)	Max (ms)	Avg (ms)
MiniMax	1964	2927	2504.77
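
The two tables above can be combined into a rough estimate of how long a full response takes: time to first token plus generation time. The sketch below assumes tokens arrive at a constant rate after the first one, which is a simplification, and the function name `estimate_response_seconds` is hypothetical.

```python
def estimate_response_seconds(output_tokens, first_token_ms, tokens_per_second):
    """Rough wall-clock estimate: first-token latency plus streaming time."""
    return first_token_ms / 1000.0 + output_tokens / tokens_per_second


# Average figures from the tables above, for a 1,000-token response.
avg_estimate = estimate_response_seconds(1000, 2504.77, 34.99)
print(f"{avg_estimate:.1f} s")
```

Swapping in the Min and Max rows gives a range rather than a single number, which is usually the more honest way to budget for latency.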

Providers for MiniMax M2.1

ZenMux routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.

Provider	Latency	Throughput (tps)	Uptime	Recent uptime
MiniMax	3.58	35.73	100.00	100.00% (Jan 05, 2026, 7 PM)

Price

Item	Price
Input	$0.3 / M tokens
Output	$1.2 / M tokens
Cache read	$0.03 / M tokens
Cache write	$0.375 / M tokens

No price is listed for: Cache write 5m, Cache write 1h, Web search, Image, Reasoning, Video, Audio, Audio & Video.
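
As a worked example of the per-million-token prices listed above, the sketch below estimates the cost of a single request. It assumes cached tokens are billed separately from regular input tokens, which this page does not confirm; `estimate_cost_usd` is a hypothetical helper.

```python
# USD per million tokens, taken from the price list above.
PRICE_PER_M = {
    "input": 0.3,
    "output": 1.2,
    "cache_read": 0.03,
    "cache_write": 0.375,
}


def estimate_cost_usd(input_tokens=0, output_tokens=0,
                      cache_read_tokens=0, cache_write_tokens=0):
    """Sum each token count times its per-million price."""
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_read": cache_read_tokens,
        "cache_write": cache_write_tokens,
    }
    return sum(counts[k] * PRICE_PER_M[k] for k in counts) / 1_000_000


# 10,000 input tokens and 2,000 output tokens.
cost = estimate_cost_usd(input_tokens=10_000, output_tokens=2_000)
print(f"${cost:.4f}")
```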

Model limitation

Context	204.80K
Max output	131.07K
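
These limits can be checked before sending a request. The sketch below assumes the rounded figures correspond to 204,800 and 131,072 tokens, and that prompt and output share the same context window; neither is stated on this page. `clamp_max_output` is a hypothetical helper.

```python
CONTEXT_LIMIT = 204_800     # assumed exact value behind "204.80K"
MAX_OUTPUT_LIMIT = 131_072  # assumed exact value behind "131.07K"


def clamp_max_output(prompt_tokens, requested_output_tokens):
    """Return the largest output budget that fits both limits."""
    if prompt_tokens >= CONTEXT_LIMIT:
        raise ValueError("prompt alone exceeds the context window")
    room_left = CONTEXT_LIMIT - prompt_tokens
    return min(requested_output_tokens, MAX_OUTPUT_LIMIT, room_left)


# A 100,000-token prompt leaves 104,800 tokens of room in the window.
budget = clamp_max_output(prompt_tokens=100_000, requested_output_tokens=200_000)
```

The returned value is a candidate for `max_completion_tokens`.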

Supported Parameters

max_completion_tokens

temperature

top_p

frequency_penalty

presence_penalty

seed

logit_bias

logprobs

top_logprobs

response_format

stop

tools

tool_choice

parallel_tool_calls
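
A small guard can catch unsupported parameters before a request leaves your code. The sketch below checks keyword arguments against the list above and builds a request dictionary. `build_request` is a hypothetical helper, not part of any SDK, and the parameter values are illustrative.

```python
# Parameter names copied from the supported list above.
SUPPORTED_PARAMETERS = {
    "max_completion_tokens", "temperature", "top_p", "frequency_penalty",
    "presence_penalty", "seed", "logit_bias", "logprobs", "top_logprobs",
    "response_format", "stop", "tools", "tool_choice", "parallel_tool_calls",
}


def build_request(model, messages, **params):
    """Return request kwargs, rejecting parameters not in the supported list."""
    unsupported = sorted(set(params) - SUPPORTED_PARAMETERS)
    if unsupported:
        raise ValueError(f"unsupported parameters: {', '.join(unsupported)}")
    return {"model": model, "messages": messages, **params}


request = build_request(
    "minimax/minimax-m2.1",
    [{"role": "user", "content": "Summarize PEP 8 in one sentence."}],
    temperature=0.2,
    max_completion_tokens=256,
    seed=7,
)
```

With the OpenAI Python SDK, the result could then be passed as `client.chat.completions.create(**request)`.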

Model Protocol Compatibility

OpenAI Chat Completions

OpenAI Responses

Anthropic Messages

Google VertexAI

Sample code and API for MiniMax M2.1

ZenMux normalizes requests and responses across providers for you.

OpenAI: Python-SDK

from openai import OpenAI  
  
client = OpenAI(  
  base_url="https://zenmux.ai/api/v1",  
  api_key="<ZENMUX_API_KEY>",  
)  
  
# Chat Completion  
completion = client.chat.completions.create(  
  model="minimax/minimax-m2.1",  
  messages=[  
    {  
      "role": "user",  
      "content": "What is the meaning of life?"  
    }  
  ]  
)  
print(completion.choices[0].message.content)