Publish time
2026/2/12Model Series
MiniMaxInput type
Output type
Context Window
204,800Max Output Length
16,384Input Price
¥4.2 / 1M tokensOutput Price
¥16.8 / 1M tokensMiniMax M2.5 的更高吞吐版本,适合对延迟更敏感的生产场景。
Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.
TTFT
0.75s
Throughput
24.24tps
Uptime
100.00%
Provider Model
minimax/minimax/MiniMax-M2.5-highspeed
Supported Parameters
Recent Uptime
Reasoning
Toggleable
Supported Response Formats
Request Log Collection
ZDR Supported
Distillable
Yes
Total Context
204,800
Max Output
16,384
Input Price
¥4.2 / 1M tokens
Output Price
¥16.8 / 1M tokens
Compare different providers across Zhinao API
37.04 tok/s
2.21 s
Uptime for minimax/MiniMax-M2.5-highspeed across all providers
Zhinao API normalizes requests and responses across providers for you
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.360.cn/v1",
apiKey: process.env.ZHINAO_API_KEY,
});
const response = await client.chat.completions.create({
model: "minimax/MiniMax-M2.5-highspeed",
messages: [
{ role: "user", content: "Hello, how are you?" }
],
temperature: 0.7,
max_tokens: 1000,
});
console.log(response.choices[0].message.content);