minimax/MiniMax-M2.5-highspeed

Online Chat

MiniMax

Publish time

2026/2/12

Model Series

MiniMax

Input type

Output type

Context Window

204,800

Max Output Length

16,384

Input Price

¥4.2 / 1M tokens

Output Price

¥16.8 / 1M tokens

MiniMax M2.5 的更高吞吐版本，适合对延迟更敏感的生产场景。

Providers for minimax/MiniMax-M2.5-highspeed

Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.

MiniMax

国内

TTFT

0.75s

Throughput

24.24tps

Uptime

100.00%

Provider Model

minimax/minimax/MiniMax-M2.5-highspeed

Supported Parameters

temperaturetop_ptop_k

Recent Uptime

5月24日 11 PM100.00%

Reasoning

Toggleable

Supported Response Formats

OpenAI Chat Completions

Request Log Collection

ZDR Supported

Distillable

Yes

Total Context

204,800

Max Output

16,384

Input Price

¥4.2 / 1M tokens

Output Price

¥16.8 / 1M tokens

Performance for minimax/MiniMax-M2.5-highspeed

Compare different providers across Zhinao API

Throughput

37.04 tok/s

TTFT

2.21 s

Uptime for minimax/MiniMax-M2.5-highspeed

Uptime for minimax/MiniMax-M2.5-highspeed across all providers

Sample code and API for minimax/MiniMax-M2.5-highspeed

Get API Key

Zhinao API normalizes requests and responses across providers for you

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.360.cn/v1",
  apiKey: process.env.ZHINAO_API_KEY,
});

const response = await client.chat.completions.create({
  model: "minimax/MiniMax-M2.5-highspeed",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ],
  temperature: 0.7,
  max_tokens: 1000,
});

console.log(response.choices[0].message.content);

minimax/MiniMax-M2.5-highspeed

Online Chat

MiniMax

Publish time

2026/2/12

Model Series

MiniMax

Input type

Output type

Context Window

204,800

Max Output Length

16,384

Input Price

¥4.2 / 1M tokens

Output Price

¥16.8 / 1M tokens

MiniMax M2.5 的更高吞吐版本，适合对延迟更敏感的生产场景。

Providers for minimax/MiniMax-M2.5-highspeed

Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.

MiniMax

国内

TTFT

0.75s

Throughput

24.24tps

Uptime

100.00%

Provider Model

minimax/minimax/MiniMax-M2.5-highspeed

Supported Parameters

temperaturetop_ptop_k

Recent Uptime

5月24日 11 PM100.00%

Reasoning

Toggleable

Supported Response Formats

OpenAI Chat Completions

Request Log Collection

ZDR Supported

Distillable

Yes

Total Context

204,800

Max Output

16,384

Input Price

¥4.2 / 1M tokens

Output Price

¥16.8 / 1M tokens

Performance for minimax/MiniMax-M2.5-highspeed

Compare different providers across Zhinao API

Throughput

37.04 tok/s

TTFT

2.21 s

Uptime for minimax/MiniMax-M2.5-highspeed

Uptime for minimax/MiniMax-M2.5-highspeed across all providers

Sample code and API for minimax/MiniMax-M2.5-highspeed

Get API Key

Zhinao API normalizes requests and responses across providers for you

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.360.cn/v1",
  apiKey: process.env.ZHINAO_API_KEY,
});

const response = await client.chat.completions.create({
  model: "minimax/MiniMax-M2.5-highspeed",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ],
  temperature: 0.7,
  max_tokens: 1000,
});

console.log(response.choices[0].message.content);