Publish time
-Model Series
Input type
Output type
Context Window
1,000,000Max Output Length
32,768Input Price
¥0.8 / 1M tokensOutput Price
¥2 / 1M tokens能力均衡,推理效果和速度介于通义千问-Max和通义千问-Turbo之间,适合中等复杂任务。
Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.
TTFT
No data
Throughput
4.63tps
Uptime
100.00%
Provider Model
sophnet/qwen-plus-latest
Supported Parameters
Recent Uptime
Reasoning
-
Supported Response Formats
Request Log Collection
-
Distillable
-
Total Context
1,000,000
Max Output
32,768
Input Price
¥0.8 / 1M tokens
Output Price
¥2 / 1M tokens
TTFT
0.01s
Throughput
5.06tps
Uptime
100.00%
Provider Model
aliyun/qwen-plus-latest
Supported Parameters
Recent Uptime
Reasoning
-
Supported Response Formats
Request Log Collection
-
Distillable
-
Total Context
1,000,000
Max Output
32,768
Input Price
¥0.8 / 1M tokens
Output Price
¥2 / 1M tokens
Compare different providers across Zhinao API
16.69 tok/s
0.28 s
Uptime for alibaba/qwen-plus-latest across all providers
Zhinao API normalizes requests and responses across providers for you
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.360.cn/v1",
apiKey: process.env.ZHINAO_API_KEY,
});
const response = await client.chat.completions.create({
model: "alibaba/qwen-plus-latest",
messages: [
{ role: "user", content: "Hello, how are you?" }
],
temperature: 0.7,
max_tokens: 1000,
});
console.log(response.choices[0].message.content);