Zhinao API routes each request to the best-fit provider and automatically fails over to the provider with the highest availability.
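The failover idea can be illustrated with a small client-side sketch. This is not Zhinao API's actual routing logic, which runs on the service side and is not documented on this page. The `providers` array is a hand-copied snapshot of the uptime and throughput figures listed on this page, the tie-break on throughput is an assumption, and `failoverOrder`, `callWithFailover`, and `send` are hypothetical names.

```javascript
// Hypothetical snapshot of the provider figures listed on this page.
const providers = [
  { id: "huaweicloud/qwen3-235b-a22b", uptime: 99.0, tokensPerSecond: 23.56 },
  { id: "qiniu/qwen3-235b-a22b", uptime: 99.0, tokensPerSecond: 19.97 },
  { id: "sophnet/qwen3-235b-a22b", uptime: 100.0, tokensPerSecond: 7 },
  { id: "alibaba/qwen3-235b-a22b", uptime: 91.0, tokensPerSecond: 30.48 },
];

// Order candidates by uptime, highest first.
// Ties are broken by throughput; that rule is an assumption made for this sketch.
function failoverOrder(list) {
  return [...list].sort(
    (a, b) => b.uptime - a.uptime || b.tokensPerSecond - a.tokensPerSecond
  );
}

// Try each provider in failover order until one call succeeds.
// `send` is a hypothetical async function: (providerId) => result.
async function callWithFailover(list, send) {
  const errors = [];
  for (const provider of failoverOrder(list)) {
    try {
      return await send(provider.id);
    } catch (err) {
      errors.push(`${provider.id}: ${err.message}`);
    }
  }
  throw new Error(`All providers failed: ${errors.join("; ")}`);
}
```

With the snapshot above, `failoverOrder(providers)` puts `sophnet/qwen3-235b-a22b` first and `alibaba/qwen3-235b-a22b` last.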
Provider Model: huaweicloud/qwen3-235b-a22b
TTFT: 1.58 s
Throughput: 23.56 tok/s
Uptime: 99.00%
Reasoning: -
Request Log Collection: -
Distillable: -
Total Context: 65,536 tokens
Max Output: 31,000 tokens
Input Price: ¥2 / 1M tokens
Output Price: ¥8 / 1M tokens

Provider Model: qiniu/qwen3-235b-a22b
TTFT: 1.67 s
Throughput: 19.97 tok/s
Uptime: 99.00%
Reasoning: -
Request Log Collection: -
Distillable: -
Total Context: 65,536 tokens
Max Output: 31,000 tokens
Input Price: ¥2 / 1M tokens
Output Price: ¥8 / 1M tokens

Provider Model: sophnet/qwen3-235b-a22b
TTFT: 1.18 s
Throughput: 7 tok/s
Uptime: 100.00%
Reasoning: -
Request Log Collection: -
Distillable: -
Total Context: 65,536 tokens
Max Output: 31,000 tokens
Input Price: ¥2 / 1M tokens
Output Price: ¥8 / 1M tokens

Provider Model: alibaba/qwen3-235b-a22b
TTFT: 1.92 s
Throughput: 30.48 tok/s
Uptime: 91.00%
Reasoning: -
Request Log Collection: -
Distillable: -
Total Context: 65,536 tokens
Max Output: 16,384 tokens
Input Price: ¥2 / 1M tokens
Output Price: ¥20 / 1M tokens
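
Because the providers differ in output price, the cost of a single request depends on which one serves it. The following is a rough sketch of the arithmetic, using the per-1M-token prices listed on this page. `PRICES` and `estimateCostCny` are hypothetical names, and actual billing may round or meter tokens differently.

```javascript
// Prices in CNY per 1M tokens, copied by hand from the provider listings on this page.
const PRICES = {
  "huaweicloud/qwen3-235b-a22b": { input: 2, output: 8 },
  "qiniu/qwen3-235b-a22b": { input: 2, output: 8 },
  "sophnet/qwen3-235b-a22b": { input: 2, output: 8 },
  "alibaba/qwen3-235b-a22b": { input: 2, output: 20 },
};

// Estimate the cost of one request in CNY:
// (input tokens * input price + output tokens * output price) / 1,000,000.
function estimateCostCny(providerModel, inputTokens, outputTokens) {
  const price = PRICES[providerModel];
  if (!price) throw new Error(`Unknown provider model: ${providerModel}`);
  return (inputTokens * price.input + outputTokens * price.output) / 1_000_000;
}

// Example: 10,000 input tokens and 1,000 output tokens.
console.log(estimateCostCny("huaweicloud/qwen3-235b-a22b", 10_000, 1_000)); // 0.028
console.log(estimateCostCny("alibaba/qwen3-235b-a22b", 10_000, 1_000)); // 0.04
```

For the same request, the alibaba listing costs more only because of its higher output price; input pricing is identical across all four providers.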
Compare different providers across Zhinao API
Throughput: 22.26 tok/s
Latency: 1.63 s
Uptime for qwen3-235b-a22b across all providers
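Zhinao API normalizes requests and responses across providers for you, so the same OpenAI-compatible client code works regardless of which provider serves the request:

    import OpenAI from "openai";

    // Point the OpenAI SDK at the Zhinao API endpoint.
    const client = new OpenAI({
      baseURL: "https://api.360.cn/v1",
      apiKey: process.env.ZHINAO_API_KEY, // read the key from the environment
    });

    // Request the model by name; provider selection happens on the service side.
    const response = await client.chat.completions.create({
      model: "qwen3-235b-a22b",
      messages: [
        { role: "user", content: "Hello, how are you?" },
      ],
      temperature: 0.7,
      max_tokens: 1000,
    });

    // Print the assistant's reply.
    console.log(response.choices[0].message.content);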