Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.
TTFT
No data
Throughput
No data
Uptime
No data
Provider Model
360/huaweicloud-deepseek-r1
Supported Parameters
Recent Uptime
Reasoning
-
Supported Response Formats
Request Log Collection
-
Distillable
-
Total Context
65,536
Max Output
31,000
Input Price
¥4 / 1M tokens
Output Price
¥16 / 1M tokens
TTFT
8.60s
Throughput
20.15tps
Uptime
100.00%
Provider Model
360/huaweiyun-deepseek-r1
Supported Parameters
Recent Uptime
Reasoning
-
Supported Response Formats
Request Log Collection
-
Distillable
-
Total Context
65,536
Max Output
31,000
Input Price
¥4 / 1M tokens
Output Price
¥16 / 1M tokens
TTFT
0.14s
Throughput
0.01tps
Uptime
No data
Provider Model
volcengine/deepseek-r1
Supported Parameters
Recent Uptime
Reasoning
-
Supported Response Formats
Request Log Collection
-
Distillable
-
Total Context
65,536
Max Output
8,096
Input Price
¥4 / 1M tokens
Output Price
¥16 / 1M tokens
TTFT
5.29s
Throughput
20.72tps
Uptime
100.00%
Provider Model
huaweicloud/deepseek-r1
Supported Parameters
Recent Uptime
Reasoning
Toggleable
Supported Response Formats
Request Log Collection
ZDR Supported
Distillable
Yes
Total Context
65,536
Max Output
31,000
Input Price
¥2 / 1M tokens
Output Price
¥8 / 1M tokens
TTFT
6.03s
Throughput
20.03tps
Uptime
100.00%
Provider Model
paratera/deepseek-r1
Supported Parameters
Recent Uptime
Reasoning
-
Supported Response Formats
Request Log Collection
-
Distillable
-
Total Context
65,536
Max Output
31,000
Input Price
¥4 / 1M tokens
Output Price
¥16 / 1M tokens
TTFT
8.36s
Throughput
23.84tps
Uptime
99.00%
Provider Model
qiniu/deepseek-r1
Supported Parameters
Recent Uptime
Reasoning
Toggleable
Supported Response Formats
Request Log Collection
ZDR Supported
Distillable
Yes
Total Context
65,536
Max Output
8,096
Input Price
¥4 / 1M tokens
Output Price
¥16 / 1M tokens
TTFT
6.23s
Throughput
20.92tps
Uptime
100.00%
Provider Model
baidu/deepseek-r1-250528
Supported Parameters
Recent Uptime
Reasoning
-
Supported Response Formats
Request Log Collection
-
Distillable
-
Total Context
65,536
Max Output
8,096
Input Price
¥2 / 1M tokens
Output Price
¥8 / 1M tokens
TTFT
4.04s
Throughput
23.67tps
Uptime
100.00%
Provider Model
sophnet/deepseek-r1
Supported Parameters
Recent Uptime
Reasoning
-
Supported Response Formats
Request Log Collection
-
Distillable
-
Total Context
4,096
Max Output
2,048
Input Price
¥4 / 1M tokens
Output Price
¥16 / 1M tokens
TTFT
No data
Throughput
No data
Uptime
No data
Provider Model
guizhoumobile/deepseek-r1
Supported Parameters
Recent Uptime
Reasoning
-
Supported Response Formats
Request Log Collection
-
Distillable
-
Total Context
65,536
Max Output
8,096
Input Price
¥4 / 1M tokens
Output Price
¥16 / 1M tokens
Compare different providers across Zhinao API
20.38 tok/s
6.74 s
Uptime for deepseek-r1 across all providers
Zhinao API normalizes requests and responses across providers for you
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.360.cn/v1",
apiKey: process.env.ZHINAO_API_KEY,
});
const response = await client.chat.completions.create({
model: "deepseek-r1",
messages: [
{ role: "user", content: "Hello, how are you?" }
],
temperature: 0.7,
max_tokens: 1000,
});
console.log(response.choices[0].message.content);