Zhinao API routes each request to the best-fit provider and automatically fails over to the provider with the highest availability.
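The failover behavior described above can be sketched roughly as follows. This is an illustrative model only, not Zhinao's actual routing code; the `Provider` type and `pickProvider` helper are hypothetical:

```typescript
// Minimal sketch of availability-based failover (illustrative only).
type Provider = { name: string; uptime: number | null };

// Prefer the provider with the highest known uptime; providers with
// no uptime data sort last.
function pickProvider(providers: Provider[]): Provider {
  return [...providers].sort(
    (a, b) => (b.uptime ?? -1) - (a.uptime ?? -1)
  )[0];
}

const providers: Provider[] = [
  { name: "qiniu/deepseek-v3", uptime: 100.0 },
  { name: "volcengine/deepseek-v3-250324", uptime: null }, // no data
];

console.log(pickProvider(providers).name); // "qiniu/deepseek-v3"
```

A real router would also weigh latency, throughput, and price, but the ranking idea is the same.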
| Provider Model | TTFT | Throughput | Uptime | Reasoning | Request Log Collection | Distillable | Total Context | Max Output | Input Price (per 1M tokens) | Output Price (per 1M tokens) |
|---|---|---|---|---|---|---|---|---|---|---|
| volcengine/deepseek-v3-250324 | No data | No data | No data | - | - | - | 65,536 | 8,096 | ¥2 | ¥8 |
| qiniu/deepseek-v3 | 0.06 s | 21.01 tps | 100.00% | - | - | - | 65,536 | 8,096 | ¥1.3 | ¥5.2 |
| baidu/deepseek-v3 | No data | 20.83 tps | 100.00% | - | - | - | 65,536 | 8,096 | ¥1 | ¥4 |
| siliconflow/deepseek-v3 | 4.85 s | 12.02 tps | 100.00% | - | - | - | 65,536 | 8,096 | ¥2 | ¥8 |
| tencent/deepseek-v3 | 0.35 s | 2.41 tps | 100.00% | - | - | - | 65,536 | 8,096 | ¥2 | ¥8 |
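Given the listed per-1M-token prices, the cost of a request is simple to estimate. A small sketch (the `estimateCost` helper is ours, not part of the API):

```typescript
// Estimate request cost in ¥ from per-1M-token prices.
function estimateCost(
  inputTokens: number,
  outputTokens: number,
  inputPricePerM: number,
  outputPricePerM: number,
): number {
  return (
    (inputTokens / 1_000_000) * inputPricePerM +
    (outputTokens / 1_000_000) * outputPricePerM
  );
}

// Example: 1M input + 500K output tokens on baidu/deepseek-v3 (¥1 in, ¥4 out).
console.log(estimateCost(1_000_000, 500_000, 1, 4)); // 3
```

Note that output tokens dominate cost at every listed provider, since output is priced 4x input.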
[Chart: compare different providers across Zhinao API (18.64 tok/s, 1.77 s)]

[Chart: uptime for deepseek-chat-v3 across all providers]
Zhinao API normalizes requests and responses across providers for you:

```typescript
import OpenAI from "openai";

// Point the official OpenAI SDK at Zhinao's OpenAI-compatible endpoint.
const client = new OpenAI({
  baseURL: "https://api.360.cn/v1",
  apiKey: process.env.ZHINAO_API_KEY,
});

const response = await client.chat.completions.create({
  model: "deepseek-chat-v3",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ],
  temperature: 0.7,
  max_tokens: 1000,
});

console.log(response.choices[0].message.content);
```