DeepSeek

https://www.deepseek.com/

3 models

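DeepSeek-V4 offers an ultra-long context on the order of one million characters and leads among Chinese domestic and open-source models in agent capability, world knowledge, and reasoning performance. Compared with DeepSeek-V4-Pro, DeepSeek-V4-Flash has somewhat less world knowledge but shows comparable reasoning ability. Because its parameter count and activations are smaller, V4-Flash can provide a faster and more economical API service.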

General, Long context, Function calling, Translation, Code, +1

Input Price: ¥1 / 1M tokens

Output Price: ¥2 / 1M tokens

Context: 1,000,000

Max Output: 384,000

Providers

May 14, 11 PM: 100.00%

deepseek/deepseek-v4-pro

DeepSeek

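DeepSeek-V4 offers an ultra-long context on the order of one million characters and leads among Chinese domestic and open-source models in agent capability, world knowledge, and reasoning performance. Compared with the previous-generation model, DeepSeek-V4-Pro has significantly stronger agent capability. On Agentic Coding evaluations, V4-Pro reaches the best level among current open-source models, and it also performs well on other agent-related evaluations.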

General, Long context, Function calling, Translation, Code, +2

Input Price: ¥3 / 1M tokens

Output Price: ¥6 / 1M tokens

Context: 1,000,000

Max Output: 384,000

Providers

May 14, 11 PM: 99.86%

deepseek/deepseek-r1-distill-qwen-32b

DeepSeek

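DeepSeek-R1-Distill models are obtained by fine-tuning open-source models on sample data generated by DeepSeek-R1. DeepSeek-R1 is a reasoning large language model released by DeepSeek (深度求索). In its post-training stage, DeepSeek-R1 used reinforcement learning at scale, which greatly improved the model's reasoning ability with only a very small amount of labeled data. On math, code, and natural-language reasoning tasks, its performance is on par with the official release of OpenAI o1.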

Input Price: ¥1.5 / 1M tokens

Output Price: ¥6 / 1M tokens

Context: 65,536

Max Output: 8,096

Providers

May 14, 10 AM: 100.00%
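
The per-million-token prices listed above can be turned into a rough per-request cost estimate. The following is a minimal sketch, not an official calculator: the `Listing` class, the `LISTINGS` table, and the `estimate_cost` function are hypothetical names introduced here, and the key `deepseek-v4-flash` is an assumed label for the first listing, whose model ID does not appear on the page. Prices and limits are copied from the listings as shown.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Listing:
    input_price: float   # CNY per 1M input tokens, as listed
    output_price: float  # CNY per 1M output tokens, as listed
    max_output: int      # listed "Max Output" value


# Values copied from the listings; the first key is an assumed label,
# because the page does not show a model ID for that listing.
LISTINGS = {
    "deepseek-v4-flash": Listing(1.0, 2.0, 384_000),
    "deepseek/deepseek-v4-pro": Listing(3.0, 6.0, 384_000),
    "deepseek/deepseek-r1-distill-qwen-32b": Listing(1.5, 6.0, 8_096),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in CNY for one request."""
    listing = LISTINGS[model]
    if output_tokens > listing.max_output:
        raise ValueError("output_tokens exceeds the listed Max Output")
    # Multiply first, then divide once, to limit floating-point drift.
    total = input_tokens * listing.input_price + output_tokens * listing.output_price
    return total / 1_000_000


if __name__ == "__main__":
    for name in LISTINGS:
        cost = estimate_cost(name, input_tokens=50_000, output_tokens=4_000)
        print(f"{name}: ¥{cost:.4f}")
```

Under these assumptions, a request with 200,000 input tokens and 10,000 output tokens on `deepseek/deepseek-v4-pro` comes to 0.60 + 0.06 = ¥0.66. Actual billing may differ, for example if a provider applies caching discounts or its own pricing.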