kimi-k2

Online Chat

Publish time

Model Series

Input type

Output type

Context Window

128,000

Max Output Length

32,000

Input Price

¥4 / 1M tokens

Output Price

¥16 / 1M tokens

Kimi-K2 是一款Moonshot AI推出的具备超强代码和 Agent 能力的 MoE 架构基础模型，总参数 1T，激活参数 32B。在通用知识推理、编程、数学、Agent 等主要类别的基准性能测试中，K2 模型的性能超过其他主流开源模型。

Providers for kimi-k2

Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.

国内

TTFT

No data

Throughput

No data

Uptime

No data

Provider Model

volcengine/kimi-k2-250711

Supported Parameters

Recent Uptime

5月14日 8 PM

Reasoning

Supported Response Formats

OpenAI Chat CompletionsOpenAI ResponsesAnthropic MessagesGoogle VertexAI

Request Log Collection

Distillable

Total Context

4,096

Max Output

2,048

Input Price

¥4 / 1M tokens

Output Price

¥16 / 1M tokens

Performance for kimi-k2

Compare different providers across Zhinao API

Throughput

21.35 tok/s

TTFT

6.10 s

Uptime for kimi-k2

Uptime for kimi-k2 across all providers

Sample code and API for kimi-k2

Get API Key

Zhinao API normalizes requests and responses across providers for you

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.360.cn/v1",
  apiKey: process.env.ZHINAO_API_KEY,
});

const response = await client.chat.completions.create({
  model: "kimi-k2",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ],
  temperature: 0.7,
  max_tokens: 1000,
});

console.log(response.choices[0].message.content);

kimi-k2

Online Chat

Publish time

Model Series

Input type

Output type

Context Window

128,000

Max Output Length

32,000

Input Price

¥4 / 1M tokens

Output Price

¥16 / 1M tokens

Providers for kimi-k2

Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.

国内

TTFT

No data

Throughput

No data

Uptime

No data

Provider Model

volcengine/kimi-k2-250711

Supported Parameters

Recent Uptime

5月14日 8 PM

Reasoning

Supported Response Formats

OpenAI Chat CompletionsOpenAI ResponsesAnthropic MessagesGoogle VertexAI

Request Log Collection

Distillable

Total Context

4,096

Max Output

2,048

Input Price

¥4 / 1M tokens

Output Price

¥16 / 1M tokens

Performance for kimi-k2

Compare different providers across Zhinao API

Throughput

21.35 tok/s

TTFT

6.10 s

Uptime for kimi-k2

Uptime for kimi-k2 across all providers

Sample code and API for kimi-k2

Get API Key

Zhinao API normalizes requests and responses across providers for you

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.360.cn/v1",
  apiKey: process.env.ZHINAO_API_KEY,
});

const response = await client.chat.completions.create({
  model: "kimi-k2",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ],
  temperature: 0.7,
  max_tokens: 1000,
});

console.log(response.choices[0].message.content);