Publish time
2026/4/1Model Series
GLMInput type
Output type
Context Window
200,000Max Output Length
128,000Input Price
¥5 / 1M tokensOutput Price
¥22 / 1M tokensGLM-5V-Turbo 是智谱首个多模态 Agent 基座模型,面向视觉编程与复杂任务场景深度优化,支持图像、视频、文本与文件等多模态输入,强化视觉理解、长程规划与动作执行能力。相比通用多模态模型,它更适合融入 Agent 工作流,完成「环境感知 → 任务规划 → 执行落地」的完整闭环,让多模态能力从“能看懂”走向“能行动”。
Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.
TTFT
8.28s
Throughput
30.04tps
Uptime
100.00%
Provider Model
bigmodel/z-ai/glm-5v-turbo
Supported Parameters
Recent Uptime
Reasoning
Toggleable
Supported Response Formats
Request Log Collection
ZDR Supported
Distillable
Yes
Total Context
200,000
Max Output
128,000
Input Price
¥5 / 1M tokens
Output Price
¥22 / 1M tokens
Compare different providers across Zhinao API
28.00 tok/s
8.63 s
Uptime for z-ai/glm-5v-turbo across all providers
Zhinao API normalizes requests and responses across providers for you
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.360.cn/v1",
apiKey: process.env.ZHINAO_API_KEY,
});
const response = await client.chat.completions.create({
model: "z-ai/glm-5v-turbo",
messages: [
{ role: "user", content: "Hello, how are you?" }
],
temperature: 0.7,
max_tokens: 1000,
});
console.log(response.choices[0].message.content);