z-ai/glm-5v-turbo

Zhipu (智谱)

Publish time

2026/4/1

Model Series

GLM

Input type

Output type

Context Window

200,000

Max Output Length

128,000

Input Price

¥5 / 1M tokens

Output Price

¥22 / 1M tokens

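GLM-5V-Turbo is Zhipu's first multimodal Agent foundation model. It is deeply optimized for visual programming and complex task scenarios, accepts multimodal input including images, video, text, and files, and strengthens visual understanding, long-horizon planning, and action execution. Compared with general-purpose multimodal models, it is better suited to Agent workflows, closing the full loop of "environment perception → task planning → execution" so that multimodal capability moves from "can understand what it sees" to "can act on it".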

Providers for z-ai/glm-5v-turbo

Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.

Zhipu (智谱)

Domestic (mainland China)

TTFT

8.28 s

Throughput

30.04 tok/s

Uptime

100.00%

Provider Model

bigmodel/z-ai/glm-5v-turbo

Supported Parameters

temperature, top_p, top_k

Recent Uptime

May 14, 11 PM: 100.00%

Reasoning

Toggleable

Supported Response Formats

OpenAI Chat Completions

Request Log Collection

ZDR Supported

Distillable

Yes

Total Context

200,000

Max Output

128,000

Input Price

¥5 / 1M tokens

Output Price

¥22 / 1M tokens
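
The listed prices and limits can be turned into a quick budget check. The following is a minimal sketch, not part of the Zhinao API: the helper names `estimateCostCny` and `fitsLimits` are hypothetical, and it assumes that the 200,000-token total context covers the prompt and the generated output together.

```typescript
// Listed prices for z-ai/glm-5v-turbo, in CNY per 1M tokens.
const INPUT_PRICE_CNY_PER_M = 5;
const OUTPUT_PRICE_CNY_PER_M = 22;

// Listed limits, in tokens.
const TOTAL_CONTEXT = 200_000;
const MAX_OUTPUT = 128_000;

// Hypothetical helper: estimated cost in CNY of a single request.
function estimateCostCny(inputTokens: number, outputTokens: number): number {
  return (
    (inputTokens * INPUT_PRICE_CNY_PER_M +
      outputTokens * OUTPUT_PRICE_CNY_PER_M) /
    1_000_000
  );
}

// Hypothetical helper. Assumption: the total context is shared by
// the prompt and the output, so both must fit inside it together.
function fitsLimits(inputTokens: number, maxTokens: number): boolean {
  return maxTokens <= MAX_OUTPUT && inputTokens + maxTokens <= TOTAL_CONTEXT;
}

// 100,000 input tokens and 1,000 output tokens.
console.log(estimateCostCny(100_000, 1_000).toFixed(3)); // "0.522"

// 100,000 + 128,000 exceeds the 200,000-token total context.
console.log(fitsLimits(100_000, 128_000)); // false
```

Under that assumption, a request that asks for the full 128,000 output tokens leaves at most 72,000 tokens for the prompt.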

Performance for z-ai/glm-5v-turbo

Compare different providers across Zhinao API

Throughput

28.00 tok/s

TTFT

8.63 s

Uptime for z-ai/glm-5v-turbo

Uptime for z-ai/glm-5v-turbo across all providers

Sample code and API for z-ai/glm-5v-turbo

Zhinao API normalizes requests and responses across providers for you

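// Uses the OpenAI Node SDK, pointed at the Zhinao API endpoint.
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.360.cn/v1",
  apiKey: process.env.ZHINAO_API_KEY, // read the key from the environment
});

// Top-level await requires an ES module context.
const response = await client.chat.completions.create({
  model: "z-ai/glm-5v-turbo",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ],
  temperature: 0.7,
  max_tokens: 1000,
});

// Print the text of the first completion choice.
console.log(response.choices[0].message.content);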
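
Since the model is described as accepting image input, a request with an image might look like the sketch below. It uses the content-part shapes of the OpenAI Chat Completions format (`text` and `image_url` parts). Whether Zhinao API accepts exactly this shape for this model is an assumption, and the helper `buildImageRequest` and the example URL are hypothetical.

```typescript
// Content-part shapes from the OpenAI Chat Completions format.
type TextPart = { type: "text"; text: string };
type ImagePart = { type: "image_url"; image_url: { url: string } };
type UserMessage = { role: "user"; content: Array<TextPart | ImagePart> };

interface ChatRequest {
  model: string;
  messages: UserMessage[];
  temperature: number;
  max_tokens: number;
}

// Hypothetical helper: builds a request body that pairs a text
// prompt with one image referenced by URL.
function buildImageRequest(prompt: string, imageUrl: string): ChatRequest {
  return {
    model: "z-ai/glm-5v-turbo",
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: prompt },
          { type: "image_url", image_url: { url: imageUrl } },
        ],
      },
    ],
    temperature: 0.7,
    max_tokens: 1000,
  };
}

// Placeholder URL; replace with a reachable image.
const request = buildImageRequest(
  "Describe the layout of this page.",
  "https://example.com/screenshot.png",
);
console.log(JSON.stringify(request, null, 2));
```

The resulting object has the same fields as the text-only sample, so in principle it can be passed to `client.chat.completions.create` in place of the inline request.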
