z-ai/glm-4.5v

Online Chat

智谱

Publish time

2025/8/11

Model Series

GLM

Input type

Output type

Context Window

128,000

Max Output Length

2,048

Input Price

¥4 / 1M tokens

Output Price

¥12 / 1M tokens

GLM-4.5V 是智谱新一代基于 MOE 架构的视觉推理模型，以 106B 的总参数量和 12B 激活参数量，在各类基准测试中达到全球同级别开源多模态模型 SOTA，涵盖图像、视频、文档理解及 GUI 任务等常见任务。

Providers for z-ai/glm-4.5v

Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.

智

智谱

国内

TTFT

0.18s

Throughput

29.77tps

Uptime

100.00%

Provider Model

bigmodel/glm-4.5v

Supported Parameters

Recent Uptime

5月13日 10 AM100.00%

Reasoning

Supported Response Formats

OpenAI Chat CompletionsOpenAI ResponsesAnthropic MessagesGoogle VertexAI

Request Log Collection

Distillable

Total Context

128,000

Max Output

2,048

Input Price

¥4 / 1M tokens

Output Price

¥12 / 1M tokens

Performance for z-ai/glm-4.5v

Compare different providers across Zhinao API

Throughput

39.00 tok/s

TTFT

0.47 s

Uptime for z-ai/glm-4.5v

Uptime for z-ai/glm-4.5v across all providers

Sample code and API for z-ai/glm-4.5v

Get API Key

Zhinao API normalizes requests and responses across providers for you

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.360.cn/v1",
  apiKey: process.env.ZHINAO_API_KEY,
});

const response = await client.chat.completions.create({
  model: "z-ai/glm-4.5v",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ],
  temperature: 0.7,
  max_tokens: 1000,
});

console.log(response.choices[0].message.content);

z-ai/glm-4.5v

Online Chat

智谱

Publish time

2025/8/11

Model Series

GLM

Input type

Output type

Context Window

128,000

Max Output Length

2,048

Input Price

¥4 / 1M tokens

Output Price

¥12 / 1M tokens

Providers for z-ai/glm-4.5v

Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.

智

智谱

国内

TTFT

0.18s

Throughput

29.77tps

Uptime

100.00%

Provider Model

bigmodel/glm-4.5v

Supported Parameters

Recent Uptime

5月13日 10 AM100.00%

Reasoning

Supported Response Formats

OpenAI Chat CompletionsOpenAI ResponsesAnthropic MessagesGoogle VertexAI

Request Log Collection

Distillable

Total Context

128,000

Max Output

2,048

Input Price

¥4 / 1M tokens

Output Price

¥12 / 1M tokens

Performance for z-ai/glm-4.5v

Compare different providers across Zhinao API

Throughput

39.00 tok/s

TTFT

0.47 s

Uptime for z-ai/glm-4.5v

Uptime for z-ai/glm-4.5v across all providers

Sample code and API for z-ai/glm-4.5v

Get API Key

Zhinao API normalizes requests and responses across providers for you

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.360.cn/v1",
  apiKey: process.env.ZHINAO_API_KEY,
});

const response = await client.chat.completions.create({
  model: "z-ai/glm-4.5v",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ],
  temperature: 0.7,
  max_tokens: 1000,
});

console.log(response.choices[0].message.content);