SmartBrain API

Enterprise AI as Simple as Utilities

More Links
Model HubConsoleAPI KeysUsage QueryDocs
SmartBrain API. All rights reserved|Privacy Policy|Terms
SmartBrain API
SmartBrain API
  • Model Hub
  • API Docs
  • Playground
G

z-ai/glm-4.5v

Online Chat
智谱

Publish time

2025/8/11

Model Series

GLM

Input type

Output type

Context Window

128,000

Max Output Length

2,048

Input Price

¥4 / 1M tokens

Output Price

¥12 / 1M tokens

GLM-4.5V 是智谱新一代基于 MOE 架构的视觉推理模型,以 106B 的总参数量和 12B 激活参数量,在各类基准测试中达到全球同级别开源多模态模型 SOTA,涵盖图像、视频、文档理解及 GUI 任务等常见任务。

Providers for z-ai/glm-4.5v

Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.

智
智谱
国内

TTFT

0.18s

Throughput

29.77tps

Uptime

100.00%

Provider Model

bigmodel/glm-4.5v

Supported Parameters

Recent Uptime

5月13日 10 AM100.00%
No dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo dataNo data5月12日 10 AM: 100.00%5月12日 12 PM: 100.00%5月12日 4 PM: 100.00%5月13日 10 AM: 100.00%

Reasoning

-

Supported Response Formats

OpenAI Chat CompletionsOpenAI ResponsesAnthropic MessagesGoogle VertexAI

Request Log Collection

-

Distillable

-

Total Context

128,000

Max Output

2,048

Input Price

¥4 / 1M tokens

Output Price

¥12 / 1M tokens

Performance for z-ai/glm-4.5v

Compare different providers across Zhinao API

Throughput

39.00 tok/s

TTFT

0.47 s

Uptime for z-ai/glm-4.5v

Uptime for z-ai/glm-4.5v across all providers

Sample code and API for z-ai/glm-4.5v

Get API Key

Zhinao API normalizes requests and responses across providers for you

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.360.cn/v1",
  apiKey: process.env.ZHINAO_API_KEY,
});

const response = await client.chat.completions.create({
  model: "z-ai/glm-4.5v",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ],
  temperature: 0.7,
  max_tokens: 1000,
});

console.log(response.choices[0].message.content);