千

alibaba/qwen-vl-max

Online Chat

阿里巴巴

Publish time

2025/2/1

Model Series

千问

Input type

Output type

Context Window

131,072

Max Output Length

2,048

Input Price

¥3 / 1M tokens

Output Price

¥9 / 1M tokens

相比qwen-vl-plus再次提升视觉推理和指令遵循能力，在更多复杂任务中提供最佳性能。

Providers for alibaba/qwen-vl-max

Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.

通

通义千问

国内

TTFT

No data

Throughput

11.79tps

Uptime

36.00%

Provider Model

qwen-vl-max

Supported Parameters

Recent Uptime

5月24日 11 PM36.21%

Reasoning

Supported Response Formats

OpenAI Chat CompletionsOpenAI ResponsesAnthropic MessagesGoogle VertexAI

Request Log Collection

Distillable

Total Context

131,072

Max Output

2,048

Input Price

¥3 / 1M tokens

Output Price

¥9 / 1M tokens

Performance for alibaba/qwen-vl-max

Compare different providers across Zhinao API

Throughput

9.54 tok/s

TTFT

0.77 s

Uptime for alibaba/qwen-vl-max

Uptime for alibaba/qwen-vl-max across all providers

Sample code and API for alibaba/qwen-vl-max

Get API Key

Zhinao API normalizes requests and responses across providers for you

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.360.cn/v1",
  apiKey: process.env.ZHINAO_API_KEY,
});

const response = await client.chat.completions.create({
  model: "alibaba/qwen-vl-max",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ],
  temperature: 0.7,
  max_tokens: 1000,
});

console.log(response.choices[0].message.content);

千

alibaba/qwen-vl-max

Online Chat

阿里巴巴

Publish time

2025/2/1

Model Series

千问

Input type

Output type

Context Window

131,072

Max Output Length

2,048

Input Price

¥3 / 1M tokens

Output Price

¥9 / 1M tokens

相比qwen-vl-plus再次提升视觉推理和指令遵循能力，在更多复杂任务中提供最佳性能。

Providers for alibaba/qwen-vl-max

Zhinao API routes requests to the best-fit provider and automatically fails over to the one with highest availability.

通

通义千问

国内

TTFT

No data

Throughput

11.79tps

Uptime

36.00%

Provider Model

qwen-vl-max

Supported Parameters

Recent Uptime

5月24日 11 PM36.21%

Reasoning

Supported Response Formats

OpenAI Chat CompletionsOpenAI ResponsesAnthropic MessagesGoogle VertexAI

Request Log Collection

Distillable

Total Context

131,072

Max Output

2,048

Input Price

¥3 / 1M tokens

Output Price

¥9 / 1M tokens

Performance for alibaba/qwen-vl-max

Compare different providers across Zhinao API

Throughput

9.54 tok/s

TTFT

0.77 s

Uptime for alibaba/qwen-vl-max

Uptime for alibaba/qwen-vl-max across all providers

Sample code and API for alibaba/qwen-vl-max

Get API Key

Zhinao API normalizes requests and responses across providers for you

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.360.cn/v1",
  apiKey: process.env.ZHINAO_API_KEY,
});

const response = await client.chat.completions.create({
  model: "alibaba/qwen-vl-max",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ],
  temperature: 0.7,
  max_tokens: 1000,
});

console.log(response.choices[0].message.content);