GLM-5.1

Authentication

authorization `string` required

All API endpoints use Bearer token authentication.

Get an API key:

Usage:

Add the following header to your request:

Authorization: Bearer YOUR_API_KEY
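As a minimal sketch, the header above could be assembled in Python as follows. The helper name `build_headers` and the environment variable `VTRIX_API_KEY` are hypothetical and chosen only for illustration; the `Content-Type` header mirrors the cURL example in this document.

```python
import os


def build_headers(api_key: str) -> dict[str, str]:
    """Return HTTP headers for an authenticated JSON request."""
    if not api_key:
        raise ValueError("API key must be a non-empty string")
    return {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }


# VTRIX_API_KEY is a hypothetical variable name; fall back to the placeholder.
api_key = os.environ.get("VTRIX_API_KEY") or "YOUR_API_KEY"
headers = build_headers(api_key)
print(headers["Authorization"])
```

Reading the key from the environment keeps it out of source control; any other secret store would work equally well.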

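Parameters

model `string` required

ID of the model to use for the request.

Value: glm-5.1

messages `array` required

Array of message objects representing the conversation history. Each message object has the following fields:

role `string` required

Role of the message author.

Allowed values: user, assistant, system, developer

content `string | array` required

Message content, either a text string or a multimodal array.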
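The sketch below shows one way a multi-turn `messages` array might be assembled, using only the roles listed above. The helper `make_message` is hypothetical, and the example sticks to plain string content because the structure of the multimodal array is not described here.

```python
ALLOWED_ROLES = {"user", "assistant", "system", "developer"}


def make_message(role: str, content) -> dict:
    """Build one message object after checking role and content type."""
    if role not in ALLOWED_ROLES:
        raise ValueError(f"unsupported role: {role!r}")
    if not isinstance(content, (str, list)):
        raise TypeError("content must be a string or an array")
    return {"role": role, "content": content}


# Conversation history, oldest message first.
messages = [
    make_message("system", "You are a concise assistant."),
    make_message("user", "Hello!"),
    make_message("assistant", "Hello! How can I help you?"),
    make_message("user", "Summarize our conversation so far."),
]
print(len(messages))
```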

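max_tokens `integer`

Maximum number of tokens to generate.

Range: 1 - 131072

temperature `number`

Sampling temperature. Higher values make the output more random; lower values make it more focused and deterministic.

Default: 0.7

Range: 0.0 - 2.0

top_p `number`

Nucleus sampling parameter, usable as an alternative to temperature sampling.

Default: 1.0

Range: 0.0 - 1.0

stream `boolean`

Whether to stream the response back incrementally.

Default: false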
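The wire format of a streamed response is not described in this document. The sketch below assumes the endpoint follows the OpenAI-style convention of server-sent events, where each event is a `data: {...}` line carrying a `choices[].delta.content` fragment and the stream ends with `data: [DONE]`. Both the chunk shape and the helper name `iter_stream_text` are assumptions, so verify them against real output before relying on this.

```python
import json
from typing import Iterable, Iterator


def iter_stream_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from OpenAI-style SSE lines (assumed format)."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank keep-alive lines and other SSE fields
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            break  # assumed end-of-stream marker
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text


# Hand-written sample lines standing in for a real streamed response.
sample_lines = [
    'data: {"choices": [{"index": 0, "delta": {"role": "assistant"}}]}',
    "",
    'data: {"choices": [{"index": 0, "delta": {"content": "Hel"}}]}',
    'data: {"choices": [{"index": 0, "delta": {"content": "lo!"}}]}',
    "data: [DONE]",
]
print("".join(iter_stream_text(sample_lines)))
```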

frequency_penalty `number`

Frequency penalty, used to reduce repeated tokens.

Default: 0.0

Range: -2.0 - 2.0

presence_penalty `number`

Presence penalty, used to encourage the model to introduce new topics.

Default: 0.0

Range: -2.0 - 2.0
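The numeric ranges listed for `max_tokens`, `temperature`, `top_p`, `frequency_penalty`, and `presence_penalty` can be checked on the client before a request is sent. The sketch below is one possible approach; the helper `check_sampling_params` is hypothetical, and it assumes the documented bounds are inclusive.

```python
# (min, max) bounds copied from the parameter reference; assumed inclusive.
RANGES = {
    "max_tokens": (1, 131072),
    "temperature": (0.0, 2.0),
    "top_p": (0.0, 1.0),
    "frequency_penalty": (-2.0, 2.0),
    "presence_penalty": (-2.0, 2.0),
}


def check_sampling_params(params: dict) -> dict:
    """Return params unchanged if every value lies within its range."""
    for name, value in params.items():
        if name not in RANGES:
            raise KeyError(f"unknown parameter: {name}")
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            raise TypeError(f"{name} must be a number")
        if name == "max_tokens" and not isinstance(value, int):
            raise TypeError("max_tokens must be an integer")
        low, high = RANGES[name]
        if not low <= value <= high:
            raise ValueError(f"{name}={value} is outside {low} - {high}")
    return params


params = check_sampling_params(
    {"max_tokens": 1024, "temperature": 0.7, "top_p": 1.0, "presence_penalty": 0.5}
)
print(params)
```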

stop `string | array`

Sequences at which the model stops generating. Up to 4 strings are supported.
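Because `stop` accepts either a single string or an array, a small normalizing helper can enforce the 4-string limit in one place. This is only a sketch, and `normalize_stop` is a hypothetical name.

```python
MAX_STOP_SEQUENCES = 4  # limit stated in the parameter reference


def normalize_stop(stop):
    """Return stop sequences as a list of strings, or None if unset."""
    if stop is None:
        return None
    sequences = [stop] if isinstance(stop, str) else list(stop)
    if not all(isinstance(s, str) for s in sequences):
        raise TypeError("every stop sequence must be a string")
    if len(sequences) > MAX_STOP_SEQUENCES:
        raise ValueError(f"at most {MAX_STOP_SEQUENCES} stop sequences are supported")
    return sequences


print(normalize_stop("END"))
print(normalize_stop(["\n\n", "###"]))
```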

response_format `object`

Format specification for the model output.

type `string` required

Output format type.

Allowed values: text, json_object
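As an illustration, a request body that asks for `json_object` output might look like the sketch below. It assumes that in this mode the assistant message content arrives as a JSON-encoded string that the client parses itself; the prompt text, the helper `parse_json_content`, and the simulated reply are all made up for the example.

```python
import json

payload = {
    "model": "glm-5.1",
    "messages": [
        {"role": "system", "content": "Reply with a JSON object with keys 'city' and 'country'."},
        {"role": "user", "content": "Where is the Eiffel Tower?"},
    ],
    "response_format": {"type": "json_object"},
}


def parse_json_content(content: str) -> dict:
    """Parse message content that is expected to hold one JSON object."""
    parsed = json.loads(content)
    if not isinstance(parsed, dict):
        raise ValueError("expected a JSON object at the top level")
    return parsed


# Simulated message content; a real reply would come from the API.
simulated_content = '{"city": "Paris", "country": "France"}'
result = parse_json_content(simulated_content)
print(result["city"])
```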

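tools `array`

List of tools the model may call. Each tool object has the following fields:

type `string` required

Tool type.

Value: function

function `object` required

Function definition.

tool_choice `string | object`

Controls which tool the model calls.

Allowed values: none, auto, or a specific tool object

Default: auto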
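The fields inside the `function` object and the shape of a specific `tool_choice` object are not spelled out in this document. The sketch below assumes the OpenAI-style convention (`name`, `description`, and a JSON Schema under `parameters`); the `get_weather` tool is hypothetical and exists only for illustration.

```python
import json

# Hypothetical tool; the name/description/parameters layout is an assumption.
weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

payload = {
    "model": "glm-5.1",
    "messages": [{"role": "user", "content": "What is the weather in Beijing?"}],
    "tools": [weather_tool],
    "tool_choice": "auto",  # documented default
}

# Assumed shape for forcing one specific tool instead of "auto".
forced_choice = {"type": "function", "function": {"name": "get_weather"}}

print(json.dumps(payload, indent=2))
```

Leaving `tool_choice` at `auto` lets the model decide whether to call the tool; `none` disables tool calls for that request.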

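Response Format

error `object`

Error information. Present only when the status is failed.

code `string`

Error code.

message `string`

Detailed error message.

output `array`

Generation results. Present only when the status is completed.

content `array`

List of generated resource contents.

type `string`

Resource type.

Value: image | video

url `string`

URL of the processed resource.

jobId `string`

Remote job ID.

usage `object`

Usage statistics. Present only when the status is completed.

cost `string`

Total cost (USD).

discount `number`

Discount amount.

metadata `object`

Metadata.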
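A client might handle the `error` and `usage` objects along the lines of the sketch below. It assumes both appear as top-level keys of the parsed response body, which this document does not state explicitly. `ApiError`, `raise_for_error`, `total_cost_usd`, and the sample values are hypothetical.

```python
from decimal import Decimal


class ApiError(Exception):
    """Raised when the response body carries an error object."""

    def __init__(self, code, message):
        super().__init__(f"{code}: {message}")
        self.code = code
        self.message = message


def raise_for_error(body: dict) -> None:
    """Raise ApiError if the body contains a non-empty error object."""
    error = body.get("error")
    if error:
        raise ApiError(error.get("code"), error.get("message"))


def total_cost_usd(body: dict):
    """Return usage.cost as a Decimal, or None when usage is absent."""
    usage = body.get("usage")
    if not usage or usage.get("cost") is None:
        return None
    # cost is documented as a string, so parse it without float rounding.
    return Decimal(usage["cost"])


# Made-up bodies for illustration only.
completed = {"usage": {"cost": "0.0021", "discount": 0}}
failed = {"error": {"code": "example_error", "message": "Something went wrong."}}

raise_for_error(completed)
print(total_cost_usd(completed))
```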

    curl --location 'https://cloud.vtrix.ai/llm/chat/completions' \
      --header 'Authorization: Bearer YOUR_API_KEY' \
      --header 'Content-Type: application/json' \
      --data '{
        "model": "glm-5.1",
        "messages": [{"role": "user", "content": "Hello!"}]
      }'

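    import openai

    # Point the OpenAI SDK client at the endpoint base URL.
    client = openai.OpenAI(
        api_key="YOUR_API_KEY",
        base_url="https://cloud.vtrix.ai/llm"
    )

    # Send a single-turn chat completion request.
    response = client.chat.completions.create(
        model="glm-5.1",
        messages=[{"role": "user", "content": "Hello!"}]
    )

    # Print the assistant's reply text.
    print(response.choices[0].message.content)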

    import OpenAI from 'openai';

    // Point the OpenAI SDK client at the endpoint base URL.
    const client = new OpenAI({
      apiKey: 'YOUR_API_KEY',
      baseURL: 'https://cloud.vtrix.ai/llm'
    });

    // Send a single-turn chat completion request.
    const response = await client.chat.completions.create({
      model: 'glm-5.1',
      messages: [{ role: 'user', content: 'Hello!' }]
    });

    // Print the assistant's reply text.
    console.log(response.choices[0].message.content);

    {
      "id": "chatcmpl-abc123",
      "object": "chat.completion",
      "created": 1699896916,
      "model": "glm-5.1",
      "choices": [
        {
          "index": 0,
          "message": {
            "role": "assistant",
            "content": "Hello! How can I help you?"
          },
          "finish_reason": "stop"
        }
      ],
      "usage": {
        "prompt_tokens": 10,
        "completion_tokens": 20,
        "total_tokens": 30
      }
    }
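The sample response above can be read with the standard `json` module. The sketch below embeds a copy of that sample, with the reply text in English, and extracts the reply, the finish reason, and the token counts; the variable names are illustrative.

```python
import json

# Copy of the sample chat completion response from this document.
sample = """
{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "created": 1699896916,
  "model": "glm-5.1",
  "choices": [
    {
      "index": 0,
      "message": {"role": "assistant", "content": "Hello! How can I help you?"},
      "finish_reason": "stop"
    }
  ],
  "usage": {"prompt_tokens": 10, "completion_tokens": 20, "total_tokens": 30}
}
"""

body = json.loads(sample)
choice = body["choices"][0]
usage = body["usage"]

print(choice["message"]["content"])  # Hello! How can I help you?
print(choice["finish_reason"])       # stop
print(usage["total_tokens"])         # 30
```

In this sample, `total_tokens` equals `prompt_tokens` plus `completion_tokens`.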
