DeepSeek V4 Flash

Authentication

authorization `string` required

All APIs require authentication via Bearer Token.

Get API Key:

Visit the API Key Management page to get your API key.

Usage:

Add to request header:

Authorization: Bearer YOUR_API_KEY
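As a minimal sketch of the header shown above, the helper below builds the request headers from an API key. The function name `build_headers` and the environment variable `VTRIX_API_KEY` are illustrative assumptions and not part of the API.

```python
import os


def build_headers(api_key: str) -> dict[str, str]:
    """Return HTTP headers carrying the Bearer token."""
    if not api_key:
        raise ValueError("API key must be a non-empty string")
    return {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }


# VTRIX_API_KEY is a hypothetical variable name; fall back to the placeholder.
headers = build_headers(os.environ.get("VTRIX_API_KEY") or "YOUR_API_KEY")
```

Reading the key from the environment rather than hard-coding it keeps the secret out of source control.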

Parameters

model `string` required

Model ID to use for the request.

Value: deepseek-v4-flash

messages `array` required

Array of message objects representing the conversation history.

role string required

Message role.

Options: user, assistant, system, developer

content string | array required

Text string or multimodal array.
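To illustrate the `messages` structure, here is a small sketch that assembles a multi-turn conversation and rejects roles not listed above. The helper `make_message` is a hypothetical name introduced for this example only.

```python
# Roles listed in the parameter reference above.
ALLOWED_ROLES = {"user", "assistant", "system", "developer"}


def make_message(role: str, content) -> dict:
    """Build one message object, rejecting roles the reference does not list."""
    if role not in ALLOWED_ROLES:
        raise ValueError(f"unsupported role: {role!r}")
    if not isinstance(content, (str, list)):
        raise TypeError("content must be a string or an array")
    return {"role": role, "content": content}


# Conversation history is sent oldest-first; the last entry is the new turn.
messages = [
    make_message("system", "You are a concise assistant."),
    make_message("user", "Hello!"),
    make_message("assistant", "Hello! How can I help you today?"),
    make_message("user", "Summarize our conversation so far."),
]
```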

max_tokens `integer`

Maximum tokens to generate in the completion.

Range: 1 - 384000

temperature `number`

Sampling temperature. Higher values make output more random, lower values make it more focused and deterministic.

Default: 1.0

Range: 0.0 - 2.0

top_p `number`

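Nucleus sampling threshold. The model samples only from the smallest set of tokens whose cumulative probability reaches top_p. An alternative to temperature sampling.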

Default: 1.0

Range: 0.0 - 1.0

stream `boolean`

Whether to stream the response incrementally.

Default: false
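The sketch below shows one way to assemble streamed text. It assumes the endpoint emits OpenAI-compatible chunks, where each chunk exposes `choices[0].delta.content`; this page does not document the chunk shape, so treat that as an assumption. `collect_stream` and `fake_chunk` are hypothetical helpers, and the fake chunks stand in for a live response so the example runs offline.

```python
from types import SimpleNamespace
from typing import Iterable


def collect_stream(chunks: Iterable) -> str:
    """Concatenate the text deltas from an iterable of streamed chunks."""
    parts = []
    for chunk in chunks:
        if not chunk.choices:
            continue  # some chunks may carry no choices
        text = getattr(chunk.choices[0].delta, "content", None)
        if text:
            parts.append(text)
    return "".join(parts)


def fake_chunk(text):
    """Stand-in for a streamed chunk (assumed OpenAI-compatible shape)."""
    delta = SimpleNamespace(content=text)
    return SimpleNamespace(choices=[SimpleNamespace(delta=delta)])


demo = collect_stream([fake_chunk("Hel"), fake_chunk("lo"), fake_chunk(None)])
```

With the `openai` client, the iterable returned by `client.chat.completions.create(..., stream=True)` could be passed to `collect_stream` in place of the fake chunks.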

frequency_penalty `number`

Penalty for token frequency to reduce repetition.

Default: 0.0

Range: -2.0 - 2.0

presence_penalty `number`

Penalty for token presence to encourage new topics.

Default: 0.0

Range: -2.0 - 2.0
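The documented ranges can be checked on the client before a request is sent. The following is a sketch only: `validate_params` is a hypothetical helper, and the server may apply its own validation rules beyond these bounds.

```python
# Inclusive ranges copied from the parameter reference above.
RANGES = {
    "max_tokens": (1, 384000),
    "temperature": (0.0, 2.0),
    "top_p": (0.0, 1.0),
    "frequency_penalty": (-2.0, 2.0),
    "presence_penalty": (-2.0, 2.0),
}


def validate_params(params: dict) -> dict:
    """Raise if a documented numeric parameter falls outside its range."""
    for name, value in params.items():
        if name not in RANGES:
            continue  # parameters without a documented range are not checked
        if name == "max_tokens" and (isinstance(value, bool) or not isinstance(value, int)):
            raise TypeError("max_tokens must be an integer")
        low, high = RANGES[name]
        if not low <= value <= high:
            raise ValueError(f"{name}={value} is outside [{low}, {high}]")
    return params


options = validate_params({"max_tokens": 256, "temperature": 0.7, "top_p": 0.9})
```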

stop `string | array`

Sequence(s) where the model will stop generating further tokens.

response_format `object`

Format specification for the model output.

type string required

Output format type.

Options: text, json_object
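As an illustration of `response_format`, this sketch builds a request body that asks for `json_object` output and parses the reply. The helper names are hypothetical. Instructing the model in the system message to reply in JSON is a common practice with OpenAI-compatible APIs and is an assumption here, not a requirement stated on this page.

```python
import json


def build_json_request(prompt: str) -> dict:
    """Build a request body asking for a JSON object reply."""
    return {
        "model": "deepseek-v4-flash",
        "messages": [
            {"role": "system", "content": "Reply with a single JSON object."},
            {"role": "user", "content": prompt},
        ],
        "response_format": {"type": "json_object"},
    }


def parse_json_reply(content: str) -> dict:
    """Decode the assistant's message content and check it is an object."""
    data = json.loads(content)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    return data


body = build_json_request("List three primary colors under the key 'colors'.")
```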

tools `array`

List of tools the model may call.

type string required

Tool type.

Value: function

function object required

Function definition.

tool_choice `string | object`

Controls which tool is called by the model.

Options: none, auto, required, or a specific tool object

Default: auto
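The reference above states only that each tool has `type: function` and a `function` object. The sketch below fills in the function definition using the OpenAI-style fields `name`, `description`, and `parameters` (a JSON Schema); those fields, the `get_weather` tool, and the shape of the specific tool object returned by `force_tool` are all assumptions for illustration.

```python
# Hypothetical tool; the fields inside "function" follow the OpenAI convention
# and are not confirmed by this page.
get_weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}


def force_tool(name: str) -> dict:
    """Assumed shape of a specific tool object for tool_choice."""
    return {"type": "function", "function": {"name": name}}


payload = {
    "model": "deepseek-v4-flash",
    "messages": [{"role": "user", "content": "What is the weather in Paris?"}],
    "tools": [get_weather_tool],
    "tool_choice": "auto",  # or "none", "required", or force_tool("get_weather")
}
```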

Response Format

error `object`

Error information, only present when status is failed

code string

Error code

message string

Detailed error message

output `array`

Generation results, only present when status is completed

content array

List of generated resource content

type string

Resource type

Options: image, video

url string

Processed resource URL

jobId string

Remote task ID

usage `object`

Usage statistics, only present when status is completed

cost string

Total cost in USD

discount number

Discount amount

metadata `object`

Metadata information

    curl --location 'https://cloud.vtrix.ai/llm/chat/completions' \
      --header 'Authorization: Bearer YOUR_API_KEY' \
      --header 'Content-Type: application/json' \
      --data '{
        "model": "deepseek-v4-flash",
        "messages": [{"role": "user", "content": "Hello!"}]
      }'

    import openai

    client = openai.OpenAI(
        api_key="YOUR_API_KEY",
        base_url="https://cloud.vtrix.ai/llm"
    )

    response = client.chat.completions.create(
        model="deepseek-v4-flash",
        messages=[{"role": "user", "content": "Hello!"}]
    )

    print(response.choices[0].message.content)

    import OpenAI from 'openai';

    const client = new OpenAI({
      apiKey: 'YOUR_API_KEY',
      baseURL: 'https://cloud.vtrix.ai/llm'
    });

    const response = await client.chat.completions.create({
      model: 'deepseek-v4-flash',
      messages: [{ role: 'user', content: 'Hello!' }]
    });

    console.log(response.choices[0].message.content);

    {
      "id": "chatcmpl-abc123",
      "object": "chat.completion",
      "created": 1699896916,
      "model": "deepseek-v4-flash",
      "choices": [
        {
          "index": 0,
          "message": {
            "role": "assistant",
            "content": "Hello! How can I help you today?"
          },
          "finish_reason": "stop"
        }
      ],
      "usage": {
        "prompt_tokens": 10,
        "completion_tokens": 20,
        "total_tokens": 30
      }
    }
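A response body like the one above can be unpacked with a small helper. This sketch assumes a failed request carries the `error` object with `code` and `message` described under Response Format, and that a successful one follows the sample shape; `extract_text` is a hypothetical name.

```python
def extract_text(body: dict) -> str:
    """Return the assistant's reply, or raise if the body reports an error."""
    error = body.get("error")
    if error:
        raise RuntimeError(f"{error.get('code')}: {error.get('message')}")
    return body["choices"][0]["message"]["content"]


# Trimmed copy of the sample response shown above.
sample = {
    "id": "chatcmpl-abc123",
    "object": "chat.completion",
    "model": "deepseek-v4-flash",
    "choices": [
        {
            "index": 0,
            "message": {"role": "assistant", "content": "Hello! How can I help you today?"},
            "finish_reason": "stop",
        }
    ],
    "usage": {"prompt_tokens": 10, "completion_tokens": 20, "total_tokens": 30},
}

reply = extract_text(sample)
```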
