Pricing Docs Changelogs Templates Models Dashboard

GLM-4.6

Supports text, vision, thinking, tool-use, function-calling, structured-outputs

Input Modalities

text

vision

Features

thinking

tool use

function calling

structured outputs

Specifications

context window

204,800

max output tokens

131,072

Latency

1.67s

Throughput

34.37 TPS

Pricing

Pricing is based on the number of tokens used, or other metrics based on the model type.

Per 1M tokens

Input

$0.45

Output

$1.90