GLM (Zhipu)

GLM-4.6

Supports text, vision, thinking, tool-use, function-calling, structured-outputs

Input Modalities: text, vision
Features: thinking, tool use, function calling, structured outputs

Specifications

Context window: 204,800 tokens
Max output tokens: 131,072
Latency: 1.67s
Throughput: 34.37 TPS (tokens per second)
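
The figures above can be turned into a rough planning aid. The sketch below is not an official calculator: it assumes the context window is shared between input and output tokens, that the listed latency is the delay before the first token, and that the listed throughput is a steady output rate. The function names are hypothetical.

```python
# Specification values copied from the listing above.
CONTEXT_WINDOW = 204_800      # tokens
MAX_OUTPUT_TOKENS = 131_072   # tokens
LATENCY_S = 1.67              # seconds (assumed: time to first token)
THROUGHPUT_TPS = 34.37        # tokens per second (assumed: steady output rate)


def fits_limits(input_tokens: int, max_output_tokens: int) -> bool:
    """Check a request against the listed limits.

    Assumption: input and output tokens share the same context window.
    """
    if input_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    within_output_cap = max_output_tokens <= MAX_OUTPUT_TOKENS
    within_window = input_tokens + max_output_tokens <= CONTEXT_WINDOW
    return within_output_cap and within_window


def estimate_response_seconds(output_tokens: int) -> float:
    """Estimate wall-clock time: initial latency plus generation time."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return LATENCY_S + output_tokens / THROUGHPUT_TPS
```

Under these assumptions, a 1,000-token response would take roughly 30.8 seconds, and a request with 100,000 input tokens and the full 131,072-token output cap would not fit in the context window. Measured latency and throughput vary with load, so treat the result as an order-of-magnitude estimate.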

Pricing

Pricing is based on the number of tokens used; depending on the model type, other metrics may apply.

Per 1M tokens
Input: $0.45
Output: $1.90
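
As a worked example of the per-token pricing, the sketch below estimates the cost of a single request. It assumes every token is billed at the flat listed rate, with no discounts, minimums, or other metrics; `estimate_cost_usd` is a hypothetical helper, not part of any SDK.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens.
INPUT_USD_PER_MTOK = Decimal("0.45")
OUTPUT_USD_PER_MTOK = Decimal("1.90")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate request cost in USD, assuming flat per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MTOK
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MTOK
    return (input_cost + output_cost) / TOKENS_PER_MTOK
```

For instance, a request with 200,000 input tokens and 10,000 output tokens comes to about $0.109 ($0.09 for input plus $0.019 for output). `Decimal` is used here to avoid floating-point rounding in currency arithmetic.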