Back to Models
GPT

GPT OSS 120B

Supports text, thinking, function-calling, structured-outputs

Input Modalities
text
Features
thinking
function calling
structured outputs

Specifications

context window
131,072
max output tokens
32,768
Latency
13.07s
Throughput
1101.00 TPS

Pricing

Pricing is based on the number of tokens used, or other metrics based on the model type.

Per 1M tokens
Input
$0.35
Output
$0.75