sourc.dev
Home LLMs Tools SaaS APIs
Claude 3.5 Sonnet input $3.00/1M ↓ -50%
GPT-4o input $2.50/1M
Gemini 1.5 Pro input $1.25/1M
Mistral Large input $2.00/1M ↓ -33%
DeepSeek V3 input $0.27/1M
synced 2026-04-05
Claude 3.5 Sonnet input $3.00/1M ↓ -50%
GPT-4o input $2.50/1M
Gemini 1.5 Pro input $1.25/1M
Mistral Large input $2.00/1M ↓ -33%
DeepSeek V3 input $0.27/1M
synced 2026-04-05
Learn

Max output tokens

Maximum number of tokens the model can generate in a single response

What is max output tokens?

Max output tokens is the maximum number of tokens a model can generate in a single response. This is a hard limit — the model stops at this boundary even if the response is incomplete. Most current models support 4,096–16,384 output tokens. If you ask for a 10,000-word document and the model's max output is 4,096 tokens (~3,000 words), it will stop mid-sentence. For long-form generation, you need to chain multiple calls.

Why it matters

If you are generating long documents, code files, or detailed analyses, the max output token limit determines whether you get a complete response or a truncated one. A model with 4,096 max output generates roughly 3,000 words before stopping. For a 10,000-word report, you need to chain 3–4 calls. sourc.dev tracks this attribute where documented by providers.

Where models stand

#1
4,096 tokens
#2
4,096 tokens

Data available for 2 of 271 tracked entities.

How sourc.dev tracks this

sourc.dev verifies max output tokens manually from official provider documentation, API responses, and published specifications. Every data point includes a source URL and verification date. When a value changes, the old value is preserved in the history table and the new value is recorded alongside it. Nothing is overwritten — the full timeline is always available.

Related
Frequently asked questions
FAQ How does sourc.dev measure max output tokens?

sourc.dev verifies this attribute manually from provider documentation. Every data point includes a source URL and verification date. Changes are recorded in the history table — nothing is overwritten.

FAQ How often is max output tokens updated?

This attribute is verified periodically against provider documentation. When sourc.dev detects a change, the new value is recorded alongside the old one with full provenance.

FAQ Why does max output tokens matter for developers?

Understanding max output tokens helps developers make informed decisions when choosing between models and providers. Rather than relying on marketing claims, sourc.dev provides verified, dated, source-linked data so the data decides.