Max output tokens
Maximum number of tokens the model can generate in a single response
What is max output tokens?
Max output tokens is the maximum number of tokens a model can generate in a single response. This is a hard limit — the model stops at this boundary even if the response is incomplete. Most current models support 4,096–16,384 output tokens. If you ask for a 10,000-word document and the model's max output is 4,096 tokens (~3,000 words), it will stop mid-sentence. For long-form generation, you need to chain multiple calls.
Why it matters
If you are generating long documents, code files, or detailed analyses, the max output token limit determines whether you get a complete response or a truncated one. A model with 4,096 max output generates roughly 3,000 words before stopping. For a 10,000-word report, you need to chain 3–4 calls. sourc.dev tracks this attribute where documented by providers.
Where models stand
Data available for 2 of 271 tracked entities.
How sourc.dev tracks this
sourc.dev verifies max output tokens manually from official provider documentation, API responses, and published specifications. Every data point includes a source URL and verification date. When a value changes, the old value is preserved in the history table and the new value is recorded alongside it. Nothing is overwritten — the full timeline is always available.
sourc.dev verifies this attribute manually from provider documentation. Every data point includes a source URL and verification date. Changes are recorded in the history table — nothing is overwritten.
This attribute is verified periodically against provider documentation. When sourc.dev detects a change, the new value is recorded alongside the old one with full provenance.
Understanding max output tokens helps developers make informed decisions when choosing between models and providers. Rather than relying on marketing claims, sourc.dev provides verified, dated, source-linked data so the data decides.