Most Context Per Dollar
Models ranked by context window size relative to input token cost. Higher ratios mean more processing capacity for less money.
Methodology: Computed as context_window_tokens / input_price_per_1m. Only models with both values included. Higher is better.
| # | Model | Provider | Metric |
|---|---|---|---|
| 1 | Gemini 1.5 Flash | Google DeepMind | 13,333,333 tokens/$1 |
| 2 | Gemini 2.0 Flash | Google DeepMind | 10,000,000 tokens/$1 |
| 3 | Gemini 1.5 Pro | Google DeepMind | 1,600,000 tokens/$1 |
| 4 | Llama 3.3 70B | Meta | 1,280,000 tokens/$1 |
| 5 | Qwen 2.5 72B | Alibaba Cloud (Qwen) | 1,066,667 tokens/$1 |
| 6 | GPT-4o mini | OpenAI | 853,333 tokens/$1 |
| 7 | Claude 3 Haiku | Anthropic | 800,000 tokens/$1 |
| 8 | Kimi K2.5 | — | 500,000 tokens/$1 |
| 9 | DeepSeek V3.2 | — | 492,308 tokens/$1 |
| 10 | DeepSeek V3 | DeepSeek | 474,074 tokens/$1 |
| 11 | Claude 3.5 Haiku | Anthropic | 250,000 tokens/$1 |
| 12 | ERNIE 4.0 | — | 152,381 tokens/$1 |
| 13 | Mistral 7B | Mistral AI | 131,072 tokens/$1 |
| 14 | DeepSeek R1 | DeepSeek | 91,429 tokens/$1 |
| 15 | Claude 3 Sonnet | Anthropic | 66,667 tokens/$1 |
| 16 | Claude Sonnet 4.6 | Anthropic | 66,667 tokens/$1 |
| 17 | Claude 3.5 Sonnet | Anthropic | 66,667 tokens/$1 |
| 18 | Gemini 1.0 Pro | Google DeepMind | 65,536 tokens/$1 |
| 19 | Grok 2 | xAI | 65,536 tokens/$1 |
| 20 | Mistral Large 2 | Mistral AI | 64,000 tokens/$1 |
| 21 | Mixtral 8x7B | Mistral AI | 60,681 tokens/$1 |
| 22 | GPT-4o | OpenAI | 51,200 tokens/$1 |
| 23 | Command R+ | Cohere | 51,200 tokens/$1 |
| 24 | Llama 3.1 405B | Meta | 25,600 tokens/$1 |
| 25 | Llama 3 70B | Meta | 16,063 tokens/$1 |
| 26 | Claude 3 Opus | Anthropic | 13,333 tokens/$1 |
| 27 | o1 | OpenAI | 13,333 tokens/$1 |
| 28 | GPT-4 Turbo | OpenAI | 12,800 tokens/$1 |
| 29 | Claude 2 | Anthropic | 12,500 tokens/$1 |
| 30 | GPT-3.5 Turbo | OpenAI | 10,923 tokens/$1 |
| 31 | Llama 2 70B | Meta | 4,551 tokens/$1 |
| 32 | GPT-4 | OpenAI | 273 tokens/$1 |
| 33 | GPT-3 (davinci-002) | OpenAI | 68 tokens/$1 |