Data
LLM API Pricing Tracker
Current API prices per 1 million tokens for every major large language model, collected from official provider pricing pages and tracked over time. When a provider moves a price, the change is recorded here with the before and after.
Last verified: 2026-06-11 · 19 models · 5 providers · Updated daily
OpenAI
| Model | Input / 1M | Output / 1M | Context | Notes |
|---|---|---|---|---|
| gpt-5.5-pro | $30.00 | $180.00 | — | |
| gpt-5.5 | $5.00 | $30.00 | — | Short-context rate; long-context tier is $10 in / $45 out |
| gpt-5.4 | $2.5 | $15.00 | — | Short-context rate; long-context tier is $5 in / $22.50 out |
| gpt-5.3-codex | $1.75 | $14.00 | — | |
| gpt-5.4-mini | $0.75 | $4.5 | — | |
| gpt-5.4-nano | $0.2 | $1.25 | — |
Anthropic
| Model | Input / 1M | Output / 1M | Context | Notes |
|---|---|---|---|---|
| claude-fable-5 | $10.00 | $50.00 | 1M | |
| claude-opus-4-8 | $5.00 | $25.00 | 1M | |
| claude-opus-4-7 | $5.00 | $25.00 | 1M | |
| claude-sonnet-4-6 | $3.00 | $15.00 | 1M | |
| claude-haiku-4-5 | $1.00 | $5.00 | 200K |
| Model | Input / 1M | Output / 1M | Context | Notes |
|---|---|---|---|---|
| gemini-3.1-pro-preview | $2.00 | $12.00 | — | Rate for prompts up to 200K tokens; above 200K is $4 in / $18 out |
| gemini-3.5-flash | $1.5 | $9.00 | — | |
| gemini-3-flash-preview | $0.5 | $3.00 | — | Text/image/video input rate; audio input is $1.00 |
| gemini-3.1-flash-lite | $0.25 | $1.5 | — | Text/image/video input rate; audio input is $0.50 |
xAI
| Model | Input / 1M | Output / 1M | Context | Notes |
|---|---|---|---|---|
| grok-4.3 | $1.25 | $2.5 | 1M | |
| grok-build-0.1 | $1.00 | $2.00 | 256K |
DeepSeek
| Model | Input / 1M | Output / 1M | Context | Notes |
|---|---|---|---|---|
| deepseek-v4-pro | $0.435 | $0.87 | 1M | Cache-miss input rate; cache-hit input is $0.003625 |
| deepseek-v4-flash | $0.14 | $0.28 | 1M | Cache-miss input rate; cache-hit input is $0.0028 |
Price Change Log
Free JSON API
The full dataset is available as JSON, free to use with attribution and a link back to this page. CORS is enabled, so it works directly from the browser.
curl https://techfastforward.com/api/data/llm-pricingMethodology
Prices are read daily from each provider's official pricing page (linked on every model row) and stored only when they differ from the last recorded value, so the change log reflects real price moves rather than re-checks. All figures are standard pay-as-you-go API rates in USD per 1 million tokens; batch, cached-input, and volume discounts are noted per model where they materially differ.
Coverage focuses on text-generation flagship and workhorse models from OpenAI, Anthropic, Google, xAI, and DeepSeek. Spot an error or a missing model? Email editor@techfastforward.com.
Maintained by Jordan Hale, TechFastForward. Read our latest pricing coverage in Big Tech and Model Releases.