LLM API price history
Review recent price changes detected by the static pricing pipeline before you route production traffic. Each entry records the model, provider, changed field, previous value, new value, and the timestamp at which the change log was generated.
Recent pricing changes
The change log is generated from the latest OpenRouter snapshot and curated official provider rows. An empty log means the current baseline has no appended price changes yet, not that provider prices are permanent.
| Model / Provider | Change | Timestamp |
|---|---|---|
| MythoMax 13B via NextBit (nextbit) | Output added at $0.060 · review | 2026-06-08 |
| MythoMax 13B via NextBit (nextbit) | Input added at $0.060 · review | 2026-06-08 |
| ReMM SLERP 13B via NextBit (nextbit) | Output removed from $0.65 · review | 2026-06-08 |
| ReMM SLERP 13B via NextBit (nextbit) | Input removed from $0.45 · review | 2026-06-08 |
| Meta: Llama 3 8B Instruct via Novita (novita) | Output removed from $0.040 · review | 2026-06-08 |
| Meta: Llama 3 8B Instruct via Novita (novita) | Input removed from $0.040 · review | 2026-06-08 |
| Meta: Llama 3 8B Instruct via OpenRouter (openrouter) | Output $0.040 -> $0.14 · +250.0% · review | 2026-06-08 |
| Meta: Llama 3 8B Instruct via OpenRouter (openrouter) | Input $0.040 -> $0.14 · +250.0% · review | 2026-06-08 |
| OpenAI: GPT-4o-mini via Azure (azure) | Cached tokens added at $0.075 · review | 2026-06-08 |
| OpenAI: GPT-4o-mini via Azure (azure) | Output added at $0.60 · review | 2026-06-08 |
| OpenAI: GPT-4o-mini via Azure (azure) | Input added at $0.15 · review | 2026-06-08 |
| Meta: Llama 3.1 8B Instruct via Cloudflare (cloudflare) | Output removed from $0.29 · review | 2026-06-08 |
| Meta: Llama 3.1 8B Instruct via Cloudflare (cloudflare) | Input removed from $0.15 · review | 2026-06-08 |
| Meta: Llama 3.1 8B Instruct via DeepInfra (deepinfra) | Output $0.050 -> $0.030 · -40.0% | 2026-06-08 |
| Meta: Llama 3.1 8B Instruct via OpenRouter (openrouter) | Output $0.050 -> $0.030 · -40.0% | 2026-06-08 |
| Nous: Hermes 3 70B Instruct via DeepInfra (deepinfra) | Output $0.30 -> $0.70 · +133.3% · review | 2026-06-08 |
| Nous: Hermes 3 70B Instruct via DeepInfra (deepinfra) | Input $0.30 -> $0.70 · +133.3% · review | 2026-06-08 |
| Nous: Hermes 3 70B Instruct via OpenRouter (openrouter) | Output $0.30 -> $0.70 · +133.3% · review | 2026-06-08 |
| Nous: Hermes 3 70B Instruct via OpenRouter (openrouter) | Input $0.30 -> $0.70 · +133.3% · review | 2026-06-08 |
| Qwen2.5 72B Instruct via Novita (novita) | Output added at $0.40 · review | 2026-06-08 |
| Qwen2.5 72B Instruct via Novita (novita) | Input added at $0.38 · review | 2026-06-08 |
| Meta: Llama 3.2 11B Vision Instruct via DeepInfra (deepinfra) | Output $0.24 -> $0.34 · +40.8% | 2026-06-08 |
| Meta: Llama 3.2 11B Vision Instruct via DeepInfra (deepinfra) | Input $0.24 -> $0.34 · +40.8% | 2026-06-08 |
| Meta: Llama 3.2 11B Vision Instruct via OpenRouter (openrouter) | Output $0.24 -> $0.34 · +40.8% | 2026-06-08 |
| Meta: Llama 3.2 11B Vision Instruct via OpenRouter (openrouter) | Input $0.24 -> $0.34 · +40.8% | 2026-06-08 |
| Qwen: Qwen2.5 7B Instruct via Phala (phala) | Output removed from $0.10 · review | 2026-06-08 |
| Qwen: Qwen2.5 7B Instruct via Phala (phala) | Input removed from $0.040 · review | 2026-06-08 |
| Amazon: Nova Micro 1.0 via Amazon Bedrock (amazon-bedrock-eu-west-1) | Output added at $0.14 · review | 2026-06-08 |
| Amazon: Nova Micro 1.0 via Amazon Bedrock (amazon-bedrock-eu-west-1) | Input added at $0.035 · review | 2026-06-08 |
| Meta: Llama 3.3 70B Instruct via Google (google-vertex) | Output added at $0.72 · review | 2026-06-08 |
| Meta: Llama 3.3 70B Instruct via Google (google-vertex) | Input added at $0.72 · review | 2026-06-08 |
| Meta: Llama 3.3 70B Instruct via Novita (novita) | Output removed from $0.40 · review | 2026-06-08 |
| Meta: Llama 3.3 70B Instruct via Novita (novita) | Input removed from $0.14 · review | 2026-06-08 |
| DeepSeek: R1 via Azure (azure) | Output added at $5.94 · review | 2026-06-08 |
| DeepSeek: R1 via Azure (azure) | Input added at $1.49 · review | 2026-06-08 |
| Google: Gemma 3 12B via DeepInfra (deepinfra) | Output $0.13 -> $0.15 · +15.4% | 2026-06-08 |
| Google: Gemma 3 12B via DeepInfra (deepinfra) | Input $0.040 -> $0.050 · +25.0% | 2026-06-08 |
| Google: Gemma 3 12B via OpenRouter (openrouter) | Output $0.13 -> $0.15 · +15.4% | 2026-06-08 |
| Google: Gemma 3 12B via OpenRouter (openrouter) | Input $0.040 -> $0.050 · +25.0% | 2026-06-08 |
| Google: Gemma 3 4B via DeepInfra (deepinfra) | Output $0.080 -> $0.10 · +25.0% | 2026-06-08 |
| Google: Gemma 3 4B via DeepInfra (deepinfra) | Input $0.040 -> $0.050 · +25.0% | 2026-06-08 |
| Google: Gemma 3 4B via OpenRouter (openrouter) | Output $0.080 -> $0.10 · +25.0% | 2026-06-08 |
| Google: Gemma 3 4B via OpenRouter (openrouter) | Input $0.040 -> $0.050 · +25.0% | 2026-06-08 |
| DeepSeek: DeepSeek V3 0324 via GMICloud (gmicloud) | Cached tokens removed from $0.088 · review | 2026-06-08 |
| DeepSeek: DeepSeek V3 0324 via GMICloud (gmicloud) | Output removed from $0.91 · review | 2026-06-08 |
| DeepSeek: DeepSeek V3 0324 via GMICloud (gmicloud) | Input removed from $0.23 · review | 2026-06-08 |
| DeepSeek: DeepSeek V3 0324 via ModelRun (modelrun) | Cached tokens removed from $0.15 · review | 2026-06-08 |
| DeepSeek: DeepSeek V3 0324 via ModelRun (modelrun) | Output removed from $0.80 · review | 2026-06-08 |
| DeepSeek: DeepSeek V3 0324 via ModelRun (modelrun) | Input removed from $0.22 · review | 2026-06-08 |
| Meta: Llama 4 Scout via DeepInfra (deepinfra) | Input $0.080 -> $0.10 · +25.0% | 2026-06-08 |
| Meta: Llama 4 Scout via OpenRouter (openrouter) | Input $0.080 -> $0.10 · +25.0% | 2026-06-08 |
| Qwen: Qwen3 14B via NextBit (nextbit) | Output added at $0.24 · review | 2026-06-08 |
| Qwen: Qwen3 14B via NextBit (nextbit) | Input added at $0.10 · review | 2026-06-08 |
| Qwen: Qwen3 30B A3B via DeepInfra (deepinfra) | Output $0.45 -> $0.50 · +11.1% | 2026-06-08 |
| Qwen: Qwen3 30B A3B via DeepInfra (deepinfra) | Input $0.090 -> $0.12 · +33.3% | 2026-06-08 |
| Qwen: Qwen3 30B A3B via Novita (novita) | Output removed from $0.45 · review | 2026-06-08 |
| Qwen: Qwen3 30B A3B via Novita (novita) | Input removed from $0.090 · review | 2026-06-08 |
| Qwen: Qwen3 30B A3B via OpenRouter (openrouter) | Output $0.45 -> $0.50 · +11.1% | 2026-06-08 |
| Qwen: Qwen3 30B A3B via OpenRouter (openrouter) | Input $0.090 -> $0.12 · +33.3% | 2026-06-08 |
| Google: Gemini 2.5 Pro via Google (google-vertex-eu) | Cache write added at $0.38 · review | 2026-06-08 |
| Google: Gemini 2.5 Pro via Google (google-vertex-eu) | Cached tokens added at $0.13 · review | 2026-06-08 |
| Google: Gemini 2.5 Pro via Google (google-vertex-eu) | Output added at $10.00 · review | 2026-06-08 |
| Google: Gemini 2.5 Pro via Google (google-vertex-eu) | Input added at $1.25 · review | 2026-06-08 |
| Google: Gemini 2.5 Flash via Google (google-vertex) | Cache write added at $0.083 · review | 2026-06-08 |
| Google: Gemini 2.5 Flash via Google (google-vertex) | Cached tokens added at $0.030 · review | 2026-06-08 |
| Google: Gemini 2.5 Flash via Google (google-vertex) | Output added at $2.50 · review | 2026-06-08 |
| Google: Gemini 2.5 Flash via Google (google-vertex) | Input added at $0.30 · review | 2026-06-08 |
| Qwen: Qwen3 235B A22B Instruct 2507 via Together AI (together) | Output removed from $0.60 · review | 2026-06-08 |
| Qwen: Qwen3 235B A22B Instruct 2507 via Together AI (together) | Input removed from $0.20 · review | 2026-06-08 |
| Qwen: Qwen3 235B A22B Instruct 2507 via DeepInfra (deepinfra) | Input $0.071 -> $0.090 · +26.8% | 2026-06-08 |
| Qwen: Qwen3 235B A22B Instruct 2507 via OpenRouter (openrouter) | Input $0.071 -> $0.090 · +26.8% | 2026-06-08 |
| Qwen: Qwen3 Coder 480B A35B via Together AI (together) | Output removed from $2.00 · review | 2026-06-08 |
| Qwen: Qwen3 Coder 480B A35B via Together AI (together) | Input removed from $2.00 · review | 2026-06-08 |
| Qwen: Qwen3 Coder 480B A35B (free) via Venice (venice-beta) | Output removed from $0 · review | 2026-06-08 |
| Qwen: Qwen3 Coder 480B A35B (free) via Venice (venice-beta) | Input removed from $0 · review | 2026-06-08 |
| Z.ai: GLM 4.5 Air via 硅基流动 (siliconflow) | Output added at $0.86 · review | 2026-06-08 |
| Z.ai: GLM 4.5 Air via 硅基流动 (siliconflow) | Input added at $0.14 · review | 2026-06-08 |
| Qwen: Qwen3 30B A3B Instruct 2507 via StreamLake (streamlake) | Output $0.17 -> $0.19 · +12.5% | 2026-06-08 |
| Qwen: Qwen3 30B A3B Instruct 2507 via StreamLake (streamlake) | Input $0.043 -> $0.048 · +12.5% | 2026-06-08 |
| Qwen: Qwen3 30B A3B Instruct 2507 via OpenRouter (openrouter) | Output $0.17 -> $0.19 · +12.5% | 2026-06-08 |
| Qwen: Qwen3 30B A3B Instruct 2507 via OpenRouter (openrouter) | Input $0.043 -> $0.048 · +12.5% | 2026-06-08 |
| Qwen: Qwen3 Coder 30B A3B Instruct via 硅基流动 (siliconflow) | Output removed from $0.28 · review | 2026-06-08 |
| Qwen: Qwen3 Coder 30B A3B Instruct via 硅基流动 (siliconflow) | Input removed from $0.070 · review | 2026-06-08 |
| OpenAI: gpt-oss-20b via Amazon Bedrock (amazon-bedrock) | Output added at $0.15 · review | 2026-06-08 |
| OpenAI: gpt-oss-20b via Amazon Bedrock (amazon-bedrock) | Input added at $0.070 · review | 2026-06-08 |
| OpenAI: gpt-oss-20b via 硅基流动 (siliconflow) | Output added at $0.18 · review | 2026-06-08 |
| OpenAI: gpt-oss-20b via 硅基流动 (siliconflow) | Input added at $0.040 · review | 2026-06-08 |
| OpenAI: gpt-oss-20b via Phala (phala) | Output removed from $0.15 · review | 2026-06-08 |
| OpenAI: gpt-oss-20b via Phala (phala) | Input removed from $0.040 · review | 2026-06-08 |
| OpenAI: gpt-oss-20b via DekaLLM (dekallm) | Output removed from $0.14 · review | 2026-06-08 |
| OpenAI: gpt-oss-20b via DekaLLM (dekallm) | Input removed from $0.029 · review | 2026-06-08 |
| OpenAI: gpt-oss-120b via SambaNova (sambanova) | Output added at $0.95 · review | 2026-06-08 |
| OpenAI: gpt-oss-120b via SambaNova (sambanova) | Input added at $0.14 · review | 2026-06-08 |
| OpenAI: gpt-oss-120b via Ambient (ambient) | Cached tokens removed from $0.075 · review | 2026-06-08 |
| OpenAI: gpt-oss-120b via Ambient (ambient) | Output removed from $0.60 · review | 2026-06-08 |
| OpenAI: gpt-oss-120b via Ambient (ambient) | Input removed from $0.15 · review | 2026-06-08 |
| OpenAI: gpt-oss-120b via DigitalOcean (digitalocean) | Output $0.70 -> $0.53 · -25.0% | 2026-06-08 |
| OpenAI: gpt-oss-120b via DigitalOcean (digitalocean) | Input $0.10 -> $0.075 · -25.0% | 2026-06-08 |
| OpenAI: GPT-5 via OpenAI (openai-default) | Cached tokens added at $0.13 · review | 2026-06-08 |
| OpenAI: GPT-5 via OpenAI (openai-default) | Output added at $10.00 · review | 2026-06-08 |
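If you want to process the Change column programmatically, the cell text follows three visible patterns: "added at", "removed from", and "old -> new". The Python sketch below parses one cell and recomputes the percentage. The `Change` class and the `parse_change` and `percent_change` helpers are hypothetical names for illustration, not part of the pricing pipeline, and the parser assumes only the cell shapes that appear in the table above.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class Change:
    field: str              # e.g. "Input", "Output", "Cached tokens"
    kind: str               # "added", "removed", or "changed"
    old: Optional[float]    # previous price, None for "added"
    new: Optional[float]    # new price, None for "removed"
    needs_review: bool      # True when the cell carries the "review" flag


def parse_change(cell: str) -> Change:
    """Parse one Change cell (hypothetical helper, not the pipeline's code)."""
    parts = [p.strip() for p in cell.split("·")]
    head, flags = parts[0], parts[1:]
    review = "review" in flags

    m = re.fullmatch(r"(.+?) added at \$([\d.]+)", head)
    if m:
        return Change(m.group(1), "added", None, float(m.group(2)), review)

    m = re.fullmatch(r"(.+?) removed from \$([\d.]+)", head)
    if m:
        return Change(m.group(1), "removed", float(m.group(2)), None, review)

    m = re.fullmatch(r"(.+?) \$([\d.]+) -> \$([\d.]+)", head)
    if m:
        return Change(m.group(1), "changed", float(m.group(2)), float(m.group(3)), review)

    raise ValueError(f"unrecognized change cell: {cell!r}")


def percent_change(old: float, new: float) -> Optional[float]:
    """Relative change in percent; None when the old price is zero."""
    if old == 0:
        return None
    return (new - old) / old * 100


change = parse_change("Output $0.040 -> $0.14 · +250.0% · review")
recomputed = percent_change(change.old, change.new)
```

Percentages recomputed from the displayed prices will not always match the logged ones. For example, $0.24 -> $0.34 works out to about +41.7% from the rounded figures while the log shows +40.8%, presumably because the log is computed from unrounded prices.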
Price history is a routing signal, not a benchmark
Price movement is most useful when combined with model fit, context window, output volume, provider region, and billing constraints. A cheaper input token rate can still cost more overall when your workload has long outputs or heavy cached prompts, and a provider requirement can make another route operationally simpler.
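To see how output volume can outweigh a cheaper input rate, here is a minimal Python sketch. It assumes prices are quoted in USD per one million tokens, which this page does not state explicitly, and the two routes and the workload are made-up numbers rather than rows from the log. `monthly_cost` is a hypothetical helper.

```python
from typing import Optional


def monthly_cost(
    input_tokens: int,
    output_tokens: int,
    input_price: float,
    output_price: float,
    cached_tokens: int = 0,
    cached_price: Optional[float] = None,
) -> float:
    """Estimated spend in USD, assuming prices are per 1M tokens.

    cached_tokens is the part of input_tokens billed at cached_price;
    when cached_price is None, cached tokens are billed at input_price.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    rate_for_cached = input_price if cached_price is None else cached_price
    uncached = input_tokens - cached_tokens
    total = (
        uncached * input_price
        + cached_tokens * rate_for_cached
        + output_tokens * output_price
    )
    return total / 1_000_000


# Hypothetical workload: 100M input tokens and 400M output tokens per month.
workload = dict(input_tokens=100_000_000, output_tokens=400_000_000)

# Hypothetical routes: A has the cheaper input rate, B the cheaper output rate.
route_a = monthly_cost(**workload, input_price=0.10, output_price=0.50)
route_b = monthly_cost(**workload, input_price=0.12, output_price=0.30)

print(f"route A: ${route_a:.2f}  route B: ${route_b:.2f}")
```

With this output-heavy workload, route B comes out cheaper even though its input rate is higher, which is the pattern described above.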
Start with the history log to spot fresh changes, open individual model pages for source URLs and confidence labels, then use the cost calculator to estimate monthly spend from your own token volume.
A logged change can mean a provider changed the input price, output price, cached-token rate, cache-write rate, or supporting metadata in the latest generated dataset. Treat each entry as a prompt to verify the provider source before changing production routing rules, especially when a row is marked low confidence or when the model family has multiple provider aliases.
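One way to turn that advice into a checklist is to group log entries by model and queue up any model that carries a review flag or appears under more than one provider. The Python sketch below does this with a few rows taken from the log above. `verification_queue` and the tuple layout are assumptions for illustration, and the confidence labels mentioned here live on the model pages, so they are not modeled.

```python
from collections import defaultdict
from typing import Iterable, List, Tuple

# (model, provider slug, marked "review" in the log)
Entry = Tuple[str, str, bool]

ENTRIES: List[Entry] = [
    ("Meta: Llama 3 8B Instruct", "novita", True),
    ("Meta: Llama 3 8B Instruct", "openrouter", True),
    ("Meta: Llama 4 Scout", "deepinfra", False),
    ("Meta: Llama 4 Scout", "openrouter", False),
    ("DeepSeek: R1", "azure", True),
    ("OpenAI: gpt-oss-120b", "digitalocean", False),
]


def verification_queue(entries: Iterable[Entry]) -> List[Tuple[str, List[str], List[str]]]:
    """Return (model, providers, reasons) for models worth verifying first."""
    providers = defaultdict(set)
    flagged = set()
    for model, provider, review in entries:
        providers[model].add(provider)
        if review:
            flagged.add(model)

    queue = []
    for model, slugs in providers.items():
        reasons = []
        if model in flagged:
            reasons.append("marked review")
        if len(slugs) > 1:
            reasons.append("multiple providers")
        if reasons:
            queue.append((model, sorted(slugs), reasons))
    return queue


for model, slugs, reasons in verification_queue(ENTRIES):
    print(f"{model}: check {', '.join(slugs)} ({'; '.join(reasons)})")
```

Models with a single provider and no review flag, such as the gpt-oss-120b DigitalOcean row in this sample, drop out of the queue.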
For launch planning, the most important pattern is not a single cheap row. Look for stable provider coverage, recent observation dates, and a clear fallback route. If a model appears through OpenRouter and an official endpoint, compare both pages, then keep a note of the source timestamp used for your estimate.
Before a production release, pair this page with the sitemap and robots checks. The history page should stay crawlable, the playground should remain out of the sitemap, and every model URL in the current sitemap should return a clean static page. That keeps price updates discoverable without exposing experimental surfaces.
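Two of these checks can be sketched offline in Python with the standard library. The paths `/history` and `/playground`, the `example.com` host, and the `check_release` helper are placeholders, not this site's real URLs or tooling. The sketch does not fetch model URLs, so the "clean static page" check still needs a separate HTTP pass.

```python
import xml.etree.ElementTree as ET
from typing import List, Set
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_paths(sitemap_xml: str) -> Set[str]:
    """Collect the URL paths listed in a sitemap document."""
    root = ET.fromstring(sitemap_xml.strip())
    return {
        urlparse(loc.text.strip()).path
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    }


def check_release(
    sitemap_xml: str,
    robots_txt: str,
    history_path: str = "/history",        # placeholder path
    playground_path: str = "/playground",  # placeholder path
) -> List[str]:
    """Return a list of problems; an empty list means both checks passed."""
    problems = []

    # The playground should stay out of the sitemap.
    if any(p.startswith(playground_path) for p in sitemap_paths(sitemap_xml)):
        problems.append("playground is listed in the sitemap")

    # The history page should stay crawlable for generic crawlers.
    robots = RobotFileParser()
    robots.parse(robots_txt.splitlines())
    if not robots.can_fetch("*", history_path):
        problems.append("robots.txt blocks the history page")

    return problems


GOOD_SITEMAP = """
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/history</loc></url>
  <url><loc>https://example.com/models/example-model</loc></url>
</urlset>
"""
GOOD_ROBOTS = "User-agent: *\nAllow: /\n"

print(check_release(GOOD_SITEMAP, GOOD_ROBOTS))
```

Running a check like this in CI before each release gives an early warning if a sitemap or robots change would hide the history page or expose the playground.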