Paste your prompt — instantly compare token costs across GPT-4o, Claude, Gemini, DeepSeek & more.
Estimate response length (short ~100, paragraph ~300, article ~2000)
| Provider | Model | Input / request | Output / request | Per Request | Monthly Cost↑ |
|---|---|---|---|---|---|
| CHEAPESTMistral | Nemo | $0.00 | $0.000010 | $0.000010 | $0.3000 |
| Groq | Llama 3.1 8B | $0.00 | $0.000040 | $0.000040 | $1.20 |
| AWS Bedrock | Nova Micro | $0.00 | $0.000070 | $0.000070 | $2.10 |
| AWS Bedrock | Nova Lite | $0.00 | $0.000120 | $0.000120 | $3.60 |
| DeepSeek | V3 | $0.00 | $0.000140 | $0.000140 | $4.20 |
| Gemini Flash-Lite | $0.00 | $0.000150 | $0.000150 | $4.50 | |
| Gemini Flash | $0.00 | $0.000200 | $0.000200 | $6.00 | |
| xAI | Grok 2 Mini | $0.00 | $0.000200 | $0.000200 | $6.00 |
| OpenAI | GPT-4o mini | $0.00 | $0.000300 | $0.000300 | $9.00 |
| Mistral | Small | $0.00 | $0.000300 | $0.000300 | $9.00 |
| Cohere | Command R | $0.00 | $0.000300 | $0.000300 | $9.00 |
| AWS Bedrock | Llama 3.3 70B | $0.00 | $0.000360 | $0.000360 | $10.80 |
| Groq | Llama 3.3 70B | $0.00 | $0.000395 | $0.000395 | $11.85 |
| AWS Bedrock | Claude Haiku 3 | $0.00 | $0.000625 | $0.000625 | $18.75 |
| Mistral | Medium | $0.00 | $0.001000 | $0.001000 | $30.00 |
| DeepSeek | R1 | $0.00 | $0.001095 | $0.001095 | $32.85 |
| AWS Bedrock | Nova Pro | $0.00 | $0.001600 | $0.001600 | $48.00 |
| OpenAI | o1-mini | $0.00 | $0.002200 | $0.002200 | $66.00 |
| Anthropic | Claude Haiku 4.5 | $0.00 | $0.002500 | $0.002500 | $75.00 |
| Mistral | Large | $0.00 | $0.003000 | $0.003000 | $90.00 |
| OpenAI | GPT-5 | $0.00 | $0.005000 | $0.005000 | $150.00 |
| OpenAI | GPT-4o | $0.00 | $0.005000 | $0.005000 | $150.00 |
| xAI | Grok 2 | $0.00 | $0.005000 | $0.005000 | $150.00 |
| Cohere | Command R+ | $0.00 | $0.005000 | $0.005000 | $150.00 |
| AWS Bedrock | Mistral Large | $0.00 | $0.006000 | $0.006000 | $180.00 |
| Gemini Pro | $0.00 | $0.006000 | $0.006000 | $180.00 | |
| Anthropic | Claude Sonnet 4.6 | $0.00 | $0.007500 | $0.007500 | $225.00 |
| AWS Bedrock | Claude Sonnet 3.5 | $0.00 | $0.007500 | $0.007500 | $225.00 |
| OpenAI | o3 | $0.00 | $0.0200 | $0.0200 | $600.00 |
| OpenAI | o1 | $0.00 | $0.0300 | $0.0300 | $900.00 |
| Anthropic | Claude Opus 4.6 | $0.00 | $0.0375 | $0.0375 | $1,125.00 |
* Prices per 1M tokens as of March 2026. Verify with provider before production use.
This calculator uses the standard 4-characters-per-token approximation used by most LLM providers. For English prose, accuracy is typically ±10%. Code, JSON, and non-English text are more token-dense and may have 15–20% more tokens than estimated.
Output tokens require the model to generate each token sequentially, which is computationally more expensive than reading input tokens in parallel. Output costs are typically 3–10× higher than input costs.
For most customer support use cases with short messages, GPT-4o mini ($0.15/$0.60 per 1M), Claude Haiku 4.5 ($1.00/$5.00), or Gemini Flash-Lite ($0.075/$0.30) offer the best cost-to-quality ratio. Use the table above sorted by Total Cost to compare.
Prices are updated manually based on official provider pricing pages as of March 2026. LLM pricing changes frequently — always verify against the provider's official pricing page before making budget decisions.
Key strategies include: (1) Use a smaller model like GPT-4o mini or Gemini Flash for simpler tasks, (2) Enable Batch APIs for 50% off when latency is not critical, (3) Enable prompt caching to save up to 90% on repeated system prompts, (4) Compress your prompts, (5) Use structured output to reduce verbosity.
AWS Bedrock lets you run foundation models (Claude, Llama, Mistral, etc.) within AWS's infrastructure. For Claude models, prices typically match Anthropic direct. For Amazon-native models like Nova Micro, Bedrock can be 80–90% cheaper than GPT-4o.
Multiply (input tokens + output tokens) per request × requests per day × 30 × price per 1M tokens. This calculator does that automatically — just paste a sample prompt and set your expected output length and daily volume.
It depends on the model tier. GPT-4o mini ($0.15 input / $0.60 output per 1M tokens) is cheaper than Claude Sonnet 4.5 ($3/$15 per 1M tokens), but Claude Haiku 4.5 ($1/$5) sits in between. Use this calculator with your actual token counts to compare.
Ahmedabad
B-714, K P Epitome, near Dav International School, Makarba, Ahmedabad, Gujarat 380051
+91 99747 29554
Mumbai
C-20, G Block, WeWork, Enam Sambhav, Bandra-Kurla Complex, Mumbai, Maharashtra 400051
+91 99747 29554
Stockholm
Bäverbäcksgränd 10 12462 Bandhagen, Stockholm, Sweden.
+46 72789 9039

Malaysia
Level 23-1, Premier Suite One Mont Kiara, No 1, Jalan Kiara, Mont Kiara, 50480 Kuala Lumpur