Every major LLM provider publishes a per-token rate. None of them give you the real cost of running a workload. This calculator models the actual bill across 16 production models from Anthropic, OpenAI, Google, Meta, and DeepSeek — accounting for the things vendor pricing pages omit:
Tokenizer differences — Opus 4.7 produces 1.0–1.46× as many tokens as Opus 4.6 for the same English text
Prompt caching — cached input tokens cost 90% less, and stable prompt prefixes often account for 60–90% of production prompt tokens
Batch API — 50% off async workloads, ignored by most calculators
Paste your actual prompt. Set your real volume. See the real cost across all models in one comparison table.
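The three adjustments above combine into a simple cost model. A minimal sketch, where the rates, the 90% cache discount, and the 50% batch discount come from the text but the example volumes and the 80% cached share are illustrative assumptions:

```python
# Sketch of the cost model described above. Rates and discounts are
# taken from the text; the example workload numbers are assumptions.

def workload_cost(
    input_tokens: int,
    output_tokens: int,
    input_rate: float,             # $ per million input tokens
    output_rate: float,            # $ per million output tokens
    cached_fraction: float = 0.0,  # share of input tokens served from cache
    cache_discount: float = 0.9,   # cached input costs 90% less
    batch: bool = False,           # Batch API: 50% off the whole job
) -> float:
    cached = input_tokens * cached_fraction
    fresh = input_tokens - cached
    cost = (
        fresh * input_rate
        + cached * input_rate * (1 - cache_discount)
        + output_tokens * output_rate
    ) / 1_000_000
    return cost * (0.5 if batch else 1.0)

# Example: 10M input / 2M output tokens at $3/$15,
# with 80% of input cached and the Batch API enabled.
print(round(workload_cost(10_000_000, 2_000_000, 3, 15, 0.8, batch=True), 2))
# → 19.2  (vs $60.00 with no caching and no batching)
```

The same workload priced naively (input × rate + output × rate) comes to $60, so the discounts the vendor pricing pages omit cut this bill by more than two thirds.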
Frequently asked questions
What's the cheapest LLM?
DeepSeek V3.2 at $0.14/$0.28 per million tokens — but it's a smaller model. Among flagships, Claude Sonnet 4.6 at $3/$15 offers the best capability-per-dollar.
What's the most expensive LLM?
GPT-5.5 at $5/$30 per million tokens (output dominates the bill), followed by Claude Opus 4.7 at $5/$25.
How does prompt caching change the math?
Dramatically. For production workloads with stable system prompts (60–90% of tokens cacheable), caching can cut input costs by 80%+.
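The arithmetic behind that claim, as a quick sketch. The 90% discount comes from the text; the $3/M rate and the 90%-cacheable share (the top of the stated 60–90% range) are example assumptions:

```python
# Illustrative caching arithmetic. Rate and cacheable share are
# example assumptions; the 90% cache discount is from the text.
rate = 3.0        # $ per million input tokens (example flagship rate)
cacheable = 0.90  # share of input tokens served from the cache
discount = 0.90   # cached tokens cost 90% less

full_price = rate  # $/M input with no caching
with_cache = rate * ((1 - cacheable) + cacheable * (1 - discount))
reduction = 1 - with_cache / full_price
print(f"${with_cache:.2f}/M vs ${full_price:.2f}/M -> {reduction:.0%} cheaper")
```

At 90% cacheable you pay full price on 10% of tokens and one-tenth price on the rest, an effective 81% cut; at the low end of the range (60% cacheable) the cut is still 54%.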
What's the Batch API?
OpenAI, Anthropic, and Google all offer 50% off for async batch processing (results within 24 hours). Massively underused.
Which LLM pricing calculator is most accurate?
We use the vendors' official tokenizers (tiktoken for OpenAI, Anthropic's count_tokens API, Google's countTokens API) instead of character approximations. For Llama and DeepSeek, no free count API exists, so we use calibrated character approximations and flag those as estimated.
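For models without a free counting API, the fallback described above can be sketched in a few lines. The chars-per-token ratio here is a hypothetical calibration constant for English prose, not a vendor figure:

```python
# Sketch of the calibrated character-approximation fallback mentioned
# above. CHARS_PER_TOKEN is an assumed calibration, not a vendor value;
# results from this path should be flagged as estimates.
import math

CHARS_PER_TOKEN = 3.8  # assumed calibration for English prose

def estimate_tokens(text: str) -> int:
    """Rough token count from character length; flag as estimated."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)

print(estimate_tokens("Every major LLM provider publishes a per-token rate."))
# → 14
```

A character-based estimate drifts for code, non-English text, or heavy punctuation, which is exactly why counts from this path are flagged as estimated rather than exact.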