Open Models ROI Calculator

How the math works

No black box. Here is exactly how the calculator turns your monthly spend into a projected savings figure on Fireworks.

Step 1

Each model is billed on input, cached input, and output tokens (per 1M). The cache hit rate comes from your workload type.

cost/req = (in × (1 − cache) × inPrice + in × cache × cachedPrice + out × outPrice) ÷ 1,000,000

Step 2

We back out how many requests your current spend buys at the closed model's cost per request.

requests/mo = monthlySpend ÷ closedCostPerReq

Step 3

Hold that volume fixed, re-price it on the Fireworks model, and take the difference.

fwSpend = requests × fwCostPerReq savings = monthlySpend − fwSpend percent = savings ÷ monthlySpend × 100 annual = savings × 12

Assumption	Value	Why
Cache hit rate — Chat & assistants	50%	Long shared system prompt / context reused across turns
Cache hit rate — Document processing	10%	Mostly unique input per request, little reuse
Cache hit rate — Agentic workflows	80%	Many chained calls share a growing, stable prefix
Fireworks pricing	Standard path	Published per-token serverless pricing (input / cached / output per 1M)
Models without a cached rate	cached = input	No prompt-cache discount assumed for that model