Definition
Rent powerful AI by the call: model owners sell access through APIs, so any business can add intelligence and pay only for what it uses.
At a glance
- You call a ready-made model (GPT, Claude) instead of building one — no AI team needed.
- Billing is per use, measured in tokens. Claude Opus 4.7 runs ~$5 per million words in, ~$25 out.
- A few providers dominate: by mid-2025 enterprise use, Anthropic ~32%, OpenAI ~25%, Google ~20%.
- Usage-based pricing scales with demand, so costs can spike fast.
How it works
A handful of providers do the costly work of building the model, then expose it through an API your software calls over the internet. Your app sends a request, gets an answer, and pays for that call[4]. Pricing is per token, with output costing more[2]. This let API spending more than double in under a year, to ~$8.4B by mid-2025.
Why it matters
Caching (~90% off) and batch jobs (~50% off) help, but spend still climbs as adoption grows[2]. The moat is rarely the model everyone can rent — it’s what you wrap around it, the way Aircall built a big business on Twilio[6]. Enterprises seldom switch vendors but upgrade fast when a stronger model ships[1].
Bottom line
Intelligence becomes a utility you rent by the call — winners pick the right provider, watch token spend, and build a real product around the model.
References
- 2025 Mid-Year LLM Market Update: Foundation Model Landscape + Economics. Menlo Ventures menlovc.com
- Anthropic API Pricing in 2026: Complete Guide — Models, Caching, Batch & Optimization. Finout www.finout.io
- AI Inference Market Size, Share & Growth, 2025 To 2030. MarketsandMarkets www.marketsandmarkets.com
- The API Economy in the Age of AI: State of the Market Report 2025. apidays www.apidays.global
- The AI Token Pricing Crisis Behind OpenAI and Anthropic's Revenue Race. Investing.com www.investing.com
- The misunderstood AI Wrapper Opportunity — Alvaro Vargas. Medium medium.com
Comments
Questions, corrections, and links welcome. Be specific and civil.