Usage-based pricing
Usage-based pricing is a model where the customer is charged for measured consumption such as tokens, API calls or minutes, whether or not that consumption produced a result.
Usage-based pricing is a billing model where the customer is charged for measured consumption of the vendor's service. The meter counts tokens, API calls, minutes, messages or compute time. The customer pays for what they consume, whether or not that consumption produced a result.
How it differs from other models
Seat pricing charges for access. Usage pricing charges for activity. Outcome-based pricing charges for results. The first two price the vendor's inputs; only the last prices the customer's value. Usage pricing sits in the middle: fairer than seats because idle customers pay less, but the customer still carries the risk of paying for activity that delivers nothing. A long, unhelpful conversation costs more than a short, successful one.
Where it breaks for AI agents
For AI agents the meter often points the wrong way. Voice agents billed per minute earn more when calls run long, which is the opposite of what the customer wants. And when a vendor makes its agent more efficient, measured usage drops and revenue shrinks as the product improves.
That is why agent vendors move toward hybrid models: a usage or platform floor for predictable revenue, plus outcome pricing for the results that matter. The market's current mix is covered in AI Agent Pricing in 2026.
Related
Resolution layer
A resolution layer is the billing infrastructure that turns an AI agent's event stream into settled, auditable billable outcomes: evaluating conditions, managing settlement and preserving evidence.
Attribution
Attribution is how a billing system decides how much of a confirmed result is credited to an AI agent, expressed as the billing unit that gets multiplied by the price.