Question 1

How do AI subscription limits work?

Accepted Answer

Across the major AI vendors in 2026 there are four distinct metering models: (1) vendor-relative usage buckets that describe limits only as 'more', '5x', or '20x' with no published number (Claude, ChatGPT, Mistral, Perplexity, Grok); (2) time-windowed refresh caps that reset on a clock, such as Gemini's 5-hour refresh up to a weekly ceiling; (3) per-user credit budgets where the price buys a fixed dollar or credit allowance and overage is billed per use (Cursor, GitHub Copilot, Replit, v0); and (4) workspace-pooled credits shared across all users in a workspace rather than per seat (Lovable).

Question 2

Why don't AI companies publish exact usage limits?

Accepted Answer

Most don't, for two reasons: compute-based limits let them adjust capacity without changing the advertised plan, and a vague 'more usage' claim is easier to maintain than a hard number that customers will hold them to. Among the major vendors, Google's Gemini is the exception: it publishes the mechanics (a 5-hour refresh window, a weekly cap, and concrete Flow-credit counts). The credit-budget vendors publish a dollar figure but not how far it stretches.

Question 3

What is the difference between a usage bucket and a credit budget?

Accepted Answer

A usage bucket is a relative allowance ('5x more than Free') that the vendor meters internally and you cannot see; you find the ceiling by hitting it. A credit budget is an explicit dollar or credit amount included each month (for example GitHub Copilot's $15/$70/$200 monthly credits, or Replit and v0's included Agent credits), after which you are billed per use. A budget is more transparent about the amount but ties the limit to model pricing, so the same budget buys less on a more expensive model.
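To make the last point concrete, here is a minimal arithmetic sketch of how far a fixed budget stretches at two different model rates. The per-token rates and the request size are hypothetical numbers chosen for illustration, not any vendor's published pricing, and the function name is made up for this example. Amounts are kept in whole cents so the division stays exact.

```python
def requests_per_budget(budget_cents: int,
                        tokens_per_request: int,
                        cents_per_million_tokens: int) -> int:
    """Whole requests a monthly budget covers before overage billing starts.

    Integer arithmetic avoids floating-point rounding on money.
    """
    return (budget_cents * 1_000_000) // (tokens_per_request * cents_per_million_tokens)


# Assumptions (hypothetical): a $15 monthly budget and 20,000 tokens per request.
BUDGET_CENTS = 1500
TOKENS_PER_REQUEST = 20_000

cheap_model = requests_per_budget(BUDGET_CENTS, TOKENS_PER_REQUEST, 100)    # $1 per million tokens
pricey_model = requests_per_budget(BUDGET_CENTS, TOKENS_PER_REQUEST, 1000)  # $10 per million tokens

print(cheap_model)   # 750
print(pricey_model)  # 75
```

Under these assumed rates the same $15 covers ten times fewer requests on the more expensive model, which is why a published dollar figure alone does not tell you how much work the plan buys.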

Question 4

What are workspace-pooled credits?

Accepted Answer

Workspace-pooled credits are a single monthly credit allowance shared across every user in a workspace, rather than allocated per seat. Lovable uses this model: its Pro and Business tiers both include the same 100 monthly credits for the whole workspace, so the higher tier buys governance and controls rather than more capacity. This is the rarest of the four models in 2026.
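The pooling behaviour can be sketched in a few lines. This is an illustration of the shared-allowance idea only, not Lovable's actual accounting: the class name, the user names, and the per-action credit costs are all hypothetical.

```python
from collections import defaultdict


class WorkspacePool:
    """One monthly credit allowance shared by every user in a workspace."""

    def __init__(self, monthly_credits: int) -> None:
        self.remaining = monthly_credits
        self.spent_by_user = defaultdict(int)  # tracked for reporting only

    def spend(self, user: str, credits: int) -> bool:
        """Deduct from the shared pool; refuse if the pool cannot cover it."""
        if credits > self.remaining:
            return False
        self.remaining -= credits
        self.spent_by_user[user] += credits
        return True


pool = WorkspacePool(monthly_credits=100)  # the 100-credit figure quoted above
print(pool.spend("alice", 90))  # True
print(pool.spend("bob", 20))    # False: only 10 credits are left for everyone
print(pool.remaining)           # 10
```

Because there is no per-seat allocation, the second user is refused even though they have spent nothing themselves.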

Question 5

Which AI plans tell you their actual limits?

Accepted Answer

Almost none give a single hard number for general usage. Google's Gemini publishes the most: a 5-hour refresh window, a weekly cap, and Flow-credit counts (200/1,000/10,000+). The credit-budget vendors (GitHub Copilot, Replit, v0, Cursor) publish the dollar/credit amount included. Everyone else describes limits only relative to a lower tier. QuotaLedger records the exact wording each vendor uses and the date it was checked.

Model	How the cap works	Who uses it	What you can't know
1. Vendor-relative usage bucket	An internal allowance described only as "more", "5x", or "20x" a lower tier. Metered by the vendor; invisible to you.	Claude, ChatGPT, Mistral/Vibe, Perplexity, Grok	The actual number. You discover the ceiling by hitting it.
2. Time-windowed refresh cap	Usage resets on a clock (e.g. every 5 hours) up to a weekly ceiling. The shape of the limit is published even when the exact units aren't.	Gemini (5-hour refresh + weekly cap), Devin (daily + weekly refresh)	The compute units per window, usually. Gemini adds hard Flow-credit counts.
3. Per-user credit budget	The price buys a fixed dollar/credit allowance each month; once spent, you're billed per use (overage).	Cursor, GitHub Copilot ($15/$70/$200 credits), Replit, v0, Bolt (token budget)	How far the budget stretches — it depends on which model you run and its per-token rate.
4. Workspace-pooled credits	A single monthly credit pool shared across all users in a workspace, not allocated per seat.	Lovable (Pro & Business both 100 credits)	Per-person usage — the pool drains collectively, so a busy teammate can spend yours.
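
Model 2 in the table, the time-windowed refresh cap, is the one whose shape is easiest to express in code. The sketch below assumes fixed windows that restart on the first request after the previous window expires, and it counts abstract "units"; both are assumptions made for illustration, since no vendor documents its internal implementation here. The class name and the allowance numbers are hypothetical.

```python
from dataclasses import dataclass

WINDOW_SECONDS = 5 * 60 * 60      # 5-hour refresh window
WEEK_SECONDS = 7 * 24 * 60 * 60   # weekly ceiling period


@dataclass
class RefreshCap:
    per_window: int               # hypothetical units allowed per 5-hour window
    per_week: int                 # hypothetical units allowed per week
    window_start: float = 0.0
    week_start: float = 0.0
    used_in_window: int = 0
    used_in_week: int = 0

    def try_use(self, now: float, units: int = 1) -> bool:
        """Return True and record the usage if both caps allow it."""
        # Reset each counter once its period has elapsed.
        if now - self.week_start >= WEEK_SECONDS:
            self.week_start, self.used_in_week = now, 0
        if now - self.window_start >= WINDOW_SECONDS:
            self.window_start, self.used_in_window = now, 0
        # Both the short window and the weekly ceiling must have room.
        if self.used_in_window + units > self.per_window:
            return False
        if self.used_in_week + units > self.per_week:
            return False
        self.used_in_window += units
        self.used_in_week += units
        return True


cap = RefreshCap(per_window=2, per_week=3)
print(cap.try_use(now=0))                   # True
print(cap.try_use(now=1))                   # True
print(cap.try_use(now=2))                   # False: window exhausted
print(cap.try_use(now=WINDOW_SECONDS))      # True: window refreshed
print(cap.try_use(now=2 * WINDOW_SECONDS))  # False: weekly ceiling reached
```

The last call shows why the weekly ceiling matters: the 5-hour window has refreshed, but the request is still refused until the week rolls over.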

How AI subscription limits actually work: the 4 metering models (2026)

The four models at a glance

Model 1: the vendor-relative usage bucket

Model 2: the time-windowed refresh cap

Model 3: the per-user credit budget

Model 4: workspace-pooled credits

So how do you compare plans?

FAQ